LESSWRONG
LW

697
Casey Barkan
69Ω19210
Message
Dialogue
Subscribe

Posts

Sorted by New

Wikitag Contributions

Comments

Sorted by
Newest
No wikitag contributions to display.
Do LLMs know what they're capable of? Why this matters for AI safety, and initial findings
Casey Barkan3mo42

Agreed that successful sandbagging will likely require Schelling coordination, and my guess is that this will be extremely difficult for models to pull off! Great to see that you're investigating this topic.

Reply
51Do LLMs know what they're capable of? Why this matters for AI safety, and initial findings
Ω
3mo
Ω
5
29AI could cause a drop in GDP, even if markets are competitive and efficient
6mo
0