LESSWRONG
Casey Barkan

Posts


Wikitag Contributions

Comments

Do LLMs know what they're capable of? Why this matters for AI safety, and initial findings
Casey Barkan 3mo 42

Agreed that successful sandbagging will likely require Schelling coordination, and my guess is that this will be extremely difficult for models to pull off! Great to see that you're investigating this topic.

51 Do LLMs know what they're capable of? Why this matters for AI safety, and initial findings (Ω, 3mo, 5)
29 AI could cause a drop in GDP, even if markets are competitive and efficient (6mo, 0)