LESSWRONG

Maria Kapros
8210

Sequences

Posts

Wikitag Contributions

Comments

Provable AI Alignment (ProvAIA)
Weak-To-Strong Generalization (W2SG)
W2SG: Introduction
Maria Kapros, 1y

Wasn't aware of it. Thanks!

No wikitag contributions to display.
Feature-Based Analysis of Safety-Relevant Multi-Agent Behavior (9 points, 4mo, 0 comments)
W2SG: Introduction (2 points, 1y, 2 comments)