LESSWRONG
LW

894
Maria Kapros
9210
Message
Dialogue
Subscribe

Sequences

Posts

Sorted by New

Wikitag Contributions

Comments

Sorted by
Newest
No wikitag contributions to display.
Provable AI Alignment (ProvAIA)
Weak-To-Strong Generalization (W2SG)
W2SG: Introduction
Maria Kapros2y10

Wasn't aware of it. Thanks!

Reply
10Feature-Based Analysis of Safety-Relevant Multi-Agent Behavior
6mo
0
2W2SG: Introduction
2y
2