LESSWRONG
LW

Michael Tontchev
7110
Message
Dialogue
Subscribe

Posts

Sorted by New

Wikitag Contributions

Comments

Sorted by
Newest
No wikitag contributions to display.
Model Organisms of Misalignment: The Case for a New Pillar of Alignment Research
Michael Tontchev2y20

What differentiates a "model organism" from just a normal AI model?

Reply
7Meta is hiring for LLM red teaming position
2y
0