LESSWRONG

408
Matthew Rahtz
58000

Posts

Does Circuit Analysis Interpretability Scale? Evidence from Multiple Choice Capabilities in Chinchilla (44 karma, 2y, 3 comments)
Specification gaming: the flip side of AI ingenuity (69 karma, 5y, 9 comments)