This website requires javascript to properly function. Consider activating javascript to get access to all site functionality.
LESSWRONG
LW
Login
2327
Matthew Rahtz — LessWrong
Matthew Rahtz
Posts
Sorted by New
Wikitag Contributions
Comments
Sorted by
Newest
44
Does Circuit Analysis Interpretability Scale? Evidence from Multiple Choice Capabilities in Chinchilla
Ω
2y
Ω
3
69
Specification gaming: the flip side of AI ingenuity
Ω
5y
Ω
9
Comments