x
On Anthropic’s Sleeper Agents Paper — LessWrong