x
Introducing Alignment Stress-Testing at Anthropic — LessWrong