Examples of Subtle Alignment Failures from Claude and Gemini — LessWrong