x
Opus 4.6 Reasoning Doesn't Verbalize Alignment Faking, but Behavior Persists — LessWrong