x
Alignment as Coherence: Predicting Deceptive Alignment as a Phase Transition — LessWrong