Convergent Linear Representations of Emergent Misalignment — LessWrong