Open problems in emergent misalignment — LessWrong