x
Misalignments and RL failure modes in the early stage of superintelligence — LessWrong