x
Reinforcement Learning Goal Misgeneralization: Can we guess what kind of goals are selected by default? — LessWrong