x
More examples of goal misgeneralization — LessWrong