Reward hacking behavior can generalize across tasks — LessWrong