x
Goodhart's Law in Reinforcement Learning — LessWrong