x
Confusion around the term reward hacking — LessWrong