x
Defining and Characterising Reward Hacking — LessWrong