Defining and Characterising Reward Hacking — LessWrong