Thoughts on gradient hacking — LessWrong