x
Research note on negated reward hacking — LessWrong