x
Reproducing METR’s RE-bench Reward Hacking Results — LessWrong