Eval Hacking: A new frontier in AI eval with a unified taxonomy — LessWrong