METR Research Update: Algorithmic vs. Holistic Evaluation — LessWrong