x
Towards a Science of Evals for Sycophancy — LessWrong