x
Systematic Sandbagging Evaluations on Claude 3.5 Sonnet — LessWrong