x
Sabotage Evaluations for Frontier Models — LessWrong