x
Blind deep-deployment evals for control & sabotage — LessWrong