Why "training against scheming" is hard — LessWrong