Training for corrigibility: obvious problems? — LessWrong