What is wrong with this approach to corrigibility? — LessWrong