Corrigibility as Constrained Optimisation — LessWrong