x
Formalizing Policy-Modification Corrigibility — LessWrong