x
Corrigibility Scales To Value Alignment — LessWrong