x
Corrigibility through stratified indifference — LessWrong