Corrigibility through stratified indifference — LessWrong