A toy model of a corrigibility problem — LessWrong