A toy model of the treacherous turn — LessWrong