x
Safe exploration and corrigibility — LessWrong