Reinforcement Learner Wireheading — LessWrong