Reinforcement and Short-Term Rewards as Anti-Akratic — LessWrong