On Akrasia, Habits and Reward Maximization — LessWrong