Reward button alignment — LessWrong