Preferences over non-rewards — LessWrong