Reward Is Not Necessary: How To Create A Compositional Self-Preserving Agent For Life-Long Learning — LessWrong