The Theoretical Reward Learning Research Agenda: Introduction and Motivation — LessWrong