How to Contribute to Theoretical Reward Learning Research — LessWrong