x
The Theoretical Foundations of Reward Learning — LessWrong