x
We need a field of Reward Function Design — LessWrong