x
The reward engineering problem — LessWrong