Thoughts on reward engineering — LessWrong