Problems integrating decision theory and inverse reinforcement learning — LessWrong