Model Mis-specification and Inverse Reinforcement Learning — LessWrong