x
Misspecification in Inverse Reinforcement Learning — LessWrong