Problems with learning values from observation — LessWrong