Paper review: “The Unreasonable Effectiveness of Easy Training Data for Hard Tasks” — LessWrong