The next wave of model improvements will be due to data quality — LessWrong