Make Data Pipelines Debuggable by Storing All Source References — LessWrong