Broad Basins and Data Compression — LessWrong