VLMs can Aggregate Scattered Training Patches — LessWrong