x
VLMs can Aggregate Scattered Training Patches — LessWrong