VLMs can Aggregate Scattered Training Patches
This is the abstract and summary of our new paper. We show that vision-language models can learn to reconstruct harmful images from benign-looking patches scattered across training data—a phenomenon we call visual stitching. This ability allows dangerous content to bypass moderation and be reassembled during inference, raising critical safety concerns...
Jun 16, 2025