[Linkpost] Robustified ANNs Reveal Wormholes Between Human Category Percepts

Bogdan Ionut Cirstea

17th Aug 2023

This is a linkpost for https://arxiv.org/abs/2308.06887.

The visual object category reports of artificial neural networks (ANNs) are notoriously sensitive to tiny, adversarial image perturbations. Because human category reports (aka human percepts) are thought to be insensitive to those same small-norm perturbations -- and locally stable in general -- this argues that ANNs are incomplete scientific models of human visual perception. Consistent with this, we show that when small-norm image perturbations are generated by standard ANN models, human object category percepts are indeed highly stable. However, in this very same "human-presumed-stable" regime, we find that robustified ANNs reliably discover low-norm image perturbations that strongly disrupt human percepts. These previously undetectable human perceptual disruptions are massive in amplitude, approaching the same level of sensitivity seen in robustified ANNs. Further, we show that robustified ANNs support precise perceptual state interventions: they guide the construction of low-norm image perturbations that strongly alter human category percepts toward specific prescribed percepts. These observations suggest that for arbitrary starting points in image space, there exists a set of nearby "wormholes", each leading the subject from their current category perceptual state into a semantically very different state. Moreover, contemporary ANN models of biological visual processing are now accurate enough to consistently guide us to those portals.
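As a rough illustration of what "low-norm image perturbations ... toward specific prescribed percepts" means computationally, here is a minimal sketch of a targeted, L2-bounded projected-gradient perturbation. It is an assumption-laden toy, not the paper's method: the model is a linear softmax classifier rather than a robustified deep network, and the names `targeted_l2_pgd`, `W`, `b`, `eps`, and `step_size` are hypothetical.

```python
import numpy as np


def softmax(z):
    """Numerically stable softmax over a 1-D logit vector."""
    z = z - z.max()
    e = np.exp(z)
    return e / e.sum()


def targeted_l2_pgd(W, b, x, target, eps, steps=50, step_size=None):
    """Find a perturbation delta with ||delta||_2 <= eps that pushes a toy
    linear softmax classifier (logits = W @ x + b) toward class `target`.

    Illustrative stand-in only: the paper uses robustified deep networks.
    """
    if step_size is None:
        step_size = 2.5 * eps / steps  # common heuristic, assumed here
    delta = np.zeros_like(x)
    onehot = np.zeros(W.shape[0])
    onehot[target] = 1.0
    for _ in range(steps):
        p = softmax(W @ (x + delta) + b)
        # Gradient of cross-entropy(target) with respect to the input.
        grad = W.T @ (p - onehot)
        gnorm = np.linalg.norm(grad)
        if gnorm == 0.0:
            break
        # Normalized descent step toward the target class.
        delta = delta - step_size * grad / gnorm
        # Project back onto the L2 ball of radius eps.
        dnorm = np.linalg.norm(delta)
        if dnorm > eps:
            delta = delta * (eps / dnorm)
    return delta


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    W = rng.normal(size=(3, 20))  # 3 classes, 20 "pixels" (toy sizes)
    b = np.zeros(3)
    x = rng.normal(size=20)
    delta = targeted_l2_pgd(W, b, x, target=2, eps=0.5)
    print("perturbation norm:", np.linalg.norm(delta))
    print("target prob before:", softmax(W @ x + b)[2])
    print("target prob after: ", softmax(W @ (x + delta) + b)[2])
```

The budget `eps` plays the role of the "low-norm" constraint in the abstract: the perturbation may point in any direction, but its overall size is capped.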

Mitchell_Porter (8mo):

For readers who might skip this paper: it studies questions like, what is the smallest number of pixels you need to change to make a dog look like a bird, crab, primate, frog, etc. It's creepy stuff, reminiscent of Deep Dream.

Radford Neal (8mo):

Perhaps of relevance:

How to Tell the Birds from the Flowers
