LESSWRONG
LW

Most neural networks don’t have anything comparable to specialised brain areas, at least structurally, so you can’t see which areas light up given some stimulus to determine what that part does. You can do it with individual neurons or channels, though. The best UI I know of to explore this is the “Dataset Samples” option in the OpenAI Microscope, that shows which inputs activate each unit.

Add Comment

mtaran

Jun 13, 2022

The most similar analysis tool I'm aware of is called an activation atlas (https://distill.pub/2019/activation-atlas/), though I've only seen it applied to visual networks. Would love to see it used on language models!

Add Comment

1 comment, sorted by

top scoring

Click to highlight new comments since: Today at 1:45 AM

[-]Dagon3y30

This has proven invaluable in understanding brains.

It has? It's proven quite useful in understanding some types of injury and malfunction. And it may have given hints to developmental and very general structures. But I don't think it's helped very much in understanding cognitive effects or ideas.

Moderation Log

3

[ Question ]

Can you MRI a deep learning model?

3

3

2 Answers sorted by top scoring

Jun 13, 2022

Jun 13, 2022

2 Answers sorted by
top scoring