A day in the life of a mechanistic interpretability researcher

28th Nov 2023

Tags: Interpretability (ML & AI), AI, World Modeling

3 comments

Ben Pace (9mo)

That was fun to watch. But I would appreciate someone spelling out the implied connection to mechanistic interpretability.

Joyee Chen (9mo)

Hint: does Charlie Chaplin have a gears-level understanding of the system?

Bill Benzon (9mo)

LOL! Plus he's clearly lost in a vast system he can't comprehend. How do you comprehend a complex network of billions upon billions of weights? Is there any way you can get on top of the system to observe its operations, to map them out?