New to LessWrong?

New Comment
3 comments, sorted by Click to highlight new comments since: Today at 4:16 PM

That was fun to watch. But I would appreciate someone spelling out the implied connection to mechanistic interpretability.

Hint: does Charlie Chaplin have a gears-level understanding of the system?

LOL! Plus he's clearly lost in a vast system he can't comprehend. How do you comprehend a complex network of billions upon billions of weights? Is there any way you can get on top of the system to observe its operations, to map them out?