A Technical Primer on Mechanistic Interpretability
> Note: This is a static version of an interactive primer with animated visualizations and a glossary sidebar. I recommend reading the full version, but the text stands on its own. Motivation & Background On the one hand, I write this as a neuroscientist who believes the techniques developed by...
Feb 191