Three things I found while building a circuit discovery toolkit
I've been building a mechanistic interpretability toolkit for the last few months. Not writing about it — actually building it, running into walls, rewriting things. This post is about two implementation problems that cost me real time, and a metric I had to invent because I couldn't find what I...
Mar 171