Scaling Sparse Feature Circuit Finding to Gemma 9B
[This is an interim report and continuation of the work from the research sprint done in MATS winter 7 (Neel Nanda's Training Phase)] Try out binary masking for a few residual saes in this colab notebook: [Github Notebook] [Colab Notebook] TL;DR: We propose a novel approach to: 1. Scaling SAE...
Jan 10, 202588