x

LESSWRONG

LW

Jatin Nainani — LessWrong

Jatin Nainani

Jatin Nainani

Message

83

6

2y

Jatin Nainani

83

2y

Scaling Sparse Feature Circuit Finding to Gemma 9B

by Diego Caples, Jatin Nainani, CallumMcDougall, and rrenaud

[This is an interim report and continuation of the work from the research sprint done in MATS winter 7 (Neel Nanda's Training Phase)] Try out binary masking for a few residual saes in this colab notebook: [Github Notebook] [Colab Notebook] TL;DR: We propose a novel approach to: 1. Scaling SAE...

Jan 10, 2025•88