x
Mechanistic interpretability through clustering — LessWrong