x
Automating Mechanistic Interpretability via Program Synthesis — LessWrong