x
The Endgame for Mechanistic Interpretability — LessWrong