AGI-Automated Interpretability is Suicide — LessWrong