High-level approaches to rigor in interpretability — LessWrong