Internal Interfaces Are a High-Priority Interpretability Target — LessWrong