Martian Interpretability Challenge: The Core Problems In Interpretability — LessWrong