Reverse-engineering using interpretability — LessWrong