x
The Defender’s Advantage of Interpretability — LessWrong