Linear vs Non-linear Probes for Interpretability — LessWrong