x
Exploring Concept-Specific Slices in Weight Matrices for Network Interpretability — LessWrong