Towards Multimodal Interpretability: Learning Sparse Interpretable Features in Vision Transformers — LessWrong