x
[Linkpost] Interpreting Multimodal Video Transformers Using Brain Recordings — LessWrong