The Singular Value Decompositions of Transformer Weight Matrices are Highly Interpretable
gbcolborne · 3y

Interesting post. I just wanted to mention that your first two SVD matrix illustrations (for heads 10 and 15 of layer 22) are identical, apart from the labeled axes.
