LESSWRONG

gbcolborne

Comments

The Singular Value Decompositions of Transformer Weight Matrices are Highly Interpretable
gbcolborne (3y)

Interesting post. I just wanted to mention that your first two SVD matrix illustrations (for heads 10 and 15 of layer 22) are identical, apart from the labeled axes.
