LESSWRONGTags
LW

Transformer Circuits

EditHistorySubscribe
Discussion (1)
Help improve this page
EditHistorySubscribe
Discussion (1)
Help improve this page
Transformer Circuits
Random Tag
Contributors
Posts tagged Transformer Circuits
Most Relevant
2
90200 Concrete Open Problems in Mechanistic Interpretability: IntroductionΩ
Neel Nanda
3mo
Ω
0
2
31200 COP in MI: Interpreting Algorithmic ProblemsΩ
Neel Nanda
3mo
Ω
1
2
29A Walkthrough of Interpretability in the Wild (w/ authors Kevin Wang, Arthur Conmy & Alexandre Variengien)Ω
Neel Nanda
5mo
Ω
15
2
20A Walkthrough of In-Context Learning and Induction Heads (w/ Charles Frye) Part 1 of 2Ω
Neel Nanda
4mo
Ω
0
2
17200 COP in MI: Exploring Polysemanticity and SuperpositionΩ
Neel Nanda
3mo
Ω
0
2
17200 COP in MI: Analysing Training DynamicsΩ
Neel Nanda
3mo
Ω
0
2
16Understanding the tensor product formulation in Transformer Circuits
Tom Lieberum
1y
2
2
15200 COP in MI: Looking for Circuits in the WildΩ
Neel Nanda
3mo
Ω
5
2
12200 COP in MI: Techniques, Tooling and AutomationΩ
Neel Nanda
3mo
Ω
0
1
22No Really, Attention is ALL You Need - Attention can do feedforward networks
Robert_AIZI
2mo
2
1
15Anthropic's SoLU (Softmax Linear Unit)
Joel Burget
9mo
1
1
8Addendum: More Efficient FFNs via Attention
Robert_AIZI
2mo
0
Add Posts