LESSWRONGTags
LW

Transformer Circuits

EditHistory
Discussion (1)
Help improve this page
EditHistory
Discussion (1)
Help improve this page
Transformer Circuits
Random Tag
Contributors
Posts tagged Transformer Circuits
3
29Finding Neurons in a Haystack: Case Studies with Sparse ProbingΩ
wesg, Neel Nanda
1mo
Ω
5
2
101200 Concrete Open Problems in Mechanistic Interpretability: IntroductionΩ
Neel Nanda
6mo
Ω
0
2
34How to Think About Activation PatchingΩ
Neel Nanda
12d
Ω
5
2
32200 COP in MI: Interpreting Algorithmic ProblemsΩ
Neel Nanda
6mo
Ω
2
2
29A Walkthrough of Interpretability in the Wild (w/ authors Kevin Wang, Arthur Conmy & Alexandre Variengien)Ω
Neel Nanda
7mo
Ω
15
2
20A Walkthrough of In-Context Learning and Induction Heads (w/ Charles Frye) Part 1 of 2Ω
Neel Nanda
7mo
Ω
0
2
19200 COP in MI: Exploring Polysemanticity and SuperpositionΩ
Neel Nanda
5mo
Ω
0
2
17200 COP in MI: Analysing Training DynamicsΩ
Neel Nanda
5mo
Ω
0
2
16Understanding the tensor product formulation in Transformer Circuits
Tom Lieberum
1y
2
2
15200 COP in MI: Looking for Circuits in the WildΩ
Neel Nanda
6mo
Ω
5
2
13200 COP in MI: Techniques, Tooling and AutomationΩ
Neel Nanda
5mo
Ω
0
2
8Explaining the Transformer Circuits Framework by Example
Felix Hofstätter
2mo
0
1
75An Analogy for Understanding Transformers
TheMcDouglas
1mo
5
1
23No Really, Attention is ALL You Need - Attention can do feedforward networks
Robert_AIZI
5mo
2
1
17Anthropic's SoLU (Softmax Linear Unit)
Joel Burget
1y
1
Load More (15/16)
Add Posts