This website requires javascript to properly function. Consider activating javascript to get access to all site functionality.
LESSWRONG
Tags
LW
Login
Transformer Circuits
Edit
History
Discussion
(1)
Help improve this page
Edit
History
Discussion
(1)
Help improve this page
Transformer Circuits
Random Tag
Contributors
Posts tagged
Transformer Circuits
Most Relevant
3
29
Finding Neurons in a Haystack: Case Studies with Sparse Probing
Ω
wesg
,
Neel Nanda
1mo
Ω
5
2
101
200 Concrete Open Problems in Mechanistic Interpretability: Introduction
Ω
Neel Nanda
6mo
Ω
0
2
34
How to Think About Activation Patching
Ω
Neel Nanda
12d
Ω
5
2
32
200 COP in MI: Interpreting Algorithmic Problems
Ω
Neel Nanda
6mo
Ω
2
2
29
A Walkthrough of Interpretability in the Wild (w/ authors Kevin Wang, Arthur Conmy & Alexandre Variengien)
Ω
Neel Nanda
7mo
Ω
15
2
20
A Walkthrough of In-Context Learning and Induction Heads (w/ Charles Frye) Part 1 of 2
Ω
Neel Nanda
7mo
Ω
0
2
19
200 COP in MI: Exploring Polysemanticity and Superposition
Ω
Neel Nanda
5mo
Ω
0
2
17
200 COP in MI: Analysing Training Dynamics
Ω
Neel Nanda
5mo
Ω
0
2
16
Understanding the tensor product formulation in Transformer Circuits
Tom Lieberum
1y
2
2
15
200 COP in MI: Looking for Circuits in the Wild
Ω
Neel Nanda
6mo
Ω
5
2
13
200 COP in MI: Techniques, Tooling and Automation
Ω
Neel Nanda
5mo
Ω
0
2
8
Explaining the Transformer Circuits Framework by Example
Felix Hofstätter
2mo
0
1
75
An Analogy for Understanding Transformers
TheMcDouglas
1mo
5
1
23
No Really, Attention is ALL You Need - Attention can do feedforward networks
Robert_AIZI
5mo
2
1
17
Anthropic's SoLU (Softmax Linear Unit)
Joel Burget
1y
1