LESSWRONGTags
LW

Transformers

EditHistorySubscribe
Discussion (0)
Help improve this page
EditHistorySubscribe
Discussion (0)
Help improve this page
Transformers
Random Tag
Contributors
Posts tagged Transformers
3
110How LLMs are and are not myopic
Ω
janus
4mo
Ω
10
2
86Google's PaLM-E: An Embodied Multimodal Language Model
SandXbox
9mo
7
2
71Residual stream norms grow exponentially over the forward pass
Ω
StefanHex, TurnTrout
7mo
Ω
24
2
62Tracr: Compiled Transformers as a Laboratory for Interpretability | DeepMind
Ω
DragonGod
10mo
Ω
12
2
54Concrete Steps to Get Started in Transformer Mechanistic Interpretability
Ω
Neel Nanda
1y
Ω
7
2
53How fast can we perform a forward pass?
jsteinhardt
1y
9
2
23How Do Induction Heads Actually Work in Transformers With Finite Capacity?
Fabien Roger
8mo
0
1
78An Analogy for Understanding Transformers
CallumMcDougall
6mo
5
1
44Brief Notes on Transformers
Ω
Adam Jermyn
1y
Ω
3
1
44Searching for Modularity in Large Language Models
NickyP, Stephen Fowler
1y
3
1
42Building a transformer from scratch - AI safety up-skilling challenge
Ω
Marius Hobbhahn
1y
Ω
1
1
42GPT-2's positional embedding matrix is a helix
AdamYedidia
4mo
18
1
32New Tool: the Residual Stream Viewer
Ω
AdamYedidia
2mo
Ω
7
1
29We Need To Know About Continual Learning
michael_mjd
7mo
14
1
26The positional embedding matrix and previous-token heads: how do they actually work?
Ω
AdamYedidia
4mo
Ω
4
Load More (15/25)
Add Posts