LESSWRONGTags
LW

Transformers

EditHistorySubscribe
Discussion (0)
Help improve this page
EditHistorySubscribe
Discussion (0)
Help improve this page
Transformers
Random Tag
Contributors
Posts tagged Transformers
Most Relevant
2
86Google's PaLM-E: An Embodied Multimodal Language Model
SandXbox
20d
7
2
55Tracr: Compiled Transformers as a Laboratory for Interpretability | DeepMind Ω
DragonGod
2mo
Ω
12
2
53How fast can we perform a forward pass?
jsteinhardt
10mo
9
2
47Concrete Steps to Get Started in Transformer Mechanistic InterpretabilityΩ
Neel Nanda
3mo
Ω
7
2
23How Do Induction Heads Actually Work in Transformers With Finite Capacity?
Fabien Roger
4d
0
1
44Searching for Modularity in Large Language Models
NickyP, Stephen Fowler
7mo
3
1
42Building a transformer from scratch - AI safety up-skilling challengeΩ
Marius Hobbhahn
5mo
Ω
1
1
33Brief Notes on TransformersΩ
Adam Jermyn
6mo
Ω
2
1
22No Really, Attention is ALL You Need - Attention can do feedforward networks
Robert_AIZI
2mo
2
1
8Research agenda - Building a multi-modal chess-language model
p.b.
1y
2
1
8Addendum: More Efficient FFNs via Attention
Robert_AIZI
2mo
0
1
7Are Mixture-of-Experts Transformers More Interpretable Than Dense Transformers? Q
simeon_c
3mo
Q
4
1
-4So, just why do GPTs have to operate by continuing an existing string?
Bill Benzon
3d
0
Add Posts