This website requires javascript to properly function. Consider activating javascript to get access to all site functionality.
LESSWRONG
Tags
LW
Login
Transformers
Edit
History
Subscribe
Discussion
(0)
Help improve this page
Edit
History
Subscribe
Discussion
(0)
Help improve this page
Transformers
Random Tag
Contributors
Posts tagged
Transformers
Most
Relevant
2
86
Google's PaLM-E: An Embodied Multimodal Language Model
SandXbox
20d
7
2
55
Tracr: Compiled Transformers as a Laboratory for Interpretability | DeepMind
Ω
DragonGod
2mo
Ω
12
2
53
How fast can we perform a forward pass?
jsteinhardt
10mo
9
2
47
Concrete Steps to Get Started in Transformer Mechanistic Interpretability
Ω
Neel Nanda
3mo
Ω
7
2
23
How Do Induction Heads Actually Work in Transformers With Finite Capacity?
Fabien Roger
4d
0
1
44
Searching for Modularity in Large Language Models
NickyP
,
Stephen Fowler
7mo
3
1
42
Building a transformer from scratch - AI safety up-skilling challenge
Ω
Marius Hobbhahn
5mo
Ω
1
1
33
Brief Notes on Transformers
Ω
Adam Jermyn
6mo
Ω
2
1
22
No Really, Attention is ALL You Need - Attention can do feedforward networks
Robert_AIZI
2mo
2
1
8
Research agenda - Building a multi-modal chess-language model
p.b.
1y
2
1
8
Addendum: More Efficient FFNs via Attention
Robert_AIZI
2mo
0
1
7
Are Mixture-of-Experts Transformers More Interpretable Than Dense Transformers?
Q
simeon_c
3mo
Q
4
1
-4
So, just why do GPTs have to operate by continuing an existing string?
Bill Benzon
3d
0