LESSWRONG

Superposition

Edited by duck_master. Last updated 5th Dec 2023.

Posts about superposition: the hypothesis that neural networks represent more concepts (features) than they have neurons, encoding each concept as a combination of many neurons rather than dedicating one neuron to it.
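
A minimal NumPy sketch of this idea follows. It rests on assumptions that are not drawn from any of the tagged posts: each concept is given a random unit direction spread across many neurons, more concepts are stored than there are neurons, and only a few concepts are active at once. The sizes, the random directions, and the 0.5 read-out threshold are all illustrative choices.

```python
import numpy as np

rng = np.random.default_rng(0)

n_neurons = 512     # size of the activation space (illustrative assumption)
n_features = 2048   # more concepts than neurons (illustrative assumption)

# Give every concept a random unit direction spread across many neurons.
directions = rng.standard_normal((n_features, n_neurons))
directions /= np.linalg.norm(directions, axis=1, keepdims=True)

# Sparse input: only three concepts are active at the same time.
active = rng.choice(n_features, size=3, replace=False)
feature_values = np.zeros(n_features)
feature_values[active] = 1.0

# Superpose: the activation is the sum of the active concepts' directions.
activation = feature_values @ directions

# Read each concept back by projecting the activation onto its direction.
readout = directions @ activation

# Active concepts read out near 1; inactive ones only pick up interference
# from the non-orthogonal directions, so a threshold separates them here.
recovered = np.flatnonzero(readout > 0.5)
print(sorted(active.tolist()), sorted(recovered.tolist()))
```

In this toy setting the read-out works because random directions in a high-dimensional space are nearly orthogonal and few concepts are active together. Activating many concepts at once, or shrinking `n_neurons`, increases the interference.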

Posts tagged Superposition
154 karma | [Interim research report] Taking features out of superposition with sparse autoencoders | Ω | Lee Sharkey, Dan Braun, beren | 3y | 23 comments
206 karma | Toward A Mathematical Framework for Computation in Superposition | Ω | Dmitry Vaintrob, jake_mendel, Kaarel | 2y | 19 comments
131 karma | Circuits in Superposition: Compressing many small neural networks into one | Ω | Lucius Bushnaq, jake_mendel | 11mo | 9 comments
72 karma | Circuits in Superposition 2: Now with Less Wrong Math | Ω | Linda Linsefors, Lucius Bushnaq | 2mo | 0 comments
67 karma | Superposition is not "just" neuron polysemanticity | Ω | LawrenceC | 1y | 4 comments
46 karma | Some costs of superposition | Ω | Linda Linsefors | 2y | 11 comments
22 karma | AI alignment as a translation problem | Roman Leventov | 2y | 2 comments
9 karma | Conditional Importance in Toy Models of Superposition | james__p | 7mo | 4 comments
8 karma | From Conceptual Spaces to Quantum Concepts: Formalising and Learning Structured Conceptual Models | Roman Leventov | 2y | 1 comment
5 karma | Thoughts on Toy Models of Superposition | james__p | 7mo | 2 comments
289 karma | Towards Monosemanticity: Decomposing Language Models With Dictionary Learning | Ω | Zac Hatfield-Dodds | 2y | 22 comments
137 karma | Comparing Anthropic's Dictionary Learning to Ours | Ω | Robert_AIZI | 2y | 8 comments
103 karma | Open Source Sparse Autoencoders for all Residual Stream Layers of GPT2-Small | Ω | Joseph Bloom | 2y | 37 comments
90 karma | Growth and Form in a Toy Model of Superposition | Ω | Liam Carroll, Edmund Lau | 2y | 7 comments
77 karma | Interpretability with Sparse Autoencoders (Colab exercises) | Ω | CallumMcDougall | 2y | 9 comments