LESSWRONGTags
LW

Alignment Jam

EditHistory
Discussion (0)
Help improve this page
EditHistory
Discussion (0)
Help improve this page
Alignment Jam
Random Tag
Contributors
1Esben Kran

This lists the posts that have come from the Alignment Jam hackathons.

Posts tagged Alignment Jam
1
137We Found An Neuron in GPT-2Ω
Joseph Miller, Clement Neo
4mo
Ω
22
1
118Solving the Mechanistic Interpretability challenges: EIS VII Challenge 1Ω
StefanHex, Marius Hobbhahn
1mo
Ω
1
1
81Results from the interpretability hackathon
Esben Kran, Neel Nanda
7mo
0
1
62Solving the Mechanistic Interpretability challenges: EIS VII Challenge 2Ω
StefanHex, Marius Hobbhahn
22d
Ω
0
1
20Superposition and Dropout
Edoardo Pona
1mo
2
1
18Identifying semantic neurons, mechanistic circuits & interpretability web apps
Esben Kran, Neel Nanda
2mo
0
1
13Results from the AI testing hackathon
Esben Kran
5mo
0
Add Posts