Apart Research

Contributors: Esben Kran (4), habryka (4)

Apart Research is a research organization working on AI alignment. This tag covers posts written by Apart researchers as well as content about Apart Research.

Posts tagged Apart Research
Results from the interpretability hackathon (Esben Kran, Neel Nanda; 1y; karma 81; 0 comments; tag relevance 4)
Newsletter for Alignment Research: The ML Safety Updates (Esben Kran; 1y; karma 25; 0 comments; tag relevance 3)
Black Box Investigation Research Hackathon (Esben Kran, Jonas Hallgren; 1y; karma 9; 4 comments; tag relevance 3)
Safety timelines: How long will it take to solve alignment? (Esben Kran, JonathanRystroem, Steinthal; 1y; karma 37; 7 comments; tag relevance 2)
AI Safety Ideas: An Open AI Safety Research Platform (Esben Kran; 1y; karma 24; 0 comments; tag relevance 2)
Results from the language model hackathon (Esben Kran; 1y; karma 22; 1 comment; tag relevance 2)
We Found An Neuron in GPT-2 (Joseph Miller, Clement Neo; 10mo; karma 141; 22 comments; tag relevance 1; Ω)
[Book] Interpretable Machine Learning: A Guide for Making Black Box Models Explainable (Esben Kran; 1y; karma 20; 1 comment; tag relevance 1)
ML Safety at NeurIPS & Paradigmatic AI Safety? MLAISU W49 (Esben Kran, Steinthal; 1y; karma 19; 0 comments; tag relevance 1)
Identifying semantic neurons, mechanistic circuits & interpretability web apps (Esben Kran, Neel Nanda; 8mo; karma 18; 0 comments; tag relevance 1)
Join the interpretability research hackathon (Esben Kran; 1y; karma 15; 0 comments; tag relevance 1)
Will Machines Ever Rule the World? MLAISU W50 (Esben Kran; 1y; karma 12; 7 comments; tag relevance 1)
Early Experiments in Reward Model Interpretation Using Sparse Autoencoders (lukemarks, Amirali Abdullah, Rauno Arike, Fazl, nothoughtsheadempty; 2mo; karma 11; 0 comments; tag relevance 1)
Join the AI Testing Hackathon this Friday (Esben Kran; 1y; karma 10; 0 comments; tag relevance 1)
Robustness & Evolution [MLAISU W02] (Esben Kran; 1y; karma 10; 0 comments; tag relevance 1)
Showing 15 of 19 posts.