LESSWRONGTags
LW

Research Agendas

EditHistory
Discussion(0)
Help improve this page(2 flags)
Posts tagged Research Agendas
Most Relevant
16
47The Learning-Theoretic AI Alignment Research AgendaΩ
Vanessa Kosoy
3y
Ω
37
13
34New safety research agenda: scalable agent alignment via reward modelingΩ
Vika
2y
Ω
13
11
185Embedded AgentsΩ
abramdemski, Scott Garrabrant
2y
Ω
41
11
107Paul's research agenda FAQΩ
zhukeepa
3y
Ω
69
11
94Our take on CHAI’s research agenda in under 1500 wordsΩ
alexflint
7mo
Ω
19
11
63Research Agenda v0.9: Synthesising a human's preferences into a utility functionΩ
Stuart_Armstrong
2y
Ω
20
11
25AI Governance: A Research Agenda
habryka
2y
3
4
111Thoughts on Human ModelsΩ
Ramana Kumar, Scott Garrabrant
2y
Ω
31
4
54MIRI's technical research agenda
So8res
6y
52
4
54Preface to CLR's Research Agenda on Cooperation, Conflict, and TAI Ω
JesseClifton
1y
Ω
8
4
18Deconfusing Human Values Research Agenda v1Ω
G Gordon Worley III
1y
Ω
12
3
60Using GPT-N to Solve Interpretability of Neural Networks: A Research Agenda
elriggs, Gurkenglas
6mo
11
2
114Embedded Agency (full-text version)Ω
Scott Garrabrant, abramdemski
2y
Ω
11
2
108Robust DelegationΩ
abramdemski, Scott Garrabrant
2y
Ω
10
2
99Subsystem AlignmentΩ
abramdemski, Scott Garrabrant
2y
Ω
12
Load More (15/40)
Add Posts