This website requires javascript to properly function. Consider activating javascript to get access to all site functionality.
LESSWRONG
Tags
LW
Login
Research Agendas
Edit
History
Discussion
(0)
Help improve this page
(2 flags)
Posts tagged
Research Agendas
Most Relevant
16
47
The Learning-Theoretic AI Alignment Research Agenda
Ω
Vanessa Kosoy
3y
Ω
37
13
34
New safety research agenda: scalable agent alignment via reward modeling
Ω
Vika
2y
Ω
13
11
185
Embedded Agents
Ω
abramdemski
,
Scott Garrabrant
2y
Ω
41
11
107
Paul's research agenda FAQ
Ω
zhukeepa
3y
Ω
69
11
94
Our take on CHAI’s research agenda in under 1500 words
Ω
alexflint
7mo
Ω
19
11
63
Research Agenda v0.9: Synthesising a human's preferences into a utility function
Ω
Stuart_Armstrong
2y
Ω
20
11
25
AI Governance: A Research Agenda
habryka
2y
3
4
111
Thoughts on Human Models
Ω
Ramana Kumar
,
Scott Garrabrant
2y
Ω
31
4
54
MIRI's technical research agenda
So8res
6y
52
4
54
Preface to CLR's Research Agenda on Cooperation, Conflict, and TAI
Ω
JesseClifton
1y
Ω
8
4
18
Deconfusing Human Values Research Agenda v1
Ω
G Gordon Worley III
1y
Ω
12
3
60
Using GPT-N to Solve Interpretability of Neural Networks: A Research Agenda
elriggs
,
Gurkenglas
6mo
11
2
114
Embedded Agency (full-text version)
Ω
Scott Garrabrant
,
abramdemski
2y
Ω
11
2
108
Robust Delegation
Ω
abramdemski
,
Scott Garrabrant
2y
Ω
10
2
99
Subsystem Alignment
Ω
abramdemski
,
Scott Garrabrant
2y
Ω
12