Considerations in diffuse control — LessWrong
11
Methodological considerations in making malign initializations for control research
Alek Westover, Vivek Hebbar, Julian Stastny
3mo
0
4
Three visions for diffuse control
Alek Westover
2mo
0
29
Four Downsides of Training Policies Online
Alek Westover, egan
3mo
4
18
Theoretical predictions on the sample efficiency of training policies and activation monitors
Alek Westover, Vivek Hebbar
3mo
2
32
How will we do SFT on models with opaque reasoning?
Ω
Alek Westover, Vivek Hebbar, egan
1mo
Ω
17