x
This website requires javascript to properly function. Consider activating javascript to get access to all site functionality.
LESSWRONG
LW
Login
Unofficial OpenAI Alignment Blog Linkposts — LessWrong
Unofficial OpenAI Alignment Blog Linkposts
https://alignment.openai.com/
Posts before Apr 30, 2026 are not crossposted.
6
Auto-review of agent actions without synchronous human oversight
papetoast
16d
0
24
Investigating the consequences of accidentally grading CoT during RL
papetoast
12d
0