x
This website requires javascript to properly function. Consider activating javascript to get access to all site functionality.
LESSWRONG
LW
Login
Unofficial OpenAI Alignment Blog Linkposts — LessWrong
Unofficial OpenAI Alignment Blog Linkposts
https://alignment.openai.com/
Posts before Apr 30, 2026 are not crossposted.
6
Auto-review of agent actions without synchronous human oversight
papetoast
1mo
0
24
Investigating the consequences of accidentally grading CoT during RL
papetoast
1mo
0
5
Can public chat data predict real-world AI misalignments?
papetoast
1d
0