Unofficial OpenAI Alignment Blog Linkposts — LessWrong

May 04, 2026 by papetoast

https://alignment.openai.com/

Posts before Apr 30, 2026 are not crossposted.

6 · Auto-review of agent actions without synchronous human oversight · 1mo · 0

24 · Investigating the consequences of accidentally grading CoT during RL · 1mo · 0

5 · Can public chat data predict real-world AI misalignments? · 1d · 0