964

LESSWRONG
LW

963
AI ControlRecursive Self-ImprovementAI

7

[ Question ]

Handover to AI R&D Agents - relevant research?

by Ariel_
13th Nov 2025
1 min read
A
0
0

7

AI ControlRecursive Self-ImprovementAI

7

New Answer
New Comment
Moderation Log
More from Ariel_
View more
Curated and popular this week
A
0
0

Any good posts/papers discussing "handover"? i.e. the handover of AI research to AI-R&D agents [1] I'm also interested in any adjacent research agendas which might help the handover succeed. For reference, I am scoping a technical research project related to this, happy to share over DM. 

Some of the more relevant work I've read are Plan A, B, C, D, Wentworth's slop post, various scalable oversight/safety case papers, automation collapse - but these don't quite seem enough, so I was wondering if there's other (published) research or if its all happening only at the Labs. 

AI Control work might qualify here, if its working more on the "get useful work out of the models" part of control that is so far a bit less emphasized. 

  1. ^

    The plan of the original OpenAI Superalignment team, and presumably now continuing at Anthropic under Jan Leike (?)