ZachParent — LessWrong

48

1

7mo

The sum of its parts: composing AI control protocols

This work was supported through the MARS (Mentorship for Alignment Research Students) program at the Cambridge AI Safety Hub (caish.org/mars). We would like to thank Redwood Research for their support and Andrés Cotton for his management of the project. TLDR: The field of AI control takes a worst-case scenario approach,...

Oct 15, 2025 • 14

Prompt optimization can enable AI control research

This project was conducted as part of a one-week research sprint for the Finnish Alignment Engineering Bootcamp. We would like to thank Vili Kohonen and Tyler Tracy for their feedback and guidance on the project. TLDR: Research in AI control is tedious. To test the safety and usefulness of...

Sep 23, 2025 • 40