x

LESSWRONG

LW

Manas Joglekar — LessWrong

Manas Joglekar

Manas Joglekar

Message

134

6y

Manas Joglekar

134

6y

Why we are excited about confession!

by Boaz Barak, Gabriel Wu, and Manas Joglekar

Boaz Barak, Gabriel Wu, Jeremy Chen, Manas Joglekar [Linkposting from the OpenAI alignment blog, where we post more speculative/technical/informal results and thoughts on safety and alignment.] > TL;DR We go into more details and some follow up results from our paper on confessions (see the original blog post). We give...