x

LESSWRONG
LW

Jan Kulveit — LessWrong

Jan Kulveit

Jan Kulveit

Message

175

Ω

34

1

20

7y

Jan Kulveit

175

Ω

34

7y

;

Box inversion hypothesis

This text originated from a retreat in late 2018, where researchers from FHI, MIRI and CFAR did an extended double-crux on AI safety paradigms, with Eric Drexler and Scott Garrabrant in the core. In the past two years I tried to improve it in terms of understandability multiple times, but...

Oct 20, 2020•59