monika_j — LessWrong

monika_j

53

4mo

Monitoring benchmark for AI control

Monitoring benchmark/Semi-automated red-teaming for AI control

We are a team of control researchers supported by CG’s Technical AI safety grant. We are now halfway through our project and would like to get feedback on the following contributions. Have a low bar for adding questions or comments to the document, we...