Monitoring benchmark for AI control
Monitoring benchmark/Semi-automated red-teaming for AI control We are a team of control researchers supported by CG’s Technical AI safety grant. We are now halfway through our project and would like to get feedback on the following contributions. Have a low bar for adding questions or comments to the document, we...
Jan 3047