Sharing a mostly inactive empirical agenda from mid 2025 (companion piece to a Techgov paper). Currently not planning to spend much time on it, but may consider collaborating. For a while I was curious about Scalable Oversight and AI Control. Both because they're cool to work on, and because they...
Any good posts/papers discussing "handover"? i.e. the handover of AI research to AI-R&D agents [1] I'm also interested in any adjacent research agendas which might help the handover succeed. For reference, I am scoping a technical research project related to this, happy to share over DM. Some of the more...
Disclaimer: This is a toy example of a scheming environment. It should be read mainly as research practice and not as research output. I am sharing it in case it helps others, and also to get feedback. The initial experiments were done in April 2025 for a MATS 8.0 submission,...
(mildly rewritten version of my EA forum post) The EU AI Act was originally proposed in 2020 as a very "EU regulates stuff first" kind of legislation, trying to make sure EU values are upheld (fairness, transparency, democracy, etc). Several revisions (and some lobbying) later, GPAI (general purpose AI) and...