rvnnt
Should you publish solutions to corrigibility?
This question is partly motivated by observing recent discussions about corrigibility and wondering to what extent the people involved have thought about how their results might be used. If there existed practically implementable ways to make AGIs corrigible to arbitrary principals, that would enable a wide range of actors to...
Requesting feedback/advice: what Type Theory to study for AI safety?
TL;DR: Requesting feedback or advice on questions of what math to study (or what projects to engage in) in order to learn how to prove things about programs and to analyze the difficulty/possibility of proving things about programs. Ulterior motive: AI safety (and also progress towards a university degree). Motivation,...
How would you rate Principia's operational adequacy? In particular: how would you rate Principia on closure and opsec? [1]
Alternatively, if you use a different framework for thinking about operational adequacy, I'd be very curious to read your assessment w.r.t. that framework! ↩︎