Crossposted from the AI Alignment Forum. May contain more technical jargon than usual.
This is a linkpost for https://carado.moe/qaci.html

this post aims to keep track of posts relating to the question-answer counterfactual interval proposal for AI alignment, abbreviated "QACI" and pronounced "quashy". i'll keep it updated to reflect the state of the research.

this research is primarily published on the Orthogonal website and discussed on the Orthogonal discord.

as a top-level view of QACI, you might want to start with:

the set of all posts relevant to QACI includes:
