x
The Auditor’s Key: A Framework for Continual and Adversarial AI Alignment — LessWrong