x
AI Control: Improving Safety Despite Intentional Subversion — LessWrong