A Models-centric Approach to Corrigible Alignment — LessWrong