Thoughts on implementing corrigible robust alignment — LessWrong