Thoughts on implementing corrigible robust alignment — LessWrong