x
Can corrigibility be learned safely? — LessWrong