x
Predicted corrigibility: pareto improvements — LessWrong