A sketch of a value-learning sovereign — LessWrong