What is ambitious value learning? — LessWrong