Ambitious vs. narrow value learning — LessWrong