How much can value learning be disentangled? — LessWrong