x
Value learning subproblem: learning goals of simple agents — LessWrong