New paper on aligning AI with human values — LessWrong