Torture and Dust Specks and Joy--Oh my! or: Non-Archimedean Utility Functions as Pseudograded Vector Spaces
A dozen years ago, Eliezer Yudkowsky asked us which was less (wrong?) bad: * 3^^^3 people each getting a dust speck in their eyes, or * 1 person getting horribly tortured continually for 50 years. He cheekily ended the post with "I think the answer is obvious. How about you?"...
Why does this pose an issue for reinforcement learning? Forgive my ignorance, I do not have a background in the subject. Though I don't believe that I have information which distinguishes cereal/granola in terms of which has stronger highest-severity consequences (given the smallness of those numbers and my inability to conceive of them, I strongly suspect anything I could come up with would exclusively represent epistemic and not aleatoric uncertainty), even if I accept it then the theory would tell me, correctly, that I should act based on that level. If that seems wrong, then it's evidence we've incorrectly identified an implicit severity class in our imagination of the... (read more)