Consider Reconsidering Pascal's Mugging

[-]paulfchristiano8y100

If you have an unbounded utility function and a broad prior, then expected utility calculations don't converge. It's not that decision theory is producing an answer and we are rejecting it---decision theory isn't saying anything. This paper by Peter de Blanc makes the argument. The unbounded cases is just as bad as the infinite case. Put a different way, the argument for representing preferences by utility functions doesn't go through in cases where there are infinitely many possible outcomes.

That said, most people report that they wouldn't make the trade, from which we can conclude that their utility functions are bounded, and so we don't even have to worry about any of this.

I think that the argument you give---that there are much better ways of securing very large payoffs---is also an important part of making intuitive sense of the picture. The mugger was only ever a metaphor, there is no plausible view on which you'd actually pay. If you are sloppy you might conclude that there is some mugger with expected returns twice as large as whatever other plausible use of the money you are considering, but of course all of this is just an artifact of rearranging divergent sums.

[-]Rafael Harth8y10

If you have an unbounded utility function and a broad prior, then expected utility calculations don't converge.

That is the core of what replying to zulu's post made me think.

I won't say too much more until I read up on more existent thoughts, but I as of now I strongly object to this

That said, most people report that they wouldn't make the trade, from which we can conclude that their utility functions are bounded, and so we don't even have to worry about any of this.

I neither think that the conclusion follows, nor that utility functions should ever be bounded. We need another way to model this.

[-]Ari Fox Zerner8y10

Send me $5 or I will destroy the universe. paypal.me/arizerner

[-]Ari Fox Zerner8y10

Thank you for paying. I will not destroy the universe, nor will I issue similar threats against you in the future. In addition, you have demonstrated the admirable quality of willingness to put your money where your mouth is.

[-]zulupineapple8y10

The mugger claiming that they can affect a googolplexplex lives doesn’t give them exclusive access to a non-zero probability of affecting a googolplexplex lives; other ways do exist.

Why do you think that? What is the probability that the mugger does in fact have exclusive access to 3^^^^3 lives? And what is the probability for 3^^^^^3 lives?

By the way, what happens if a billion independent muggers all mug you for 1 dollar, one after another?

[-]Rafael Harth8y10

By the way, what happens if a billion independent muggers all mug you for 1 dollar, one after another?

The same as if one mugger asks a billion times, I believe. Do you think the probability that a mugger is telling the truth is a billion times as high in the world where 1000 000 000 of them ask the AI versus the world where just 1 asks? If the answer is no, then why would the AI think so?

Why do you think that? What is the probability that the mugger does in fact have exclusive access to 3^^^^3 lives? And what is the probability for 3^^^^^3 lives?

In the section you quoted, I am not saying that other ways of affecting 3^^^^3 lives exist, I am saying that other ways with a non-zero probability of affecting that many lives exist – this is trivial, I think. A way to actually do this does, most likely, not exist.

So there is of course a probability that the mugger does have exclusive access to 3^^^^3 lives. Let's call that $p$ . What I am arguing is that it is wrong to assign a fairly low utility $u$ to 1$ worth of resources and

then conclude "aha, since $p \cdot U ($ 3^^^^3 lives) $> u$ , it must be correct to pay!" And the reason for this is that $u$ is not actually small. Calculating $u$ , the utility of one dollar, does itself include considering various mugging-like scenarios; what if there is just a bit of additional self-improvement necessary to see how 3^^^^3 lives can be saved? It is up to the discretion of the AI to decide when the above formula holds.

So $p$ might be much larger than 1/3^^^^3 but $u$ is actually very large, too. (I am usually a fan of making up specific numbers, but in this case that doesn't seem useful).

[-]zulupineapple8y30

I am usually a fan of making up specific numbers, but in this case that doesn't seem useful

I think you really should. I asked you to compare P(mugger can save 3^^^^3 lives) with P(mugger can save 3^^^^^3 lives). The second probability should be only slightly lower than the first, it can't possibly be 3^^^^3/3^^^^^3 as low, because if you're talking to an omnipotent matrixlord, the number of arrows means nothing to them. So it doesn't matter how big u is, with enough arrows P(mugger can save N lives) times U(N lives) is going to catch up.

What I am arguing is that it is wrong to assign a fairly low utility u to 1$ worth of resources

What does "low utility" mean? 1$ presumably has 10 times less utility than the 10$ that I have in my pocket right now, and it's much lower than U(1 life), so it's clearly not the most useful thing in the world, but aside from that, there isn't much to say. The scale of utilities as a "0", but the choice of "1" is arbitrary. Everything is high or low only in comparison to other things.

Do you think the probability that a mugger is telling the truth is a billion times as high in the world where 1000 000 000 of them ask the AI versus the world where just 1 asks?

The muggers may or may not be independent. It's possible that they each of them has independent powers to save a different set of 3^^^^3 lives. It also possible that all of them are lying, but P(a billion people are all lying) is surely much lower than P(one person is lying). I could imagine why you still wouldn't pay, but if you did the math, the numbers would be very different from just one person asking a billion times.

[-]Rafael Harth8y10

Rather than addressing them here, I think I'll make a part 2 where I explain exactly how I think about these points... or, alternatively, realize you've convinced me in the process (in that case I'll reply here again).

What happened since is neither one or the other, which is why I found it tricky to decide what to do. Basically it seems to me that the everything just comes down to the fact that expected utilities don't converge. Every response I'd have to your arguments would run into that wall. This seems like an incredibly relevant and serious problem that throws a wrench into all of these kinds of discussions, and Pascal's Mugging seems like merely a symptom of it.

So basically my view changed from "There's no fire here" to "expected utilities don't converge holy shit why doesn't everyone point this out immediately?" But I don't see PM as showcasing any problem independent from that, and find the way I head it talked about before pretty strange.

[-]Rafael Harth8y10

Thank you for that post. The way I phrased this clearly misses these objections. Rather than addressing them here, I think I'll make a part 2 where I explain exactly how I think about these points... or, alternatively, realize you've convinced me in the process (in that case I'll reply here again).

LESSWRONG
is fundraising!
LW

LESSWRONG
is fundraising!
LW

5

Consider Reconsidering Pascal's Mugging

5

5