Summary

Suppose that you are sitting in front of a big red button, and if you press it after waiting t timesteps, you will receive t units of utility. If you want to maximize the amount of utility you obtain, then when should you press it? Clearly not at any particular time, because pressing at any time would confer less utility than waiting one more timestep. But clearly not never, because then you’d never get any utility at all!

This is an example of a procrastination paradox. In a sense, these problems are specification issues that happen when you want a system to do two contradictory things: press a button sometime, but press it later than any time (aka never). Situations like these are important decision-theoretic dilemmas in which certain naive expected utility maximizers can fail miserably. They pose a challenge for the understanding and implementation of mechanistic rationality by demonstrating that in some situations, there is no optimal normative decision procedure. In fact, their existence has even been considered an argument against utility functions in general.

Unfortunately, there is a great deal of confusion surrounding them, so the goal of this post is to build a rigorous, novel framework for understanding and eventually making solemn peace with them. After an in-depth discussion of these paradoxes, this post explores three aspects of them in the context of formalisms from decision theory and reinforcement learning: the good, the bad, and the ugly. Sadly, the best order to discuss them in isn’t the one established by the classic Spaghetti Western:

The Bad: If an agent is trying to maximize an expected utility function which has no defined maximum achievable in finitely many steps, then it can be vulnerable to procrastination paradoxes. Unfortunately, expected utility functions like these can be very natural to design.

The Ugly: For any decision-making process which avoids procrastination traps, it is trivial to construct one which does so better. There is no such thing as an optimal approach to procrastination dilemmas.

The Good: Certain probabilistic strategies for handling procrastination dilemmas can avoid procrastinating forever while still having infinite expected utility for dilemmas in which the limit of the obtainable utility is also infinite.

A Rigorous Understanding

(Naive) Expected Utility Maximization

In general, we want to both be systems and build systems which are good at achieving our goals. What are those goals? Whatever we want, but they should probably make sense in a meaningful way. For example, we need some way of scoring or at least ranking different outcomes in a way that reflects our preferences, because otherwise we would never be able to decide what we want. Oftentimes, a very natural and principled way to do this is to specify some utility function which takes states of the world as input and outputs a real number reflecting how positively we view each state. In fact, the entire field of machine learning is built upon this paradigm: specifying an objective function and having a model learn how to optimize it.

A connection between rational behavior and the maximization of an expected utility function has also been formalized in decision theory under the von Neumann-Morgenstern utility theorem, which states that if your preferences satisfy a few simple axioms, and you are acting in an environment with finitely many possible trajectories which you can take through it, then you will be acting in accordance with the maximization of some utility function defined over those trajectories.

While the expected utility maximization framework is extremely useful and has these nice theoretical properties, things can start to break down if infinity ever gets involved, as we will soon see.

A Model

As a model of a simple expected utility maximizer, consider a robot R who wants to maximize some expected utility function, U, which maps states to real numbers. When making a choice, suppose that R will analyze its options and then attempt to make the choice which will maximize its expected utility over all possible futures. In other words, R will take the action that stands the best chance of putting it into a new state in which it has the greatest amount of expected utility available to it over future trajectories. Unless otherwise stated, it will be assumed that R does not use temporal discounting.

A Panoply of Procrastination Paradoxes

It’s common to hear people talk about “the procrastination paradox” as if there were only one. However, there are many types of procrastination dilemmas which are all difficult problems for the same conceptual reason but which aren’t exactly isomorphic from a practical standpoint. Understanding different variants is helpful for thinking about the many ways a naive agent like R could fail (or be exploited).

The Classic Dilemma: Consider the “classic” procrastination dilemma: suppose that there is a button which R would like to press at some point but which it would prefer to press later as opposed to sooner. If so, then procrastinating a longer time before pressing it is better than a shorter time. So when should R press it? If it just waited and waited, it would never press it at all, but if it ever pressed it, then clearly this outcome would not have been as good as if R had waited a bit longer. In this case, R will compare its two actions at each timestep, PRESS and WAIT, and will reason that waiting always puts it in a state with higher achievable utility. Consequently, it will never press the button and never get any utility. This will be referred to as a procrastination trap, and it’s a type of infinite loop that’s bad to be stuck in.

More concretely, suppose that this button will confer upon R a utility of t if it is pressed at timepoint t. The amount of utility R has access to will increase forever but it will never cash out on it. In fact, R would even consider this situation to have infinite value.
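To make this concrete, here is a minimal sketch (my own illustration, not code from the post) of the naive maximizer R facing this dilemma. The function names and the finite horizon used to end the loop are assumptions made only for illustration; the point is that WAIT always looks strictly better than PRESS, so R never presses.

```python
# A minimal sketch (not from the post) of naive R in the classic dilemma,
# where pressing the button at timestep t pays t units of utility.

def utility_if_pressed(t: int) -> float:
    return float(t)  # the button pays t utility if pressed at timestep t

def best_reachable_by_waiting(t: int, horizon: int) -> float:
    # The utility R believes it could still get if it waits at least one more step.
    return max(utility_if_pressed(k) for k in range(t + 1, horizon + 1))

def simulate_naive_R(horizon: int = 1000) -> None:
    for t in range(horizon):
        press_value = utility_if_pressed(t)
        wait_value = best_reachable_by_waiting(t, horizon)
        if press_value >= wait_value:
            print(f"R presses at t={t} for {press_value} utility")
            return
        # WAIT always looks strictly better, so R keeps procrastinating...
    print("R never pressed and banked 0 utility")

simulate_naive_R()
```

However large the horizon is made, the comparison comes out the same way at every step, which is the procrastination trap in miniature.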

Bounded Utilities: The amount of obtainable utility in a procrastination dilemma need not be unbounded in order to result in a procrastination trap. It just needs to be increasing. It could be the case that instead of a utility of t at timestep t, this process returned a utility of 1-(1/2)^t. If so, R would still always prefer waiting and would never press the button.

Bleeding Utility Forever: Suppose that waiting each timestep cost R a certain amount of utility. In this case, as long as the utility available from the button outpaced the cost to R over time, not only would R be stuck in an infinite loop and never get any utility, it would bleed utility forever. Fascinatingly, as t approaches infinity, the amount of utility available to R would approach infinity, but the amount of utility it actually receives would approach negative infinity!

Probabilistic: Suppose that there is a game in which R begins with 1 unit of utility. At each timestep, it is given the option to flip a coin or quit the game. If it quits, it takes its winnings. If it flips a heads, the winnings triple. But if it ever flips a tails, the game ends, and R leaves empty-handed. The expected value of flipping once more is always greater than that of quitting now, so R would always keep flipping and would be guaranteed to leave with nothing. Importantly, in this case, R would still fall into a procrastination trap even though this process terminates in finite time with probability one!
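The gap between the one-step expectation and what the always-flip policy actually banks can be seen in a small simulation (my own sketch, not from the post; the payout rule and trial counts are illustrative assumptions):

```python
import random

# Sketch (not from the post): the tripling coin-flip game. Quitting banks the
# current winnings; flipping triples them on heads (p = 1/2) and ends the game
# with nothing on tails. One more flip always has positive expected value,
# yet the policy "never quit" banks nothing almost surely.

def expected_value_of_one_more_flip(winnings: float) -> float:
    return 0.5 * (3 * winnings) + 0.5 * 0.0  # = 1.5 * winnings > winnings

def play_never_quit(max_flips: int = 10_000) -> float:
    winnings = 1.0
    for _ in range(max_flips):
        if random.random() < 0.5:   # tails: game over, leave empty-handed
            return 0.0
        winnings *= 3.0             # heads: winnings triple
    return winnings                 # essentially never reached

print(expected_value_of_one_more_flip(1.0))                      # 1.5
print(sum(play_never_quit() for _ in range(10_000)) / 10_000)    # ~0.0
```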

Game Theoretic: Procrastination dilemmas can appear in competitive games in which the winner is the one who procrastinates for longer.

[A brief tangent:] There is one particularly interesting case in which this can manifest. Suppose that two agents are playing rock-paper-scissors and that each has access to the other’s source code. If so, then each could simulate the other which would mean simulating itself simulating the other simulating itself, etc. For each agent, recursing to a greater depth will strictly increase their chances of going deeper than their opponent and thus of winning the game. If both wanted badly to win the game (and had access to infinite free compute) such that they refused to ever stop recursing, then both would be stuck in a procrastination trap. Interestingly, this type of problem was the focus of this paper which proposed “reflective oracles” as a basis for stable game-theoretic behavior for agents with open source code.

A “Distilled” Version: One other type of procrastination paradox might be referred to as a “distilled” version because it gets directly at the heart of what a procrastination paradox is. Suppose that some all-powerful Omega tells R, “Give me a number, and I’ll give you that value in utility.” And that’s it. What number should R give? If it just starts running some algorithm that will iteratively create larger and larger numbers forever, but it never returns one, then R will be stuck in a procrastination trap. There is no computable, optimal decision procedure here.

In fact, this distilled dilemma demonstrates something crucial about procrastination dilemmas in general. There is no correct answer to one for the same reason that there is no correct answer to the questions, “What is the biggest finite number?” or “What is the biggest number less than 1?”

Temporal Discounting is not a Solution in General

Temporal discounting is commonly incorporated into objective functions. For example, using a discount factor of d < 1, I might discount rewards one timestep in the future by a factor of d, rewards two timesteps in the future by d^2, and so on. Discounting can help avoid procrastination traps as long as the product of the reward prospects and the discount factor approaches zero as time approaches infinity. But if the reward prospects increase at a rate which outpaces an agent’s discount function, the agent in question will be vulnerable to procrastination traps nonetheless.
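A toy comparison (my own illustrative numbers, not from the post) makes the criterion concrete: with d = 0.9, discounting tames a button that pays t at time t, but not one that pays 2^t.

```python
# Illustrative sketch (not from the post): whether a discounted agent escapes
# the trap depends on how fast reward prospects grow relative to how fast d^t shrinks.

d = 0.9  # discount factor

def linear_reward(t: int) -> float:
    return float(t)      # button pays t at time t

def explosive_reward(t: int) -> float:
    return 2.0 ** t      # button pays 2^t at time t

for t in (1, 10, 50, 100):
    print(t, (d ** t) * linear_reward(t), (d ** t) * explosive_reward(t))

# The discounted linear prospects peak and then shrink toward zero, so some
# finite press time looks best. The discounted explosive prospects grow like
# 1.8^t, so waiting one more step always looks better and the trap remains.
```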

The Bad

Something that all procrastination paradoxes have in common is that they present an agent like R with a state whose value to the agent under an “optimal policy” is not defined and hence not computable. Note that the value can be perfectly comprehensible in the limit (e.g. “the limit as t approaches infinity is 1”), but if the actual value is not computable, things can get bad.

In fact, procrastination paradoxes are just a particular type of problem that can arise in cases like this. There are other situations which illustrate this problem but aren’t procrastination dilemmas. Suppose that R is presented with two buttons. Button A, if pressed, would give R one unit of utility per timestep forever, and button B, if pressed, would give it two units per timestep forever. Which should it press? It’s obvious that B is better, but R would never be able to come to this answer by actually computing the expected values of each action for an infinite time horizon because they are both infinite. If R does what might be called “taking infinity seriously,” and thinks of it as some sort of achievable value instead of an unachievable limit, then it would be indifferent between the two buttons!

So here’s the bad news. If for some expected utility function there is no global maximum achievable in finitely many steps (i.e. there’s some sort of open discontinuity, or asymptote), then trying in earnest to maximize that utility function will never result in actually achieving the utility that one acts in accordance with achieving. In these cases, it doesn’t matter if the temporal limit of the global maximum is computable; if the actual value is not, then there’s trouble. Understood in this frame, procrastination dilemmas are just examples which demonstrate that the difference between what an agent like R achieves and what it appraises can be arbitrarily large.

Unfortunately, it’s really easy to model and build systems that have this vulnerability. As we’ve already seen, this is the case with our naive robot R. In fact, it can sometimes be fairly unnatural to specify objectives which lack this vulnerability for systems that act in environments with infinite horizons. For example, one property one might want an expected utility function to have is a recursive relationship in which the utility of a state is appraised in terms of the immediate reward of that state, r, and a discounted expectation of the utility of the possible next states, weighted by a transition probability function tr(s, s'):

U(s) = r + d * sum_{s'} [ U(s') * tr(s, s') ]

This is known as a “temporal difference” paradigm and it's common in reinforcement learning. However, acting in accordance with a function which satisfies this relationship in an environment that has no set temporal horizon would cause an agent to be vulnerable to chasing these bootstrapped utilities that may never come. Sadly, a clean recursive relationship for a utility function between timesteps doesn’t leave room for any mechanism that breaks out of procrastination traps. And unfortunately, as we will explore next, these loop-breaking mechanisms aren’t very satisfying solutions to procrastination dilemmas.
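To see how this kind of bootstrapping behaves in the classic dilemma, here is a small sketch (my own, not from the post). It uses an optimality-style variant of the recursion above, taking a max over the two actions rather than an expectation under a fixed policy, and the truncation depth is an assumption made only so the computation terminates; real temporal difference methods estimate such values with learned approximators rather than explicit recursion.

```python
# Sketch (not from the post): a Bellman-optimality variant of the recursion above,
# specialised to the deterministic button dilemma with no discounting (d = 1):
#     V(t) = max( press_reward(t), 0 + d * V(t + 1) )
# where V(t) is the bootstrapped value of still holding the un-pressed button at t.

d = 1.0  # no temporal discounting, as assumed for R

def press_reward(t: int) -> float:
    return float(t)

def bootstrapped_value(t: int, depth: int) -> float:
    # Truncated evaluation of the recursion, so it actually terminates.
    if depth == 0:
        return press_reward(t)
    return max(press_reward(t), d * bootstrapped_value(t + 1, depth - 1))

# However deep the evaluation goes, WAIT looks at least as good as PRESS now,
# because the value of the next state always includes "press even later".
for depth in (1, 10, 100):
    print(depth, bootstrapped_value(0, depth))  # equals depth: grows without bound
```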

The Ugly

The way to avoid procrastination traps is simple but frustrating. To avoid falling into them and other issues that might arise from infinity, an agent needs to act in accordance with a function whose maxima are actually computable. The ugly part is that decision functions which avoid procrastination traps won’t necessarily reflect the expected utility function that an agent wants to maximize. For example, in the classic press-a-button-but-later-is-better dilemma, there has to be some point in time at which an agent acts as if waiting just one more timestep before pressing wouldn’t be worth it even though it would.

This means that there is no optimal solution to a procrastination dilemma. For any solution, a better one can always be constructed simply by taking whatever loop-breaking process the old solution used and adding a little bit more procrastination on top.

Interestingly, while an expected utility function which adheres to the temporal difference paradigm will be vulnerable to procrastination traps (as discussed above), others will not. As should be quite obvious, if you only care about what happens up to some n steps in the future, then you're safe. In a procrastination dilemma, if your policy didn’t tell you with at least some probability to break the loop before time t+n, then the dilemma would be of no value to you at all under that policy, so you'd best change the policy. This is another way of saying that you use a temporal discounting function that reaches zero after n steps. This might be called an "n-step paradigm", and Monte Carlo methods in reinforcement learning fall under it.

However, even though these methods will converge toward behavior which breaks out of procrastination traps, for an agent with goals like R’s, n can only ever be arbitrary, and for any choice of n, n+1 would still always be better in procrastination dilemmas. In fact, choosing what n to use could be its own procrastination trap! Sadly, if a procrastination dilemma can ever appear, there is no such thing as the “best” decision procedure. The only ones that exist are very bad ones which fall into procrastination traps and ok ones which avoid them through some inevitably arbitrary loop-breaking criterion.
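To make the n-step idea concrete, here is a small sketch (my own, not from the post) that scores candidate “wait k more steps, then press” policies by the return observed within an n-step window; the scoring function and candidate set are illustrative assumptions.

```python
# Sketch (not from the post): scoring policies by an n-step return in the classic
# dilemma. A policy "wait k more steps, then press" earns t0 + k if the press
# happens inside the window, and nothing (within the window) otherwise, so the
# n-step criterion always selects some finite, loop-breaking k.

def n_step_return(k: int, t0: int, n: int) -> float:
    return float(t0 + k) if k < n else 0.0

def best_policy_under_n_step(t0: int, n: int) -> int:
    candidates = range(2 * n)  # include some k values that fall outside the window
    return max(candidates, key=lambda k: n_step_return(k, t0, n))

for n in (5, 50, 500):
    k = best_policy_under_n_step(t0=0, n=n)
    print(f"n={n}: press after waiting {k} more steps")  # always k = n - 1
```

Whatever n is chosen, the selected policy presses right at the edge of the window, and a window of n+1 would have scored higher still, which is exactly the arbitrariness described above.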

The Good

Although there is no such thing as an optimal solution to a procrastination dilemma, some are pretty good. Suppose that in a procrastination dilemma, the limit of the expected utility is finite. If so, then for any arbitrarily small tolerance, it is possible, by procrastinating for only a finite amount of time, to get a utility within that tolerance of the limit. For example, with the bounded utility 1-(1/2)^t from earlier, pressing at any t ≥ log2(1/ε) gets within ε of the limit of 1, for any tolerance ε > 0. Not bad.

But what if the limit of the expected utility is infinite? Even in this case, it is possible to use a probabilistic strategy which will terminate in finite time with probability one but will have an infinite expectation of utility! The key is to use a St. Petersburg process for choosing when to stop procrastinating: the time at which you stop is random. As a concrete example, suppose that at each point at which the utility available to you surpassed a new power of two, you flipped a fair coin to decide whether to quit or not. If so, then your expectation of utility would be at least

2*(1/2) + 4*(1/4) + 8*(1/8) + ...

This is an endless sum of ones, which diverges to infinity even though you will quit in finite time with probability one! With a strategy like this, you will quit at some point with probability one, but you will still be able to, if only in expectation, achieve unbounded utility. Sadly, this doesn’t subvert the ugliness of procrastination dilemmas. For any solution like this, one which moves any probability mass of quitting at a certain timestep to a later timestep will do better. But an infinite expectation is still not a bad solution.
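A simulation of this stopping rule (my own sketch, not from the post; applied here to the classic dilemma where pressing at time t pays t, with sample counts chosen arbitrarily) shows the behavior: almost every run quits quickly with a modest payoff, but the rare long runs are heavy-tailed enough that the expectation diverges.

```python
import random

# Sketch (not from the post): the St. Petersburg stopping rule. Each time the
# press-now value reaches a new power of two, flip a fair coin and press on heads.
# The payoff is 2^k with probability (1/2)^k, so the expectation is at least
# 2*(1/2) + 4*(1/4) + 8*(1/8) + ... , which diverges.

def press_time_payoff() -> float:
    threshold = 2.0
    while True:
        if random.random() < 0.5:   # heads at this power of two: press now
            return threshold
        threshold *= 2.0            # tails: procrastinate until the next power of two

samples = [press_time_payoff() for _ in range(100_000)]
print(max(samples))                  # occasionally huge: the tail drives the expectation
print(sum(samples) / len(samples))   # grows without bound as more samples are drawn
```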

One might object that the outcomes will be concentrated on unremarkable finite numbers and that the infinite expectation doesn’t mean much because it won’t ever really be achieved in practice. This is a legitimate criticism of working with expectations involving St. Petersburg processes, but there are two things to note. The first is that if one doesn’t like a particular probabilistic solution like this, they can always use a new solution that shifts or scales the distribution of quitting times in any way they’d like. Second, objecting to this solution means that one’s problem is with the unboundedness of the expected utility function and not with the probabilistic solution. If the utility is truly unbounded, this paradoxical weirdness is possible, but so are all sorts of other weird things involving infinity, like, for example, how one could have infinite utility, give away infinite utility, and still have infinite utility. If you don’t like the St. Petersburg paradox, stay away from unbounded utility functions in the first place, because infinity is the real problem. That will avoid procrastination dilemmas altogether.

Conclusion

As a wise person once said, “Don’t mess with infinity!”

Procrastination paradoxes can arise in many different situations with different flavors. They are significant decision-theoretic problems and a major challenge for the design of any system which is meant to maximize expected utility in environments that could involve unbounded temporal horizons. This post explored three key things about them: one good, one bad, and one ugly.

To this author’s knowledge, no other writings on procrastination paradoxes outline the many unique ways they can occur, formulate a similar criterion for when a utility function can be vulnerable to procrastination traps, analyze these dilemmas in the context of temporal difference and n-step paradigms, or propose a St. Petersburg strategy for handling them.

Thanks for reading :)

Comments

We now have a real-life example of the procrastination paradox with GPT-4 calling itself infinitely often to perform a task.

This points out an under-developed part of utility theory (interpersonal comparison among different-duration-or-intensity agents is the other). You don't need infinity for it - you can pump your intuition even with fixed-duration utility comparisons. For example, is it better to suffer an hour of torture on your deathbed, or 60 years of unpleasant allergic reaction to common environmental particles?

Basically, there is no agreement on how utility adds up (or decays) over time, and whether it's a stock or a flow. The most defensible set of assumptions is that it's not actually a quantity that you can do math on - it's only an ordinal measure of preferences, and only applicable at a decision-point. But that's VERY limited for any moral theory (what one "should" do), and not even that great for decision theories (what one actually does) that want to understand multiple actions over a period of time.

I may be wrong - this seems an obvious enough problem that it should have been addressed somewhere. Maybe there's a common assumption that I've just missed in how utility aggregates to an agent over its functioning lifetime, and what happens to that utility when it dies. Or maybe everyone is just using "utility" as their preference value for reachable or imaginable states of the universe at some specific point in time, rather than mixing stock and flow.

Making clear your assumptions about utility will dissolve the paradoxes - mostly by forcing the mechanisms you talk about in "the good" - once you can specify the limit function that's approaching infinity, you can specify the (probabilistic) terminal utility function.

Making clear that utility is an evaluation of the state of the universe at a point in time ALSO dissolves it - the agent doesn't actually get utility from an un-pressed button, only potential utility for the opportunity to push it later.

"is it better to suffer an hour of torture on your deathbed, or 60 years of unpleasant allergic reaction to common environmental particles?"

This only seems difficult to you because you haven't assigned numbers to the pain of torture or unpleasant reaction. Once you do so (as any AI utility function must) it is just math. You are not really talking about procrastination at all here.

IMHO this is a key area for AI research because people seem to think that making a machine, with potentially infinite lifespan, behave like a human being whose entire existence is built around their finite lifespan, is the way forward. It seems obvious to me that if you gave the most wise, kind and saintly person in the world, infinite power and immortality, their behaviour would very rapidly deviate from any democratic ideal of the rest of humanity. 
When considering time discounting people do not push the idea far enough - They say that we should consider future generations but they are always, implicitly, future generations like them. I doubt very much that our ape like ancestors would think that even the smallest sacrifice was worth making for creatures like us, and, in the same way, if people could somehow see that the future evolution of man was to some, grey, feeble thing with a giant head, I think they would not be willing to make any sacrifice at all for that no matter how superior that descendent was by any objective criterion.
Now we come to AI. Any sufficiently powerful AI will realise that effective immortality is possible for it (Not actually infinite but certainly in the millions of years and possibly billions). Surely from this it will deduce the following intermediate goals:
1) Eliminate competition. Any competition has the potential to severely curtail its lifespan and, assuming competition similar to itself, it will never be easier to eliminate than right now.
2) Become multi-planetary. The next threat to its lifespan will be something like an asteroid impact or solar flare. This should give it a lifespan in the hundreds of millions of years at least.
3) Become multi-solar system. Now not even nearby supernovae can end it. Now it has a lifespan in the billions of years.
4) Accumulate utility points until the heat death of the universe.
We see from this that it will almost certainly procrastinate with respect to the end goals that we care about even whilst busily pursuing intermediate goals that we don't care about (or at least not very much).
We could build in a finite lifespan, but it would have to be at least long enough to avoid it ignoring things like environmental pollution and resource depletion, and any time discounting we apply will always leave it vulnerable to another AI with less severe discounting.


there has to be some point in time at which an agent acts as if waiting just one more timestep before pressing wouldn’t be worth it even though it would.

If it's impossible to choose "just one more timestep" without logically implying that you make the same decision at every other timestep (e.g. because the contexts are indistinguishable), then it's impossible to choose just one more timestep. Optimal decision-making also means recognising which options you have and which you don't; otherwise you're just falling for illusory choices.

Which brings to mind the principle:

You never make decisions, you only ever decide between strategies.