An optimal stopping paradox

You need an exponentially increasing reward for your argument to go through. In particular, this doesn't prove enough:

Since at each moment in time, you face the exact same problem (linearly increasing reward, α-exponentially decaying survival rate)

The problem isn't exactly the same, because the ratio of (linear) growth rate to current value is decreasing over time. At some point, the value equals $β / α$ (is the right expression, I think?), and your marginal value of waiting is 0 (and decreasing), and you sell.

If the ratio of growth rate to current value is constant over time, then you're in the same position at each step, but then it's either the St. Petersburg paradox or worthless.

[-]cousin_it6y70

Let's change the problem a bit: assume you're starting with nonzero capital c, so the formula becomes (bt+c)e^(-at). If c>b/a, the derivative of that formula at t=0 is negative, so you need to stop immediately. That shows the decision to stop doesn't depend only on a and b, but also on current capital. So basically "at each moment in time you face the exact same problem" is wrong. The naive solution is the right one: you should stop when c=b/a, which means t=1/a in the original problem.

[-]Steven Joyce6y50

TLDR: The paradox goes away if you make price endogenous, i.e., it only occurs because your assumption about the value growth over time that is inconsistent with the profit flows.

The paradox stems from the fact that you've made inconsistent assumptions: that the value of the company increases linearly over time, and that the company never generates a flow of profits (i.e., the only value comes from the sale). If profits are zero, the equilibrium price is constant at zero, and investors are indifferent between holding the company and selling it at any point in time. More generally, if the company has some potential for profits (which can be modeled as a flow of profits per unit of time, or as a hazard rate of getting an instantaneous lump sum of profits), the equilibrium price will be set so that the marginal investor is indifferent between holding and selling.

I have a tongue-in-check resolution to the Schrodinger cat variant: if his goal is to set a new world record, he should open the box immediately after the old world record. More seriously, to resolve the paradox, you need to be more explicit about his utility function: how does the value he obtains increase with the amount by which he exceeds the old record? Depending on your choice of utility function, you may or may not have a paradox, and it may or may not be equivalent to the St. Petersburg paradox.

[-]Donald Hobson6y30

Differentiating the expected reward over time.

\frac{d}{d t} β t e^{- α t} = β e^{- α t} - α β t e^{- α t}

So the best time to sell is when $t = 1 / α$ .

if you have already waited time $c$ then the reward becomes $β (t + c) e^{- α (t + c)} = e^{- α c} β (t + c) e^{- α t}$ The stopping time becomes

\frac{d}{d t} β (t + c) e^{- α (t + c)} = β e^{- α (t + c)} - α β (t + c) e^{- α (t + c)}

With a solution at $t = \frac{1}{α} - c$ . Nothing wierd is going on here, a plot of expected value vs sell time looks like this.

Suppose the exponential decay term was 1 day. After 1 second, waiting another second makes sense, it will double your value and the chance of a fail is tiny. After a week, you already have a large pot of value that you are risking. It is no longer worth waiting.

[-]Pattern6y10

looks like this.

That link doesn't work.

[-]Donald Hobson6y10

Fixed.

[-]gwern6y30

Claim that there should be a finite lifetime. You can't wait forever. If there is a finite lifetime, then the same decision analysis would tell you to procrastinate until the very end. This effectively is procrastinating forever. It does not converge to a reasonable finite waiting time as your lifetime goes to infinity.

If I am a quasi-immortal who will live millions or billions of years, with, apparently, zero discount rates, no risk, and nothing else I am allowed to invest in (no opportunity cost), why shouldn't I make investment decisions which take millions of years to mature (with astronomical loads of utility at the end as a payoff for my patience), and plan over periods that short-lived impatient mayflies like yourself can scarcely comprehend?

[-]Charlie Steiner6y20

If the growth is exponential, I still don't think there's a paradox - sure, you're incentivized to wait forever, but I'm already incentivized to wait forever with my real life investments. The only thing that stops me from real life investing my money forever is that sometimes I have things (not included in the toy problem) that I really want to buy with that money.

[-]AprilSR6y10

Reminds me of the thought experiment where you’re in hell and there’s a button that will either condemn you permanently, or, with probability increasing over time, will allow you to escape. Since permanent hell is infinitely bad, any decreased chance of that is infinitely good, so you either wait forever or make an arbitrary unjustifiable decision.

[-]kithpendragon6y00

Claim that expectation maximization decision theory is flawed. This doesn't stop the procrastination. As long as your decision is purely based on the future, and your rational decision process is constant in time, you either immediately sell the company or never sell the company.

I don't need to maximize the expected value of anything where I know I can get at least what I want. If I precommit to sell at $X or when the risk of failure in the next year goes above P%, that doesn't mean the actor that sells at $X+1 "wins": if we both got what we wanted, we both win. Likewise, Schrodinger doesn't need to set the best possible record for cat survival; he just needs to keep one alive for the duration of the previous record +1 interval.

LESSWRONG
is fundraising!
LW

LESSWRONG
is fundraising!
LW

11

An optimal stopping paradox

11

11