Knightian Uncertainty: Bayesian Agents and the MMEU rule

[-]AShepard11y80

I think the analysis in this post (and the others in the sequence) has all been spot on, but I don't know that it is actually all that useful. I'll try to explain why.

This is how I would steel man Sir Percy's decision process (stipulating that Sir Percy himself might not agree):

Most bets are offered because the person offering expects to make a profit. And frequently, they are willing to exploit information that only they have, so they can offer bets that will seem reasonable to me but which are actually unfavorable.

When I am offered a bet where there is some important unknown factor (e.g. which way the coin is weighted, or which urn I am drawing from), I am highly suspicious that the person offering the bet knows something that I don't, even if I don't know where they got their information. Therefore, I will be very reluctant to take such bets.

When faced with this kind of bet, a perfect Bayesian would calculate p(bet is secretly unfair | ambiguous bet is offered) and use that as an input into their expected utility calculations. In almost every situation one might come across, that probability is going to be quite high. Therefore, the general intuition of "don't mess with ambiguous bets - the other guy probably knows something you don't" is a pretty good one.

Of course you can construct thought experiments where p(bet is secretly unfair) is actually 0 and the intuition breaks down. But those situations are very unlikely to come up in reality (unless there are actually a lot of bizarrely generous bookies out there, in which case I should stop typing this and go find them before they run out of money). So while it is technically true that a perfect Bayesian would actually calculate p(bet is secretly unfair | ambiguous bet was offered) in every situation with an ambiguous bet, it seems like a very reasonable shortcut to just assume that probability is high in every situation and save one's cognitive resources for higher impact calculations.
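A minimal sketch of the calculation described above, in case a concrete version helps. Every number here is a made-up assumption (the prior, the two offer likelihoods, and the payoffs), and the function names are hypothetical; the point is only that a high p(bet is secretly unfair | ambiguous bet is offered) can turn a bet that looks favorable on its face into one with negative expected value.

```python
def p_unfair_given_offered(prior_unfair, p_offer_if_unfair, p_offer_if_fair):
    """Bayes' rule: P(bet is secretly unfair | ambiguous bet is offered)."""
    joint_unfair = prior_unfair * p_offer_if_unfair
    joint_fair = (1.0 - prior_unfair) * p_offer_if_fair
    return joint_unfair / (joint_unfair + joint_fair)


def expected_value_of_accepting(p_unfair, ev_if_fair, ev_if_unfair):
    """Mix the two conditional expected values by the posterior."""
    return (1.0 - p_unfair) * ev_if_fair + p_unfair * ev_if_unfair


# Hypothetical inputs: bookies who know something offer ambiguous bets
# far more readily than bookies who do not.
posterior = p_unfair_given_offered(
    prior_unfair=0.5, p_offer_if_unfair=0.9, p_offer_if_fair=0.1
)

# Hypothetical payoffs: slightly positive if the bet is as it appears,
# slightly negative if the bookie has inside information.
ev_accept = expected_value_of_accepting(
    posterior, ev_if_fair=0.05, ev_if_unfair=-0.06
)

print(f"P(unfair | offered) = {posterior:.2f}")
print(f"EV of accepting     = {ev_accept:+.3f}")
```

With these assumed inputs the posterior is high enough that declining (expected value 0) beats accepting, which is the shortcut's conclusion reached the long way round.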

[-]So8res11y50

Thanks! I completely agree that "reject bets offered to you by humans" is a decent heuristic that humans seem to use. I also agree that bet-stigma is a large part of the reason people feel they need something other than Bayesianism (which treats every choice as a bet about which available action is best). These points (and others) are covered in the next post.

In this post, I'm addressing the argument that there are rational preferences that the Bayesian framework cannot, in principle, capture. This addresses a more general concern as to whether Bayesianism captures the intuitive ideal of 'rationality'. Here I'm claiming that, at least, the MMEU rule is no counter-example. The next post will contain my true rejection of the MMEU rule in particular.

[-]VAuroch11y30

It's fairly common in programming, particularly, to care not just about the average case behavior, but the worst case as well. Taken to an extreme, this looks a lot like Caul, but treated as a partial but not overwhelming factor, it seems reasonable and proper.

For example, imagine some algorithm which will be used very frequently, and for which the distribution of inputs is uncertain. The best average-case response time achievable is 125 ms, but this algorithm has high variance such that most of the time it will respond in 120 ms, but a very small proportion of the time it will take 20 full seconds. Another algorithm has average response time 150 ms, and will never take longer than 200 ms. Generally, the second algorithm is a better choice; average-case performance is important, but sacrificing some performance to reduce the variance is worthwhile.

Taking this example to extremes seems to produce Caul-like decisionmaking. I agree that Caul appears insane, but I don't see any way either that this example is wrong, or that the logic breaks down while taking it to extremes.
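To put numbers on the example, here is a rough sketch that backs out how rare the 20-second case has to be. It assumes the first algorithm's response time takes only two values (120 ms or 20 s), which is a simplification of "most of the time"; the names are hypothetical.

```python
import math

FAST_TYPICAL_MS = 120.0    # usual response time of the first algorithm
FAST_WORST_MS = 20_000.0   # its rare slow case
FAST_MEAN_MS = 125.0       # its stated average
STEADY_MEAN_MS = 150.0     # second algorithm's average
STEADY_WORST_MS = 200.0    # second algorithm's hard ceiling


def tail_probability(mean, typical, worst):
    """Solve mean = (1 - p) * typical + p * worst for p."""
    return (mean - typical) / (worst - typical)


def two_point_std(p, typical, worst):
    """Standard deviation of a two-valued distribution."""
    return math.sqrt(p * (1.0 - p)) * (worst - typical)


p_slow = tail_probability(FAST_MEAN_MS, FAST_TYPICAL_MS, FAST_WORST_MS)
fast_std = two_point_std(p_slow, FAST_TYPICAL_MS, FAST_WORST_MS)

print(f"P(slow case)        = {p_slow:.6f} (about 1 in {1 / p_slow:.0f})")
print(f"std of fast version = {fast_std:.1f} ms")
print(f"worst-case ratio    = {FAST_WORST_MS / STEADY_WORST_MS:.0f}x")
```

Under that two-value assumption the slow case happens roughly once in 4,000 calls, yet the standard deviation is larger than the mean and the worst case is 100 times the second algorithm's ceiling.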

[-]CalmCanary11y50

The most obvious explanation for this is that utility is not a linear function of response time: the algorithm taking 20 s is very, very bad, and losing 25 ms on average is worthwhile to ensure that this never happens. Consider that if the algorithm is just doing something immediately profitable with no interactions with anything else (e.g. producing some cryptocurrency), the first algorithm is clearly better (assuming you are just trying to maximize expected profit), since on the rare occasions when it takes 20 s, you just have to wait over 160 times as long for your unit of profit. This suggests that the only reason the second algorithm is typically preferred is that most programs do have to interact with other things, and an extremely long response time will break everything. I don't think any more convoluted decision theoretic reasoning is necessary to justify this.
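A small sketch of this point, under stated assumptions: the first algorithm is modeled as taking either 120 ms or 20 s, the second as always taking 150 ms, and the "breaks everything" cost is a hypothetical penalty charged whenever a response exceeds a hypothetical 1-second deadline. None of these modeling choices come from the thread.

```python
P_SLOW = 5.0 / 19_880.0  # tail probability implied by a 125 ms mean

# Each distribution is a list of (probability, response_time_ms) pairs.
fast = [(1.0 - P_SLOW, 120.0), (P_SLOW, 20_000.0)]
steady = [(1.0, 150.0)]  # assumption: the second algorithm always takes 150 ms


def expected_utility(dist, utility):
    """Probability-weighted average of utility over response times."""
    return sum(p * utility(ms) for p, ms in dist)


def linear_utility(ms):
    """Only total waiting time matters (the 'cryptocurrency' case)."""
    return -ms


def deadline_utility(ms, deadline_ms=1_000.0, penalty=1_000_000.0):
    """Waiting time plus a large hypothetical cost for missing a deadline."""
    return -ms - (penalty if ms > deadline_ms else 0.0)


for name, utility in [("linear", linear_utility), ("deadline", deadline_utility)]:
    eu_fast = expected_utility(fast, utility)
    eu_steady = expected_utility(steady, utility)
    winner = "fast" if eu_fast > eu_steady else "steady"
    print(f"{name:8s} fast={eu_fast:9.1f} steady={eu_steady:9.1f} -> {winner}")
```

Both rows are plain expected-utility maximization; only the utility function changes, and with it the preferred algorithm.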

[-]VAuroch11y10

True, but even in cases where it won't break everything, this is still valued. Consistency is a virtue even if inconsistency won't break anything. And it clearly breaks down in the extreme case where it becomes Caul, but I can't come up with a compelling reason why it should break down.

My best guess: the factor being valued here is the variance. Low variance generally increases utility, because predictability is valuable in enabling better expected utility calculations for other connected decisions. There is no hard limit on how much this can matter relative to the average case, but as the average cases diverge, so that the low-variance version becomes worse than a greater and greater fraction of the high-variance cases, preferring it remains technically rational but its implicit prior approaches an insane prior such as that of Caul or Perry.

I think this would imply that for an unbounded perfect Bayesian, there is no value to low variance outside of nonlinear utility dependence, but that for bounded reasoners, there is some cutoff where making concessions to predictability despite loss of average-case utility is useful on balance.

[-]Optimization Process4y20

My attempted condensation, in case it helps future generations (or in case somebody wants to set me straight): here's my understanding of the "pay $0.50 to win $1.10 if you correctly guess the next flip of a coin that's weighted either 40% or 60% Heads" game:

You, a traditional Bayesian, say, "My priors are 50/50 on which bias the coin has. So, I'm playing this single-player 'game':

"I see that my highest-EV option is to play, betting on either H or T, doesn't matter."

Perry says, "I'm playing this zero-sum multi-player game, where my 'Knightian uncertainty' represents a layer in the decision tree where the Devil makes a decision:

"By minimax, I see that my highest-EV option is to not play."

...and the difference between Perry and Caul seems purely philosophical: I think they always make the same decisions.
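A rough sketch of the two calculations, for anyone who wants to check the arithmetic. It assumes that "win $1.10" means a gross payout of $1.10 against the $0.50 stake, that the Devil picks the coin's bias after seeing the guess, and that only pure strategies are considered; the function names are hypothetical.

```python
STAKE = 0.50        # cost to play
PAYOUT = 1.10       # assumed gross payout on a correct guess
BIASES = (0.4, 0.6) # the two possible values of P(heads)
ACTIONS = ("H", "T", "pass")


def ev_of_bet(p_heads, guess):
    """Expected net winnings of guessing `guess` on a coin with known bias."""
    p_win = p_heads if guess == "H" else 1.0 - p_heads
    return p_win * PAYOUT - STAKE


def bayesian_ev(guess, prior=(0.5, 0.5)):
    """Average over the biases, weighted by the prior."""
    return sum(w * ev_of_bet(b, guess) for w, b in zip(prior, BIASES))


def maximin_ev(guess):
    """Worst case over the biases: the Devil picks after seeing the guess."""
    return min(ev_of_bet(b, guess) for b in BIASES)


def value(action, rule):
    """Not playing is worth exactly 0 under either rule."""
    return 0.0 if action == "pass" else rule(action)


bayes_choice = max(ACTIONS, key=lambda a: value(a, bayesian_ev))
maximin_choice = max(ACTIONS, key=lambda a: value(a, maximin_ev))

for a in ACTIONS:
    print(f"{a:4s} bayes={value(a, bayesian_ev):+.2f} maximin={value(a, maximin_ev):+.2f}")
print("Bayesian picks:", bayes_choice, "| maximin picks:", maximin_choice)
```

Under these assumptions the 50/50 Bayesian sees a small positive expected value for betting on either side, while the maximin player sees a negative worst case for both bets and declines.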

[-]halcyon11y20

No matter how obvious your reasoning may appear to you, there is someone out there stupid enough to have thought the contrary. Believe it or not, this series goes a long way towards dissipating my pessimism about the world. My subconscious really believed it is a fact that on average, nature tends to destroy our mortal ambitions, and that's why it is dangerous to Tempt Fate.

I have always known this is a theological outlook, but I tried to deal with it by avoiding thoughts like that rather than marshaling positive arguments against it. After reading this, I consciously understand, to a significantly greater degree, why it doesn't actually make sense to generalize those thought processes for use in reasoning. I like this much better than just intuitively labeling them as low status. Thank you.

[-]AlexMennen11y20

Cautious Caul is interesting because I actually do expect that my utility is nonlinear with respect to measure. For instance, if I got to choose between either the entire universe getting destroyed with probability 1/2 or half of Everett branches getting destroyed with probability 1, I would much prefer the second one. That said, in practice, I don't expect to be able to make any use of the distinction between measure in a multiverse and probability.
