Tyranny of the Epistemic Majority

[-]habryka2y*2724Review for 2022 Review

I put decent probability on this sequence (of which I think this is the best post) being the most important contribution of 2022. I am however really not confident of that, and I do feel a bit stuck on how to figure out where to apply and how to confirm the validity of ideas in this sequence.

Despite the abstract nature, I think if there are indeed arguments to do something closer to Kelly betting with one's resources, even in the absence of logarithmic returns to investment, then that would definitely have huge effects on how I think about my own life's plans, and about how humanity should allocate its resources.

Separately, I also think this sequence is pushing on a bunch of important seams in my model of agency and utility maximization in a way that I expect to become relevant to understanding the behavior of superintelligent systems, though I am even less confident of this than the rest of this review.

I do feel a sense of sadness that I haven't seen more built on the ideas of this sequence, or seen people give their own take on it. I certainly feel a sense that I would benefit a lot if I saw how the ideas in this sequence landed with people, and would appreciate figuring out the implications of the proof sketches outlined here.

[-]Jan_Kulveit2y4-3

+1 on the sequence being on the best things in 2022.

You may enjoy additional/somewhat different take on this from population/evolutionary biology (and here). (To translate the map you can think about yourself as the population of myselves. Or, in the opposite direction, from a gene-centric perspective it obviously makes sense to think about the population as a population of selves)

Part of the irony here is evolution landed on the broadly sensible solution (geometric rationality). Hower, after almost every human doing the theory got somewhat confused by the additive linear EV rationality maths, what most animals and also often humans on S1 level do got interpreted as 'cognitive bias' - in the spirit of assuming obviously stupid evolution not being able to figure out linear argmax over utility algorithms in a a few billion years.

I guess not much engagement is caused by
- the relation between 'additive' vs 'multiplicative' picture being deceptively simple in formal way
- the conceptual understanding of what's going on and why being quite tricky; one reason is I guess our S1 / brain hardware runs almost entirely in the multiplicative / log world; people train their S2 understanding on linear additive picture; as Scott explains, maths formalism fails us

[-]evhub3y249

Sort of a side note, but one takeaway I've had from the whole FTX fiasco—particularly given SBF's comments here—is that being really careful about teaching and understanding Kelly betting is more important than I would have thought.

[-]Scott Garrabrant3y1715

Yep, I had been wanting to write this sequence for months, but FTX caused me to sprint for a week until it was all done, because it seems like now is the time people are especially hungry for this theory.

This sequence was going to be my main priority for December (and Kelly betting was going to be my most central example). I thought the main reason EAs needed it was to be able to not feel guilting every time they stop to have fun, to not get Pascal's mugged by calculations about the amount of matter in the universe, to not let longtermism take over the entire EA movement, to have fewer internal politics related issues, and to be more scout-mindset-like to take Julia's term. The Kelly betting was supposed to be more of an analogy about putting all your eggs in one basket.

Then, I suddenly quickly updated on how much the EA community needed these memes.

[-]Trinley Goldenberg3y72

You, however, do not know if it is a fair coin, and are offering me a fair bet. I only have 100 dollars to my name, and I am can bet as much as I want (up to 100 dollars) in either direction at even odds.
If I bet 100 dollars on heads, heads-me gets 200 dollars, and tails-me gets nothing. If I bet 100 dollars on tails, tails-me gets 200 dollars, and heads me gets nothing. If I bet nothing, both versions of me get 100 dollars.
However, every dollar in the hands of heads-me is worth 1.5 times as much as a dollar in the hands of tails-me, since heads-me exists 1.5 times as much. (I am ignoring here any diminishing returns in my value of money.)
Thus, to maximize value I should bet 100 dollars on heads. However, maybe it is better to think of tails-me as the rightful owner of 40 percent of my resources. When I bet 100 dollars on heads, I am seizing money from tails-me for the greater good, since heads-me has the (proportionally greater) existence necessary to better take advantage of it.
Alternatively, I could say that since 60 percent of me is heads-me, heads me should only control 60 dollars, which can be bet on heads. Tails me should control 40 dollars, which can be bet on tails. These two bets partially cancel each other out, and the net result is that I bet 20 dollars on heads.
If you are especially fast at maximizing expected logarithms, you might see where this is going.

Wow I have been looking for an intuitive explanation of Kelly Betting for years, and this is the first one that really hit from an intuitive mathematical perspective.

[-]Scott Garrabrant3y20

Thanks.

Be warned that this explanation only applies if the environment is offering both sides of every event at the same odds.

[-]Terence Coelho2y30

Is there no way to salvage it via a Nash bargaining argument if the odds are different? Or at least, deal with scenarios where you have x:1 and 0:1 odds (i.e. you can only bet on heads)?

[-]Scott Garrabrant3y30

Where by the "same odds," I mean if you can take 3:2 for True, you can take 2:3 for False.

[-]Trinley Goldenberg3y21

Yes, I got down to the Nash Bargaining part which is a bit harder and got confused again, but this helped as a very simple math intuition for why to Kelly Bet, if not how to calculate it in most real world betting situation.

[-]Caspar Oesterheld3y3-6

Nice!

I'd be interested in learning more about your views on some of the tangents:

>Utilities are bounded.

Why? It seems easy to imagine expected utility maximizers whose behavior can only be described with unbounded utility functions, for example.

>I think many phenomena that get labeled as politics are actually about fighting over where to draw the boundaries.

I suppose there are cases where the connection is very direct (drawing district boundaries, forming coalitions for governments). But can you say more about what you have in mind here?

Also:

>Not, they are in a positive sum

I assume the first word is a typo. (In particular, it's one that might make the post less readable, so perhaps worth correcting.)

[-]Scott Garrabrant3y150

1) So, VNM utility theorem, assuming the space of lotteries is closed under arbitrary mixtures, where you e.g. you can specify a sequence of lotteries, and take the mixture that assigns probability to the $n$ th lottery. implies bounded utilities, since otherwise, you can get a lottery with infinite utility, and violate continuity.

I think there are some reasons to not want to allow arbitrary lotteries, and then, you could technically have unbounded utility, but then you get a utility function that can only assign utilities in such a way that you can't set up any St, Petersburg paradoxes. I think that this move makes sense, but it means you have to integrate your probability and utility, and modulo actually thinking of them as integrated in this way, I think utilities are bounded is a good approximation.

I think that almost everyone who talks about unbounded utility functions is not actually doing the above, and is actually violating the VNM axioms, and for me, the word "utility" means VNM.

2) I think that a lot of the behavior referred to as "soldier mindset" as opposed to "scout mindset" is related to the kind of boundaries we are talking about here. I think that e.g. politics with EA feels like it has a lot to do with coalition building, and conflicts about transparency, which fit int this soldier mindset thing.

I think that a lot of politics is about conflicts between respecting the rights of the individuals vs the rights of groups comprised of those individuals. This is something like saying to what extent do we want to think of various different levels as "people." Across humans, you can get things about state's rights, corporation's rights, family's rights, immigration. Within humans, you can get questions about how much you want to hold adults accountable to mistakes they might make as children. I don't know, I am hesitant to get into object level politics.

3)Yeah, I will fix it.

[-]Lorxus2y21

Firstly, your utility is not logarithmic in dollars. Utilities are bounded.

Ehn, the universe is finite and there's no way we can get anywhere near a dollar per atom of value out of the universe. There's well less than particles in the universe and $ln (10^{100}) \sim 230$ , so if you were wrong about utility not being O(log(money)) because it has to be bounded, how could you ever tell even in principle? (That said I do think you're right, but that's because economium is likely as edible as dollar bills are.)

[-]Slider3y20

View of income disparity as a problem that overrides expected wealth among your possible selfs is a very interesting angle.

Does this mean that there are voting schemes that are structurally impossible to gerrymander? Do they inevitably fail other voting desiderata?

Wouldn't it also make sense to treat the outside-view to be updated. To treat yourself as beating the market if you are beating the market. Or is it that "unknown unknowns" and "I know that I don't know" kind of factors never shift? I read that the recommendation is that when you are wrong one should be less agentic and do the null behaviour (kind of like the action version of null hypothesis). The angle I used to apply is that if you are wrong you should update to be more right. But this recommendation works even if you don't know how to improve. Halt and do what you were previously doing instead of totally freezing.

So am I correct that taking Kelly betting seriously leads to recommendation that St. Petersburg should be rejected? I am also thinking of a continous version of the setup where at each timestep you can stake the amount of money you want for double or nothing. If double is a tiinsy tiny bit more probable than nothing you only stake very little money. And at exactly even odds you stake exactly 0 money. Is this not a solution to the Petersburg blowup?

Seems there are recommendations that are in violation of maximising for expected value and for clarity of myself and other I will restate more explicitly. You have 100 money and are considering two bets. Bet A is 2/3 for 2.1 (2+0.1) times the bet and 1/3 chance of nothing, the bet taking degree is 66.66.. . Bet B is 2/6 chance of 4.2 (2*(2+0.1)) times the bet and 4/6 nothing, the bet taking degree is 33.33. The expectation on both is 1.4 but the bets don't get treated the same, we are not ambivalent between them. We prefer A and can do so without providing a risk tolerance profile. This is probably mostly additional structure on top, most comparisons that go otherwise are overriding ambivalences to favour one side. Same expected values point in the same direction but not at the same magnitude.

It is interesting to think whether there are exceptions and this new scheme would recommend contrary to pure EV expectation. It would seem that less volatile scenarios move faster with difference in outcome intensity. As with expectation value 1.4 we had two bets with 66.66 and 33.33. Are there any bets that have bet taking degrees between those that have a lesser expected value?

I suspect the case is that provided each bet alone we would engage then to those 66.66 and 33.33 degrees but together we are not putting in the whole 100 (66.66+33.33) if offered together.

If some dyper scenario happens at a probability p, then even if the utility shoots throught the roof or is roofless, the maximum that scenario can command is that p fraction and can't go over that. You are not allowed to bet 1000 out of 100. And you can't recommend harder than "100% yes".

LESSWRONG
is fundraising!
LW

LESSWRONG
is fundraising!
LW

191

Tyranny of the Epistemic Majority

191

191

The Steward of Myselves

Compositionality

Bayesian Updating

Bargaining with Myself

Kelly Betting

Betting Even Less

Betting Less Still