Anthropics is pretty normal

Stuart_Armstrong

Crossposted from the AI Alignment Forum. May contain more technical jargon than usual.

In this post, I'll defend these claims:

The common understanding of anthropic reasoning is wrong.
There are interesting reasons for that error.
Anthropic updating is the same as normal updating. For example, our survival is evidence that the world is safer than we thought.
Full anthropic reasoning is actually pretty normal and easy in most cases.
We don't need to define some special class of "observers" to do anthropic reasoning.

Common understanding of anthropics

By the "common understanding", I mean something like:

$A_{1}$ : "If we had died in an existential catastrophe, we wouldn't be around to observe it. Hence we can't conclude anything from our survival about the odds of existential risk."

Sometimes "can't conclude anything" is weakened to allow some weak updating.

Now $A_{1}$ sounds reasonable. But consider instead:

$A_{2}$ : "If we had won the lottery, a lottery-losing us wouldn't be around to observe it. Hence we can't conclude anything from our loss about the odds of winning the lottery."

Formally the two arguments have the same structure. Now, people might start objecting that the difference between an observer or no observers is not the same thing as the difference between an observer seeing a loss and one not seeing it. And then I might respond by slicing into the definition of observer, creating "half-observers", and moving smoothly between observer and non-observer...

But that's the wrong response, on both of our parts (shame on you, hypothetical strawman, for reasoning like that!). The key question is not "can we justify that $A_{1}$ and $A_{2}$ might be different?" Because we can always justify something like that if we work on it hard enough.

Instead we should be asking "1) Why do we find $A_{1}$ convincing?", and "2) Do we have reasons to believe $A_{1}$ is wrong?"

My answer to 2) is "yes, of course; $A_{2}$ is clearly wrong, and $A_{1}$ is formally structured the same way, so there must be a paradox lurking there" (spoiler: there is a paradox lurking there).

For 1), I introspected on why I had been lead astray for so long, and here are some of the reasons why we might believe $A_{1}$ (or at least think that anthropic reasoning is hard):

A desire to not go overboard.
Conditional probabilities being confusing if not written out in full.
An awareness of survivorship bias.
Anthropic probability theories seem confusing and full of paradoxes.

The desire not to go overboard is the easiest to understand: to those who might say "so, it turns out we were safe after all!", we can answer, correctly "not necessarily; we might just have got lucky". And that is correct; we might have got lucky. But it's also some evidence, at least, that maybe we were safer than we thought.

Updating and conditional probability

The anthropic principle

Let's go back to the idea that started this all: the anthropic principle. Looking at the Wikipedia article on it, there seems to be a bunch of different principles; here's my attempt at putting them in a table:

$\begin{matrix} The basic anthropic principle & P (the universe allows life | we live) \approx 1 Weak AP (Carter) & P (our space-time location allows life | our space-time location has observers) \approx 1 Weak AP & P (the universe allows life | (Barrow and Tipler) & carbon-based life exists) \approx 1 Strong AP (Carter) & P (the universe allows life | observers exist) \approx 1 Strong AP & P (the universe/multiverse allows life) = 1 (Barrow and Tipler) \end{matrix}$

The Barrow and Tipler Strong AP is in my view wrong (I think they're mixing frequentist and Bayesian probability, if they have to posit an actual multiverse). But the other ones seem trivially true, just as matter of conditional probability. And the differences between them are unimportant: whether it's looking at the whole universe, or our space-time location, and at observers in general, carbon based life, or just ourselves. All of these are equally true, and it seems to me that people arguing about different versions of the AP just haven't seen them written down as they are here, where it's clear that they are all of a similar format:

Conditional on some form of life (which includes us) existing, it is almost certain that anything necessary for that life, has actually happened.

Conditional probabilities

Now look back at $A_{1}$ . It looks similar, but it isn't; the conditionals are used a bit differently. What $A_{1}$ says is "conditional on us surviving, the probability of an existential catastrophe having happened is zero. And this probability is independent of whether the world is safe or not. Hence we can't deduce whether the world is safe or not".

All the mischief is in that word "hence". Conditional probabilities are tricky and counterintuitive; to pick an example from logical uncertainty, $P ($ ''0=1'' $|$ "0=0" $) = 0$ while $P ($ "0=0" $|$ "0=1" $) = 1$ . And, in general, you can't move "is independent of" from one side of the conditional to the other.

So these probabilities have to be computed explicitly - though you can get a hint of the potential mistake by considering "conditional on us seeing ourselves lose the lottery, the probability of us winning the lottery is zero. And this probability is independent of the odds of the lottery. Hence we can't say anything about the odds of the lottery".

The power of Bayes compels you!

I have actually computed the odds explicitly, using Bayesian reasoning, to show that statements like $A_{1}$ are wrong. But let's invert the problem: if we assumed $A_{1}$ was true, what would that imply?

Imagine that the world is either $safe$ (low risk of existential catastrophe) or $dangerous$ (high risk of existential catastrophe). Then $A_{1}$ would argue that $P (safe | we survived)$ is the same as $P (safe)$ : our survival provides no evidence of the world being safe. Then applying almighty Bayes:

$P (we survived | safe) = \frac{P (safe | we survived) \times P (we survived)}{P (safe)} = P (we survived) .$

The same reasoning shows $P (we survived) = P (we survived | dangerous)$ . Therefore $A_{1}$ would force us to conclude that the safe and the dangerous worlds have exactly the same level of risk!

Similar problems arise if we try and use weaker versions of $A_{1}$ - maybe our survival is some evidence, just not strong evidence. But Bayes will still hit us, and force us to change our values of terms like $P (we survived | dangerous)$ . There are simply not enough degrees of freedom in the system for anthropic updating to be done any way other than in the normal way.

Survivorship bias and special Earths

There are clearly issues of selection bias and survivorship bias in anthropic reasoning. We can't conclude from seeing all the life around us, that the universe is full of life.

But that doesn't stop us from updating normally, it just means we have to update on exactly what we know: not on the information that we observe, but on the fact that we observe it.

Take a classical example of survivorship bias: hedge funds success. We see a lot of successful hedge funds, and we therefore conclude that hedge funds are generally successful. But that conclusion is mistaken, because the least successful hedge funds tend to go bankrupt, leaving us with a skewed sample. So if we noticed "most hedge funds I can see are successful", concluded "most hedge funds are successful", and updated on that... then we'd be wrong.

Similarly, if we noticed a lot of life around us, concluded "life is common", and updated on that, we'd be wrong. If, however, we instead concluded "life is common on at least one planet" and updated on that, then we would be correct.

Notice how specific the update requirements can be. Suppose we had three theories. Theory $T_{1}$ gives a $25 %$ probability to life existing on any given planet. Theory $T_{2}$ gives a $50 %$ probability for life existing on any Earth-like planet, and $0 %$ for other planets. While theory $T_{3}$ gives a $100 %$ probability to life existing on Earth, specifically, and $0 %$ to life existing anywhere else.

Now, the different $T_{i}$ might have different priors. But updating them on the fact of our existence will increase the probability of $T_{3}$ twice as much as $T_{2}$ , which itself is twice as much as $T_{1}$ . Even though $T_{1}$ posits a universe filled with life and $T_{3}$ a universe almost empty of life, our existence is evidence for $T_{3}$ over $T_{1}$ .

So, when updating on anthropic evidence, we have to update on what we see (and the fact that we see it), and not assume we are drawing from a random sample of possible observations about the universe. But, with those caveats, anthropic updating works just as normal updating.

Anthropic reasoning in medium sized universes

There's a final reason that anthropic reasoning can seem daunting. I've shown above that the update process of anthropic probability is the normal update process. But what about the initial probabilities? There are a plethora of anthropic probability theories - SIA, SSA, FNC - and some people (ie me) arguing that probabilities don't even exist, and that you have to use decision theory instead.

But in this section I'll show that, if you make some reasonable assumptions about the size of the universe (or at least the size of the part of the universe you're willing to consider), then all those probabilities collapse into the same thing, which is pretty much just normal probability for the universe in which you exist. If we make those assumptions, we can then do anthropic probabilities in an easy way, at least for problems without explicit duplication.

Defining medium sized universes

Let's talk about how unique you are. From human to human, there is typically 20 million base pairs of variation. Our brain processes 50 bits per second, or 2.2 billion bits in a lifetime. A lot of this information will be highly redundant, but not all of it.

The Hubble volume roughly $10^{31}$ cubic light years, or roughly $10^{31} \times (10^{16})^{3} = 10^{71} m^{3}$ in volume. In bits, this is ${log}_{2} (10^{71}) = 262$ . So if we packed our Hubble volume with humans, and those humans were initially identical but had had about ten seconds to diverge, then we would not expect to find two copies of the same human anywhere.

Of course, humans are not packed anywhere near that density, and humans diverge a lot more than that. So we expect to go a great great great ... great great great distance before finding identical copies of ourselves.

So I define a medium sized universe as universes larger than our own, but where we'd expect to find but a single copy of ourselves. These universes can, of course, be very big - a universe that is $10^{300}$ times bigger than the Hubble volume would count as a very small example of a medium-sized universe.

I suggest that, in general, we restrict anthropic reasoning to medium-sized universes.

This might seem controversial; after all, doesn't the universe appear to be infinite? Well, probability theories have problems with infinity, anthropic probability theories even more so. In most areas, we are fine with ignoring the infinity and just soldiering on in our local area; I'm suggesting that we do that for most anthropic reasoning as well. By "most" I mean "reasoning about situations that don't involve infinities, deliberate duplication, or simulations". Though you can't shove that many simulations into a medium sized universe, so avoiding simulations may be unnecessary (it does tend to make the rest of the reasoning much easier, though).

Anthropic probability in medium sized universes

Different theories of anthropic probability are trying to answer subtly different questions about the universe and ourselves. But they only really differ if there are multiple copies of the same person.

Take SIA. We know that SIA is independent of reference class, so we may as well take the reference class $R_{s}$ consisting of a the agents subjectively indistinguishable from a given human (eg ourselves). Because there are almost certainly no duplicates in this universe, this reduces to a single copy, at most. So if $P_{R_{s}}^{S I A}$ is the probability function for SIA with this reference class, then it is almost exactly equal to $P (\cdot | we exist)$ for $P$ the non-anthropic probability distribution over universes.

And $P (\cdot | we exist)$ is just the Full Non-indexical conditioning version of anthropic probability. Now, I know that FNC is inconsistent; still, in medium sized universes, it's very close to being consistent (and very close to being SIA).

If we use SSA with the reference class $R_{s}$ or the consistent class $R_{s_{t}}$ , we get a similar almost equality:

$P_{R_{s}}^{S I A} \approx P_{R_{s_{t}}}^{S S A} \approx P_{s}^{F N C} = P_{R_{s}}^{S S A} = P (\cdot | we exist)$ .

And understates how nearly identical these probabilities are.

Now, there is one anthropic probability theory that is different: SSA with significantly larger reference class (say the class of all humans, all sentient beings, or all "observers"). But this post argues against those larger reference classes, claiming they belong more to decision theory and morality, not probability. And remember, the definition of the reference class for SSA is contained in the question we are asking. Only for questions where "we could have been person X", in a specific sense, does SSA with larger reference classes make sense.

Another reason to restrict to $R_{s}$ is that in medium sized universes, the anthropic probabilities are essentially free from all the usual paradoxes.

Notice that in using $R_{s}$ , we haven't had to formally define what an "observer" is, or what would qualify an agent to get that rank. Instead we're just looking at agents that are subjectively indistinguishable from each other, a narrow and reasonably well-defined class.

Anthropics for beginners

So, here's how to proceed with anthropics in most situations:

Assume the universe is medium sized.
Check (or assume) that there is no actual duplication or simulations going on.
Use a prior over universes, and update it based on the fact that you exist.
Proceed to update using any other information you find, remembering selection bias: the update is the fact that you saw this information, not that the information exists.

And that should suffice for most non-specialised work in the area.

[-]shminux6y100

Again, anthropics is basically generalizing from one example. Yes, humans have dodged an x-risk bullet a few times so far. There was no nuclear war. The atmosphere didn't explode when the first nuclear bomb was detonated (something that happens to white dwarfs in binary systems, leading to some supernovae explosion). The black plague pandemic did not wipe out nearly everyone, etc. If we have a reference class of x-risks and assign the probability of a close call p to each member of the class, then all we know is that after observing n close calls the probability of no extinction would be p^n. If the number is vanishingly small, we might want to reconsider our estimate of p ("the world is safer than we thought"). Or maybe the reference class is not constructed correctly. Or maybe we truly got luckier than other hypothetical observable civilizations who didn't make it. Or maybe quantum immortality is a thing. Or maybe something else. After all, there is only one example, and until we observe some other civilizations actually not making it through, anthropics is groundless theorizing. Maybe we can gain more insights into the reference classes and the probabilities of a close call, and of surviving an even from studying near extinction events roughly fitting into the same reference class (past asteroid strikes, plagues, climate changes, ...). However, none of the useful information comes from guessing the size of the universe, of whether we are in a simulation, of "updating based on the fact that we exist" beyond accounting for the close calls and x-risk events.

That said, I certainly agree with your point 4. That only the observed data need to be accounted for.

[-]Stuart_Armstrong6y20

none of the useful information comes from guessing the size of the universe, of whether we are in a simulation,

The reason I assume those is so that only the "standard" updating remain - I'm deliberately removing the anthropically weird cases.

[-]Chris_Leong6y30

1) Subjectively distinguishable needs to be clarified. It can either a) that a human receives enough information/experience to distinguish themselves b) that a human will remember information/experience in enough detail to distinguish themselves from another person. The later is more important for real-world anthropics problems and results in significantly more copies.

2) "In most areas, we are fine with ignoring the infinity and just soldiering on in our local area" - sure, but SSA is inherently non-local. It applies over the whole universe, not just the Hubble Volume. If we're going to use an approximation to handle our inability to model infinities, we should be using a large universe, large enough to break your model, rather than a medium sized one.

The correct way to handle SSA is to deal with the exact question that it poses. But for most purposes, this approximation suffices.

[-]Dr. Jamchie6y10

And then I might respond by slicing into the definition of observer, creating "half-observers", and moving smoothly between observer and non-observer...

Do you have this written down somewhere in more detail? It seems that for this to work one needs to assume the gradual appearance of consciousness, something like rock<beetle<mouse<ape<human. Will this work if one assumes consciousness to be binary, that it either is or it isn't?

Will this work if one assumes consciousness to be binary, that it either is or it isn't?

If it's binary, I point out the binariness is arbitrary, start looking at states of uncertainty about whether there was consciousness or not (or observers or not), talk about video feeds that may or may not be observed, or start looking at disasters that kill the population gradually yet inevitably. It's... not a very fruitful avenue to explore, in my view.

[-]Ofer6y10

Therefore A1 would force us to conclude that the safe and the dangerous worlds have exactly the same level of risk!

Similar problems arise if we try and use weaker versions of A1 - maybe our survival is some evidence, just not strong evidence. But Bayes will still hit us, and force us to change our values of terms like P( we survived | dangerous ).

I'm confused by this. The event "we survived" here is actually the event "at least one observer similar to us survived", right? (for some definition of "similar").
If the number of planets on which creatures similar-to-us evolve is sufficiently large, we get:
$P ($ at least one observer similar to us survived $) \approx P ($ at least one observer similar to us survived $|$ dangerous $) \approx 1$

[-]Stuart_Armstrong6y50

No, the event "we survived" is "we (the actual people now considering the anthropic argument and past xrisks) survived".

Over enough draws, you have $P (at least one observer similar to us won the lottery) \approx$ $P (at least one observer similar to us won the lottery | the lottery has bad odds)$ $\approx 1$ .

So we update the lottery odds based on whether we win or not; we update the danger odds based on whether we live. If we die, we alas don't get to do much updating (though note that we can consider hypothetical with bets that pay out to surviving relatives, or have a chance of reviving the human race, or whatever, to get the updates we think would be correct in the worlds where we don't exist).

[-]Ofer6y20

Thank you, I understand this now (I found it useful to imagine code that is being invoked many times and is terminated after a random duration; and reflect on how the agent implemented by the code should update as time goes by).

I guess I should be overall more optimistic now :)

LESSWRONG
LW