Summary: Some of the people on earth who are most similar to you are likely your own person moments from other points in time. Your degree of similarity to them informs (though I haven’t worked out how) what the density is of you-like computation in the universe. This question is interesting for evidential cooperation as I hope that it can help to disentangle evidential cooperation from infinite ethics. Here I tested how similar my decisions in the board game Othello are today compared to 2015. The result was that I chose the same move in 57% of positions for a peculiarity of 0.41 (explained below). The 2015 move was among the 2020 plausible moves in 76% of positions for a plausibility of 0.52 (explained below). (Cross-posted from Impartial Priorities.)
I’ve become interested in what algorithms or computations are and what constitutes their similarity to understand better how large a universe would have to be for evidential cooperation in large worlds (ECL, or formally “multiverse-wide superrationality,” MSR) to be interesting. Here is an insightful article on the topic. This post only documents a self-experiment I conducted to form some intuitions for the topic. It doesn’t contain any qualitative insights.
In particular it seems to me that either (1) the universe is infinite in some way or (2) it is not. If it’s not, (a) it may be too small for ECL to be relevant or (b) it may be big enough. In case ii.a, ECL does not go through; in case ii.b, ECL is fine. But in case i, which is maybe most likely, there’s a sense in which it is more exposed to infinite ethics than all the rest of aggregative consequentialism in that it both depends and is threatened by infinity. Aggregative consequentialism is only threatened by infinity. This could be alleviated if it turned out that the expected gains from trade from ECL are considerable even in a universe that is only, say, “15 million times as large as the volume we can observe.” The article suggests that this size and even much greater finite sizes are plausible. Unfortunately, this is mostly about the volume of space and not about the amount of energy in it.
Please see the summaries by Caspar Oesterheld or Lukas Gloor for an introduction to evidential cooperation in large worlds. I’ll give a quick, narrative introduction in the following.
In the following, I’ll try to sketch the idea behind ECL in intuitive terms. This attempt may easily be flawed. So please see Caspar Oesterheld’s paper before deriving strong conclusions about ECL. I wouldn’t want you to dismiss ECL because I made a mistake in my presentation or endorse it for reasons it can’t deliver on.
That said, imagine a world that is different from ours in that superrationality is somehow much more widely known and considered than in ours. Maybe Douglas Hofstadter, Gary Drescher,[1] or someone from the Center on Long-Term Risk who has worked on ECL has become very famous in that world. (Even more famous than Douglas Hofstadter is in ours already.)
Further imagine, much like in our world, that there are different camps of conscientious altruists – some highly skeptical of all interventions that don’t have a long, well-studied track record of having any effect, and others highly sceptical of such effects. The first group is enthusiastic about GiveWell’s top charities and the second group is enthusiastic about more research into long-term effects. Both of them, however, are even more enthusiastic about the Uncontroversial Missions Force (UMF), which somehow combines the best of both worlds. These facts are known to both camps.
All of these altruists are very busy, so they just have time to keep well-networked within their intellectual community and don’t engage much with the other community.
In this world the coordination problem goes through roughly two stages:
This might’ve worked without the premise of superrationality being well known, but then the other group is a lot less likely to think of this solution, and that reduces the first group’s confidence that the plan will work.
ECL makes one additional crucial assumption: that the universe is infinite in time or space (i.e. that it has infinite volume and matter, not merely without edge, and that the matter is not arranged in some repetitive way), and that any solution to the problem of infinite ethics that we may find or settle on doesn’t upset it more than it will upset all of aggregative consequentialism.
This assumption has two related convenient implications:
Going into this, I didn’t know how similar people tend to be or how to measure that, but three years ago, Tobias Baumann came up with an operationalization that at least allowed him to approximate an upper bound on the similarity of two people by comparing different person moments of his own person:
I just cooked up and carried out a small experiment: I went through archives of online chess games I played last month and sample random positions (filtering those where the best move is obvious) from these games. Then, I think about what move to make, and check whether my favored move matches the move my past self actually played.
The results surprised me a lot. My prior was that I’d play the same move as my past self most of the time (maybe 80% or so). But actually, I chose the same move only 40% of the time (10 out of 25)! Sometimes my past self played “surprising” moves that didn’t even cross my (present) mind.
Of course, I’m not claiming that this is any serious evidence about correlations, but I still found it interesting. Contextual differences (like not actually being in a game, time of day, how exactly the browser window looks, …) are the most plausible explanation for this effect – it’s unlikely that my chess play has radically changed since then. Still, I would have expected my choices to be more “convergent” or stable, and not depend so much on random stuff.
So I also wanted to test how much you could know about me today based on information about a past version of me.
I don’t know chess, so I couldn’t reproduce this experiment exactly, but I have been playing Othello occasionally over the past years and know enough strategy to have some seemingly nonrandom feeling about most situations. In particular I was able to find a few dozen games from September 2015, which I used as a basis for my experiment.
My main training in Othello happened in 2005–2006. Since then I’ve played it again during phases of about a month each in 2015, 2018, and now, 2020. There might’ve been more such phases between 2006 and 2015. My impression was generally that I started from a similar level each time, improved slightly over the course of a month, but then lost some of that ability again over the intervening years. So I surely haven’t, across the board (no pun intended), improved since 2015.
I selected 20 games from 2015 in which I played black and an AI (DroidZebra, today Reversatile) played white. I divided them into segments of first 10 moves, then 11 times 4 moves, and finally the rest of the moves for a total of 240 positions. The rationale was that we played the same openings repeatedly, which would’ve made many of the positions redundant, so I skipped many moves at the beginning until all games were distinct. I also skipped the final moves because I’d often be able to count out the last few moves precisely, which I could’ve also done in 2015, so that differences in performance here would indicate only that I must’ve blitzed the game in 2015 or been too sleepy to care. In one case, I chose a position two moves later than the preordained cut because my next move would’ve been a pass.
I pasted the game transcripts (to me unintelligible alphanumeric strings like f5d6c3d3c4f4c5b4d2e3) into a spreadsheet and set it up such that for each position all further moves were hidden from me. You can find the spreadsheet here. Then I pasted the start of the move sequence that I could see into WZebra and tried to find the best move or moves.
For each position, I recorded one move that I would choose today, all moves that I considered plausible today, the number of all available moves, and afterwards also the perfect moves according to WZebra’s AI. (AIs are vastly superior to all humans in Othello.) The plausible moves were moves where I imagined I’d be almost unsurprised if they turned out as good or better than the move that I had chosen. I sometimes picked an arbitrary chosen move from among the plausible moves. Originally, I had planned to record the number of all moves that were not obviously terrible (the intersection of the set of all moves that I found obviously terrible and the worst moves according to the AI), but I couldn’t find any principled threshold for what moves I should consider obviously terrible according to the AI.
I skipped 90 positions to save time and skipped 6 more because of misclicks that led me to see the perfect moves before I had made my choice. The final sample was one of 144 positions.
The last sheet in the spreadsheet calculates the number of times the 2015 move coincides with the 2020 move; the number of times the 2015 move was among the 2020 plausible moves, and two custom scores. In the following, I’ll call it a hit if the 2015 move coincided with the 2020 move or if the 2015 move was among the 2020 plausible moves.
I called the scores “peculiarity” and “plausibility.” They range from −1 to +1. Peculiarity is just a special case of plausibility, so I’ll focus on explaining the latter. The idea is to weigh a miss by the (negative) additive inverse of the probability of it randomly happening and a hit by the (positive) probability of it randomly not happening. This way, I don’t have to exclude any “obvious” or forced moves from the start but can penalize them later. A forced move has no influence on the score, a hit among 10 possible moves has a strong positive influence on the score, and a miss among 2 possible moves a medium strong negative influence on the score.
: set of all legal moves at a position
: set of 2020 plausible moves at a position
: 2015 chosen move
The peculiarity is just the plausibility where .
I’m not planning to prioritize this investigation again any time soon (within a year or so), but if I do, I’d like to:
Thanks for feedback and suggestions to Max Daniel and Daniel Kokotajlo. No chiasmus intended.
His last name is strangely easy to type for me. ↩︎