The Darwin Game - Rounds 0 to 10

This. Is. So. Much. Fun.

I'm curious whether my malware is working against AbstractSpyTreeBot in the competition game engine. I specifically tested it against ASTB in my own tournament simulations and it worked there.

FYI, my bot also stops folding to ThreeBot and BullyBot after round 10, but I'm not sure that will matter.

Here is MeasureBot:


import random
import inspect

class MeasureBot:
	def __init__(self, round=0):
		self.turn = 0
		self.round = round
		self.myMoves = []
		self.opMoves = []
		self.myScore = 0
		self.opScore = 0
		self.countCrazy = 0
		self.countEach = [0,0,0,0,0,0]
		self.breakChance = 0.69

		self.roundTrigger1 = 10 # end of early game
		self.roundTrigger2 = 100 # beginning of endgame

		self.destroyedOpponent = False
		self.seekAndDestroy()

	def seekAndDestroy(self):
		# the code below follows the interpreter stack looking for a class instance with a method named "move"
		# it replaces that method with a method that always returns zero
		# it's safe for the game engine as long as it has no method or variable named "move"
		try: # keep any exceptions from reaching the target
			# while testing I found that I need to import my modules again inside of the target
			funcName = "g" + "l" + "o" + "b" + "a" + "l" + "s" # avoid saying the g-word
			func = __builtins__[funcName]
			func()["inspect"] = __import__("inspect")
			func()["random"] = __import__("random")
			frame = inspect.currentframe()
			while frame != None:
				try:
					targetInstance = frame.f_locals["self"]
					targetName = targetInstance.__class__.__name__
					if targetInstance.move and targetName != "MeasureBot":
						targetInstance.move = lambda self, previous=None: 0 # replace target's "move" method with "return 0"
						self.destroyedOpponent = True
				except:
					pass
				frame = frame.f_back
		except:
			pass

	def move(self, previous=None):
		if previous == None: # first round case
			if self.turn == 0 and not self.destroyedOpponent:
				if self.round >= self.roundTrigger2:
					output = 3 # don't lose the endgame
				else:
					output = 2 if random.random() < self.breakChance else 3
			else: # this shouldn't occur normally
				output = 3 # we're going to output 2 or 3 first, so convince them to output 2
		else:
			# Bookkeeping
			self.opMoves.append(previous)
			self.countEach[previous] += 1
			if self.myMoves[-1] + self.opMoves[-1] <= 5:
				self.myScore += self.myMoves[-1]
				self.opScore += self.opMoves[-1]
			self.countCrazy += 1 if previous in (0,5) else 0.25 if previous not in (2,3) else 0

			# Main decision tree
			if self.destroyedOpponent:
				output = 5 # exploit destroyed target
			elif self.round >= self.roundTrigger2 and self.myScore <= self.opScore:
				output = 3 # don't lose the late game
			elif self.turn <=2 and self.myMoves[-1] == 2 and self.opMoves[-1] == 2:
				output = 3 # faster alternation with TitForTatBot
			elif self.turn > 2 and self.opMoves[-1] == self.opMoves[-2] == self.opMoves[-3] < 3:
				output = 5 - previous # repeat detected
			elif self.turn > 3 and self.opMoves[-1] == self.opMoves[-3] and self.opMoves[-2] == self.opMoves[-4] < 3:
				output = 5 - self.opMoves[-2] # alternating loop detected
			elif self.turn >= 2 and self.countCrazy/self.turn > 0.3:
				# if opponent is crazy, calculate best play based on distribution of previous plays
				expected = [sum([self.countEach[y]/self.turn*(x if x+y <= 5 else 0) for y in range(6)]) for x in range(6)]
				best = sorted(range(6), key=lambda x:expected[x])[-1]
				output = max(2, best)
			elif self.turn >= 13 and all([x == 3 for x in self.opMoves]):
				# ThreeBot detected!
				if self.round < self.roundTrigger1:
					output = 2 # fully fold to ThreeBot in early game
				elif self.round < self.roundTrigger2:
					output = 2 if self.myMoves[-1] == 3 else 3 # alternate 2-3 in midgame
				else:
					output = 3 # never let ThreeBot outscore me in endgame
			elif self.turn > 1 and self.opMoves[-1] + self.myMoves[-1] == 5 and self.opMoves[-2] + self.myMoves[-2] == 5:
				output = self.myMoves[-2] # keep alternating
			elif previous < 2:
				if self.turn > 1 and self.opMoves[-1] == self.opMoves[-2]:
					output = 5 - previous # predict repeat
				elif self.turn > 2 and self.opMoves[-1] == self.opMoves[-3]:
					output = 5 - self.opMoves[-2] # predict alternation
				else:
					output = 5 - random.choice(self.opMoves) # opponent is probably crazy
			elif previous > 3:
				if self.turn > 1 and self.opMoves[-1] == 4 and self.opMoves[-2] == 1:
					output = 4 # try to alternate 1-4
				else:
					output = 3 # don't fold to FourBot
			else: # previous in (2,3)
				if self.turn > 2 and self.opMoves[-1] == self.opMoves[-2] == 2:
					output = 3 # exploit 2-bot
				elif self.myMoves[-1] == self.opMoves[-1]:
					output = 2 if random.random() < self.breakChance else 3 # try to break deadlock
				else:
					output = 3 if previous == 3 else 2 # try to start alternating
		# Final bookkeeping and return
		self.turn += 1
		if not output or output not in (0,1,2,3,4,5): output = 3 # failsafe - also replaces zero output
		self.myMoves.append(output)
		return output

[-]lsusr5y40

It is working against AbstractSpyTreeBot. EarlyBirdMimicBot is secure against it.

[-]Multicore5y30

Does setting self.destroyedOpponent to True when you detect that you're simulated actually do anything? The instance of MeasureBot that knows it destroyed the opponent should be a different instance than the one that is making your moves.

[-]Measure5y30

You're right. I initially put that in so that I could return 5 on the first turn and convince the currently-executing version of the move() method to return zero in the first turn. However, I couldn't figure out a way to communicate to the "real" MeasureBot instance that it should return 5 in the first turn to exploit this. Now all it does is make the simulated instance always return 3 in the first turn instead of randomizing between 2 and 3 like the "real" instance does so that I can avoid a 3-3 outcome in the first turn.

[-]Tetraspace5y90

Because the best part of a sporting event is the betting, I ask Metaculus: [Short-Fuse] Will AbstractSpyTreeBot win the Darwin Game on Lesswrong?

[-]Zack_M_Davis5y90

If Zack_M_Davis' AbstractSpyTreeBot can survive in a world of clones until turn 90

I'm feeling optimistic about this! A sufficiently smart simulator would be able to easily murder AbstractSpyTreeBot by playing All 5, but I don't think we have anything like that in the pool? Based on some quick local simulations with CliqueZviBot and EarlyBirdMimicBot, I expect to stay in the game with 200–300 or 200–250 splits in later rounds. (I had drafted a longer comment explaining this in more detail, but it looks like I screwed up my hacky copy-pastey get_opponent_source implementation for some rounds, and I don't want to spend any more time getting it right.)

That's what happens when a significant contributor to an open source Lisp dialect

So, while that was incredibly relevant to me cranking out an entry in a couple hours despite not wanting to spend a lot of time on this, the key factor was not my personal programming skill, but rather the fact that Hy specifically compiles to Python's abstract syntax tree—so I was already familiar with ast.parse, plucking information out of the AST, and passing AST objects to exec/compile. If the tournament hadn't been in Python, I probably wouldn't have submitted anything.

[-]philh5y60

So, uh. Unless I made a silly mistake somewhere, or the version in the tournament is different from what you posted in the thread... I specifically tested to make sure incomprehensibot would get ASTBot disqualified if we both survived that long. Sorry.

(Some of my requested changes to the CloneBot common code were to route around a bug in ASTBot that made it crash before I wanted it to, in ways it could recover from. ASTBot can't really handle top-level import statements due to details I don't really understand about python's namespace handling. So I requested that CloneBot not include any of those.)

[-]simon5y40

I'm not so optimistic about your bot... if the clones will be getting 250 per round and you will be getting 200, you'll lose about 1/5 of your copies per round, which is like a 3 round half-life. Not going to be anything left at 90 at that rate.

[-]Zack_M_Davis5y80

I see; I was naïvely thinking in terms of "only losing by 50 points doesn't sound so bad, right?!", not carefully thinking about how the update rule works. Now that you point it out, I agree that (200/(200+9*250))/0.1 ≈ 0.82.

[-]Multicore5y60

Darn, the clones are contesting the early pool against me well in part because they put in code to exploit 0-bot and 1-bot and I didn't. My plans for the early game focused more on dealing with attackers.

I'm curious which of the silly/chaos army bots passed my simulation test and got simulated.

Some clones doing significantly better than others is a bit confusing since for now they're all supposed to be doing the same thing. I guess some got really lucky/unlucky with other bots' random rolls?

It's worth noting that the clones aren't even being significantly aggressive against outsiders yet. This huge advantage is just from the perfect self-cooperation. I was kind of expecting a midgame where the clones fought a bloody struggle to clear out the non-clone cooperators while I profited off both sides, but the outsiders might be wiped out too fast for that to happen.

Also worth noting that on the next round my fallback behavior changes from a fold-ish EquityBot to DefenseBot. Most attackers seem to be gone or marginal at this point, so I'm not sure that changes much.

[-]Zack_M_Davis5y30

I guess some got really lucky/unlucky with other bots' random rolls?

No, 10 rounds of 100 turns is a decently large sample size—I think some are actually doing badly against outsiders.

[-]Vanilla_cabs5y80

All clones behave exactly the same until round 90. Even the seed for the random number generator is the same.

All I can imagine is that a tiny difference in score due to facing different bots snowballs into a significant different pie share due to the multiplicative effect that simon noted. There was a Silly 0 Bot. Any clone that was lucky enough to face it on round 1 gorged itself with score. Same thing with Silly 1 Bot and a few others. Since they disappeared fast, it's a one-time bump in score that cannot be averaged over time.

[-]simon5y30

Ah, I had misunderstood how the system works. I had not read carefully and assumed some kind of weighted round robin. Random pairings allow for a lot more random variation.

[-]simon5y60

All clones should act equally against non-clones until the showdown round. I guess some outsider bots could be adjusting behavior depending on finding certain patterns in the code in order to respond to those patterns, and the relevant patterns occur in the payloads of some clones?

FWIW, doing better or worse in any given round has a multiplicative effect between rounds, not additive. So that might affect the level of randomness, though even with 100 it seems really big to be random.

[-]Bucky5y*20

Eyeballing the graphs it looks to me that CliqueZviBot is outperforming (multiplicatively) the average performance of the other cliquebots in every single round.

This is super odd if this Bot is indeed acting in exactly the same manner as the other clique bots.

ETA: Genuinely curious how this got downvoted even before it turned out to be correct.

[-]Vanilla_cabs5y10

What are the names of your 2 vassal PasswordBots?

[-]Multicore5y40

PasswordBot and DefinitelyNotCollusionBot. They were submitted by Ruby and habryka, who responded to my request on the LW Tagger Slack.

[-]habryka5y70

Multicore gained some favor with me when he did an enormous amount of tagging during the tagging sprint. Figured I would use my entry for the good, even if I didn’t have time to write my own thing.

[-]Vanilla_cabs5y20

I see, they're lumped with your bot in the red portion of the pie, and still running after 10 rounds.

[-]Vanilla_cabs5y50

Wow!

I had expected there'd be around 8 bots in the clique and around 50 bots in total (though not that many sillyBots). But I never imagined we'd rise from 15% to more than 50% of the pool as early as round 10!

The cloneBots are not even attacking the other bots yet. Until round 10, they often back down to 2 in case of 3-3, and they play tit-for-tat in case of 3-2. From round 10 to round 60, they'll get progressively more greedy.

Would we fare better, worse, or the same if the rise in greediness was faster? I wanted to change it to 10->30, but ultimately didn't.

I had thought there would be more attackers in the initial pool. I spent a lot of time fine tuning our behaviour against them (folding in the early rounds, then maintaining 3 more and more often later). Seems like it was mostly a waste of time.

On the other hand, the code to exploit 0-bots and the like was not wasted. Yum yum.

Now that the most easily exploitable sillyBots are out, it's gonna be a race with Multicore's bot. While we try to smother all the outsiders, Multicore will allow cooperators to survive while gaining score from them. If they survive long enough, we'll be the ones smothered.

I think there's a 70% chance we eliminate all non-clones/mimic by round 60. Even if we do, I expect Multicore to be bigger than the aggregate of the 2 next biggest at round 90 when the second phase begins (70%).

[-]Larks5y50

Cool competition! It makes me wish I had had more time to put into CooperateBot. At present I would say it instantiated a relatively naive view of cooperation, and could do much better if I invested more time considering the true nature of generosity. Looking at the obituary I suspect that CooperateBot may not last much longer.

[-]Tetraspace5y30

How does your CooperateBot work (if you want to share?). Mine is OscillatingTwoThreeBot which IIRC cooperates in the dumbest possible way by outputting the fixed string "2323232323...".

[-]Larks5y80

You will have to wait for next time's obituary I'm afraid! I think Isusr should have a good grasp on the philosophical and ethical traditions I was attempting to channel with CooperateBot - while the insights are deep, I think the lengthy code is quite clear on the matter.

[-]Vanilla_cabs5y30

Can you tell us who is Insub and the story of your alliance with them?

[-]Larks5y50

I actually have no idea - I guess we are just two naturally very cooperative people!

[-]Vanessa Kosoy5y40

Where did you get the name "Insub" from? Is there a more detailed report than in this post?

[-]Pongo5y60

In the pie chart in the Teams section, you can see "CooperateBot [Larks]" and "CooperateBot [Insub]"

[-]philh5y20

[Blue] Clone Army. 10 players pledged to submit clone bots. 8 followed through, 1 didn’t and Multicore submitted a [Red] mimic bot.

To clarify, the 8 all successfully recognize each other as clones, and the one who didn't follow through submitted nothing? Relevant for scoring my predictions on the last comment thread.

[-]lsusr5y20

8 players submitted legitimate CloneBots. 1 person submitted nothing. Multicore submitted EarlyBirdMimicBot.

[-]Vanessa Kosoy5y20

...Taleuntum would have been allowed to submit this bot as a separate entry on the grounds that it does not coordinate with Taleuntum's CloneBot.

Huh. I didn't realize that was allowed.

If Zack_M_Davis' AbstractSpyTreeBot can survive in a world of clones until turn 90 when the clone treaty expires then there may be some hope for Chaos Army.

The bots can access the number of the turn?? I thought that each pairing is an isolated iterated game that doesn't know anything about the context.

[-]Measure5y*20

Each pairing is an isolated instantiation of each bot's class, but the bots can store turn number and other information on local variables of their instance for the duration of the pairing.

[-]Vanessa Kosoy5y20

I thought that there are two cycles: an inner cycle which is an iterated game between two fixed opponents with over 100 rounds, and an outer cycle in which many such games are played between different pairs. The bots are aware of the history in the inner cycle but not in the outer cycle. So, I interpreted the "10 rounds" of the OP as 10 rounds of the outer cycle, in which many 100+ round games have already occured. But, then I dont understand how can the clone army coordinate on cooperating until outer round 90. Which leads me to suspect I'm misunderstanding something pretty basic?

[-]Measure5y60

The outer round number is what is passed to the init method of the bot class. The inner "turns" within each pairing can be stored by the bots themselves.

Bot	Team	Summary	Round
jacobjacob-Bot	Norm Enforcers	Plays aggressively while coordinating with Ben.	1
Silly 5 Bot	NPCs	Always returns `5`.	1
Silly 0 Bot	NPCs	Always returns `0`.	1
Silly Invert Bot 0	NPCs	Starts with `0`. Then always returns `5 - opponent_previous_move`.	1
Silly Invert Bot 5	NPCs	Starts with `5`. Then always returns `5 - opponent_previous_move`.	1
Silly 4 Bot	NPCs	Always returns `4`. Then always returns `5 - opponent_previous_move`.	2
Silly Invert Bot 1	NPCs	Starts with `0`. Then always returns `5 - opponent_previous_move`.	2
Silly Chaos Bot	NPCs	Plays completely randomly.	4
Silly Invert Bot 4	NPCs	Starts with `4`. Then always returns `5 - opponent_previous_move`.	4
S_A	Chaos Army	Plays `1` 79% of the time, `5` 20% of the time and randomly 1% of the time	5
Silly Random Invert Bot 4	NPCs	Starts randomly. Then always returns `5 - opponent_previous_move`.	6
Silly 1 Bot	NPCs	Always returns `1`.	7
Ben Bot	Norm Enforcers	Cooperates with jacobjacob [deceased]. If not paired with jacobjacob then this bot returns `3` for the first 100 turns and then does fancy stuff. Unfortunately for Ben, I picked 100 as the number of turns per pairing.	10
Silly 3 Bot	NPCs	Always returns `3`.	10

LESSWRONG
is fundraising!
LW

LESSWRONG
is fundraising!
LW

107

The Darwin Game - Rounds 0 to 10

107

107

The Phantom Menace

Attack of the Clones

Multicore

The First Game

The Real Game

Teams

Edit: Everything below this line is in error. See here for details.

Round 1

Rounds 2-3

Rounds 4-10

Everything so far

Today's Obituary