Meditations on Moloch (MoM) is great but it’s not super clear: specifically, it’s not super clear what Moloch actually is. In this post I’ll propose a new way of modelling Moloch using evolutionary game theory that I hope makes things clearer.
Moloch is usually explained in terms of well-known economic concepts, like prisoner’s dilemmas, nash equilibria, negative sum games or collective action problems[1]. MoM on this view is a vivid and accessible introduction to these concepts, helping us think through how widely these concepts apply, as well as providing a rousing metaphor - “defeating Moloch” - for the challenge of solving collective action problems.
I think there’s a lot that’s right about this view. But what doesn’t sit right with me is that it implies that MoM doesn’t actually bring anything conceptually new to the table. And this means that it doesn’t quite seem to capture the feeling of insight many people have had reading MoM.
I suggest there’s a way of modelling Moloch that doesn’t just reduce it to standard collective action problems, and so captures what’s new and exciting in MoM.
Specifically, I argue that we need to look to evolutionary game theory to find a model that
Evolutionary game theory (EGT) emerged in the 1970s as the application of game theory to biology, specifically questions around evolving populations.
And it then turned out to be a useful model to apply in economics and social sciences as well, in part because it avoided the need to attribute hyper-rational beliefs to the agents in the model.
I won’t try and give any more of an introduction to EGT here, particularly as someone has recently done that.
Instead I’ll outline one specific model in EGT that helps us get clearer on Moloch, namely the Evolutionary Prisoner’s Dilemma (EPD).
This model is defined by the following assumptions:
Now since (from 1 and 4) this is a large population of individuals with differing rates of reproductive success we can formalise this as a dynamical system using the replicator equation:
Where:
So the replicator equation says that the rate of change in the proportion of individuals playing a particular strategy depends on whether the average fitness of individuals playing that strategy against random members of the population is higher or lower than the average fitness of the population. If it’s higher, the rate of change is positive; if it’s lower, it’s negative.
Plugging in the specific payoffs for PD, where is the expected payoff of Cooperate against a random member of the population and is the expected payoff of Defect, we can derive the following rate of change for the proportion of cooperators in the population:
Since > , is negative, with absolute values depending on the PD payoffs, unless is either 1 or 0.
In other words unless the starting conditions include a population of 100% cooperators, the proportion of cooperators will reduce to zero.
A key aspect of Moloch is the concept of multipolar traps, which is introduced in MoM by means of a number of examples.
Part of what’s confusing about MoM is that these examples don’t seem super well chosen. Since they range so widely - from classic PD and dollar auctions to cancer and competitive behaviour under capitalism - it ends up being very hard to specify what they all have in common apart from some very broad notion of a collective action problem where everyone would be better off if they could somehow coordinate.
Now you might say this just supports the standard interpretation of Moloch as a ‘toolbox’ of different models grouped together under the heading of collective action problems.
But what this view misses is the fact that when MoM tries to summarize what multipolar traps are - summarize, in other words, what the examples all have in common - it keeps returning to core features that suggest a distinctive dynamic.
There are two such summaries in the early parts of MoM:
Summary 1:
all these scenarios are in fact a race to the bottom. Once one agent learns how to become more competitive by sacrificing a common value, all its competitors must also sacrifice that value or be outcompeted and replaced by the less scrupulous. Therefore, the system is likely to end up with everyone once again equally competitive, but the sacrificed value is gone forever.
Summary 2:
A basic principle unites all of the multipolar traps above. In some competition optimizing for X, the opportunity arises to throw some other value under the bus for improved X. Those who take it prosper. Those who don’t take it die out. Eventually, everyone’s relative status is about the same as before, but everyone’s absolute status is worse than before. The process continues until all other values that can be traded off have been – in other words, until human ingenuity cannot possibly figure out a way to make things any worse.
Note that these summaries do not in fact identify a basic principle that unites all the examples he’s given (this might already be obvious to you, let me just spell it out anyway).
For simplicity, let’s just focus on his first example, standard PD (though similar points could be made about other examples).
PD is not an example of either
a) one agent “learning how to become more competitive by sacrificing a common value” (the most one could say is that it is an example of how an agent can gain a higher payoff by sacrificing a common value)
b) agents being “outcompeted” or “dying out” if they don’t make the same sacrifice (the most one can say is that agents that don’t make the same sacrifice will gain a lower payoff)
c)a “process” that continues until things cannot get any worse (the most one can say is that the nash equilibrium in this static one-shot situation is not pareto optimal)
Rather than pointing to generic collective action problems like PD, these key passages in fact point to a dynamic process involving competition and selection - in other words an evolutionary process.
This suggests that the core examples of multipolar traps - those that most closely match the conceptual abstraction MoM points at - are those that present an evolutionary race to the bottom, such as the rat story, the capitalism example and cancer.
And it’s hopefully clear how each of these are a close fit to the EPD model:
In short, I’m not saying that my model is a great fit for all of MoM’s examples of multipolar traps - I don’t think any single model would be.
But I am saying that EPD is a good model of the kind of race to the bottom MoM consistently points to, one which ends in a situation where “human ingenuity cannot possibly figure out a way to make things any worse”. The 100% proportion of defectors that EPD has as a stable end-state is a much better fit to this idea than standard notions of PD and collective action problems.
So far we’ve basically been talking about section I and II of MoM. Let’s see how this model measures up to MoM’s more wide-ranging and philosophical later sections.
Section III starts with the idea “time flows like a river. Which is to say, downhill” …until it reaches the sea. It then provides “three bad reasons - excess resources, physical limitations, and utility maximization - plus one good reason - coordination” why we occasionally manage to move uphill and therefore why we’ve not yet reached the sea.
I guess the focus on coordination as the ‘good’ reason here is another thing that’s led to the view that Moloch is just one big collective action problem. There’s some truth in this, but that’s not all Moloch is. The metaphor of the river flowing downhill to the sea clearly supports the dynamic interpretation I’ve given in terms of EPD. Water flowing downhill is basically a dynamical system. Since water can’t flow down further than sea-level, the sea represents the absolute zero of cooperation that is the only stable state in EPD.
We don’t need to go into the three bad reasons here, except to note that ‘bad’ here means ‘temporary’. We may temporarily have excess resources, as when a whale carcass falls to the bottom of the sea, providing a bounty of food to the population (note the biological example again). But eventually the whale is consumed and the “Malthusian death-trap” is restored (note that this phrase is used as if it sums up what Moloch is).
This argument is important because it means when critics of MoM point to the abundance of cooperation and altruism in the world (e.g Meditations on Mot and Slaying Alexander’s Moloch) they haven’t really succeeded in refuting MoM unless they also show that it isn’t due to living in a dream-time of excess resources. (Side note: are the most convincing examples of cooperation found in modernity? Which coincides with the discovery of how to harness billions of years’ worth of stored energy in the form of fossil fuels?)
It’s also true that the idea that cooperation can increase - the river flowing uphill - means we’re not exactly in an EPD world, since in EPD cooperation steadily decreases. But the whole point of the whalefall argument is that the overall arc of cosmic history might still approximate EPD even if there are complex stochastic variations along the way that make it look like we’re on a different trajectory.
Section IV brings in technology as a key reason for thinking these variations are temporary. Technology enables the population explosion that restores the Malthusian death-trap after whalefall. Advanced AI in particular can get us to one of two nightmare scenarios: the paperclip maximiser, and Robin Hanson’s Age of Em - emulated humans running on super fast hardware.
And both of these correspond to the 100% defect stable state of EPD. One specific metric (the fitness-maximising ‘payoff’) is optimised at the expense of all other values - in the one case it’s paperclips, in the other, economic efficiency.
The EPD model of Moloch seems particularly compelling when MoM reminds us that the paperclip maximizer or economy of Ems couldn’t allow just a small part of the universe to be a nature reserve for beings with other values - and in doing so lists precisely the ‘core’ examples of multipolar traps I highlighted earlier:
Remember: Moloch can’t agree even to this 99.99999% victory. Rats racing to populate an island don’t leave a little aside as a preserve where the few rats who live there can live happy lives producing artwork. Cancer cells don’t agree to leave the lungs alone because they realize it’s important for the body to get oxygen. Competition and optimization are blind idiotic processes and they fully intend to deny us even one lousy galaxy.
Sections V-VIII is where things get more mythological. Moloch turns out to be a new name for some other Gods: Nick Land’s 'Gnon’ and Lovecraft’s Outer Gods, especially Azathoth and Cthulhu. MoM starts using phrases like “Cthulhu, Gnon, Moloch, call them what you will” and “Moloch-aka-the-Outer-Gods”.
Gnon is Land’s garbled acronym for ‘Nature and Nature’s God’ and the Four Horsemen of Gnon are said to be capitalism, war, evolution and memetics. MoM calls these “the same processes I talked about earlier” - further evidence for the ‘core example’ interpretation.
The phrase ‘Nature and Nature’s God’ resonates with the Enlightenment deism that created the idea of God as clockmaker, setting nature’s Newtonian laws into action and then leaving them to it - as well as with Spinoza’s phrase ‘God or nature’ where laws of nature basically are God.
The identification with Gnon therefore supports the idea that Moloch is essentially a dynamical system with fixed laws - of which Newton’s clockwork universe is the archetype - rather than a static coordination problem.
And it also supports the idea that the nature of that dynamical system involves Darwinian evolutionary processes - two of the four horsemen are evolution and memetics, and the other two (war and capitalism) are key arenas where these processes play out in human history. So Moloch-aka-Gnon is very much the kind of thing you’d use evolutionary game theory to model.
If there’s a difference between Moloch and Gnon, it’s the fact that Nick Land’s followers think following the laws of Gnon is a good idea. And the thrust of these sections of MoM is that, no, Gnon is not good, actually, because “in the long run, we’re all dead and our civilization has been destroyed by unspeakable alien monsters”.
This is why MoM brings in Lovecraft’s more pessimistic mythemes. Cthulhu is the God whose cultists free him from his watery grave - and he then gobbles them up. And Azathoth is the blind idiot god that Eliezer had already said was the God of evolutionary processes that don’t necessarily end well. In his new guise as Moloch he becomes the God of evolutionary processes that necessarily don’t end well.
So these last sections of MoM - no less than the other sections - strongly support interpreting Moloch as not just a toolbox of collective action problems, but as an evolutionary dynamical system that ends in total annihilation - of which the simplest model that captures the connection to collective action problems is EPD.
I’ll end with some reflections on how EPD can explain why it’s useful to talk about Gods at all here.
Richard Ngo’s Meditations on Mot presents maybe the clearest argument that anthropomorphising coordination failures as Moloch is not useful. He gives the example of Mot - another Canaanite God supposedly representing the lack of technology in our lives.
Whenever a patient lies suffering from a disease that we haven’t cured yet, that’s Mot’s hand at work. Whenever a child grows up in poverty, that’s because of Mot too. We could have flying cars, and space elevators, and so much more, if it weren’t for Mot.
And this is supposed to be a kind of reductio ad absurdum of the strategy of anthopomorphising abstract problems as Gods. There’s no ‘unified force’ holding back the progress of technology: absence of advanced technology is just the default state. And there’s a similar situation with coordination failures: their absence is just the default state, and it takes specific achievements in respect of institution design, governance and political and moral thinking to overcome them, just as technology requires specific scientific and engineering breakthroughs.
Now it’s worth noting that despite these arguments, Moloch is still said to be useful ‘as a rallying cry’.
We can’t be cold and rational all the time—we need emotionally salient motivations to get us fired up.
And this is indeed one standard reason given for having rationalist Gods - it’s a cognitive hack to repurpose our tribal instincts and the way that our brains are wired to attribute agency towards the goal of improving our ability to thrive as a species.
But is Ngo right that “we should get rid of Moloch as a causal node in our ontologies: as a reason why the world is one way, rather than another”?
‘Meditations on Mot’ relies heavily on the standard interpretation of Moloch as a wide range of collective action problems. But if we interpret Moloch through the lens of EPD, as I’ve been suggesting, it’s a lot easier to see the ‘unifying force’ at work.
For one thing, EPD is a dynamical system: just like, say, a swinging pendulum, the replicator equation is a simple function that tells us the time-dependence of a variable, namely the proportion of cooperators in the population.
And one could argue that the mathematical decrease of that variable to zero over time is no less literally the effect of a force than, say, the effects of gravity or the electroweak force, both of which bottom out in time-dependent differential equations when you really drill down.
But irrespective of this, Moloch-aka-EPD is clearly a unifying model with the potential to explain the macroscopic features of our civilizational and even cosmic trajectory - at least as a first approximation, which is the way mathematical models generally work in science.
And it’s the macroscopic aspect here - the fact that we’re not just talking about specific collective action problems here or there, but evolutionary dynamics that apply universally, that makes it especially natural to talk of Gods.
There’s a long tradition - from the Stoics, to Thomas Aquinas to more recent writers like Paul Davies - who see the ‘Laws of Nature’ as evidence of a divine, mind-like principle behind them. Even just the use of the word ‘law’ here suggests the idea of a cosmic lawgiver.
Now I’m not at all saying this is a logical inference. My point is that the move from scattered coordination problems to EPD and the dynamics of evolution takes you into the territory of ‘laws of nature’. And so if you want a mythology - whether as a cognitive hack, motivational tool or whatever - then EPD is a great candidate for deification.
My main argument has been that EPD is a good model of what Meditations on Moloch is all about - and might help explain what you found interesting and persuasive about it.
MoM is not just classical game theory with mythical window-dressing: there's a conceptual innovation here that parallels the move from classical game theory to evolutionary game theory.
This raises some further questions, which I aim to explore in future posts. What does this evolutionary reframing imply for the practical challenge of 'defeating Moloch'? More broadly, is this move from classical to evolutionary game theory helpful in understanding major global challenges and existential risks? And aside from EPD, are there other models in evolutionary game theory we should be exploring here?
For example: Liv Boeree and this Lesswrong sequence describe Moloch as “the God of negative-sum games”. There’s a similar interpretation in criticisms of MoM such as Meditations on Mot, which calls Moloch the “God of coordination failures”. The Moloch’s toolbox section of Eliezer’s Inadequate Equilibria book isn’t particularly trying to be an interpretation of MoM, but still bottoms out in fairly standard economic concepts - games with multiple nash equilibria, two-sided markets. Possibly the most comprehensive interpretation of Moloch in my view is the chapter on Collective Action Problems in Dan Hendrycks’ AI safety book, which also implies that Moloch is the God of collective action problems, understood as including a wider range of standard economic models from prisoner’s dilemmas, to arms races.