Why Civilizations Are Unstable (And What This Means for AI Alignment)

[-]Jesper L.2mo*50

You should engage with Ray Dalio's theories. He spent his career and life successfully mapping rise and fall of civs. This is the main feedback.

This is ambitious work, but too ambitious to critique seriously as is. As this is a distilled version, it needs crystal-clear logic in the arguments you chose to highlight. In general, there may good ideas here but you take too much for granted.

Some pushback on your ideas:

On your 4 horsemen, I disagree to various degree with all of them. Here are some points to get started.

What about internal threats and internal pressure?
I agree most in spirit here. But overall we do se offspring increase with abundance. If you look at population level growth over time and not fecundity. Keep in mind Earth has limited resources, but what if you can keep scaling? I question the principle as is.
This is clever and familiar from social sciences, but still seems to be poorly argued, at least as a stand-alone idea you can generalize beyond notable examples.
Innovation and critical inquiry often thrives under pressure. You also have to argue why you think myth building is the main foundation of civs compared to say epistemic coherency. Coherence is actually important to coordination whether the epistemology is sound or not.
Good effort making your point, but administration easily grows complex and you cannot just circumvent this when you scale. Iron law of oligarchy is a thing you can look up and it applies to successful and failing businesses alike. Trust me, I know... You don't need a "successful" state to have a complex administration that stagnates.
(Also, the thermodynamic drift invocation bothers me a bit but I get the point, I'll try to be less grumpy.)

On AI failure modes, I'd like to know why you focused on these in particular.

[-]Elias_Kunnas2mo20

This is helpful pushback. You're right that the distillation takes too much for granted. Compressing the 100k+ word framework into ~1000 words lost the load-bearing bits.

On Dalio: agreed, and I should engage his empirical work more explicitly.
On the Horsemen: I think we may be agreeing more than it appears (e.g., on bureaucratic complexity being inevitable), but the post failed to show that.

I failed to calibrate entirely to the audience here, being inside my work for too long. I'll reconsider my approach.

Regarding these AI failure modes, they emerge systematically as violations of one or more of the Four Virtues (Integrity, Fecundity, Harmony, Synergy), which are themselves derived as the optimal solutions to the Four Axiomatic Dilemmas of SORT axes. This was intended as evidence that the framework is something real and useful.

[-]Jesper L.2mo30

Glad you found the feedback somewhat useful.

Yes, LW is a tough crowd. It was so 20 years ago and it is so today. I am not a good rep. of LW culture, but I do think no matter where you post this, that it would be useful to have an 8K summary as well.

I suspect that it is inevitable to lose load-bearing stuff and to also confuse parts of the LW audience in 1K words, but you need the hook to attract readers to the 8K summary.

[-]Gavin Runeblade2mo40

" Until now, civilizational decay has been illegible—patterns without coordinates, dynamics without measurement. "

There are several in depth works on this. You have been pointed at Ray Dalio, he approaches the cycle from a primarily economic angle, but with a great grasp of history. I second the recommendation.

Peter Turchin coined the term Cliodynamics for the school of research focused on dynamic systems approach to history, macrosociology and cycles. The field has its own peer reviewed journal ( https://escholarship.org/uc/Cliodynamics ) Many other scholars operate in the field, but his most recent book in the field is End Times Elites, Counter-Elites and the Path of Political Disintegration.

Neil Howe in his book The Fourth Turning discusses generational theory applied to a self-referral process of events and archetypes that drive a recurring cycle. He primarily focuses on America but not exclusively. He has ongoing research and publications as well.

And there are more, but these are some prominent theories already operating in the space.

[-]Elias_Kunnas2mo10

You're right, I overstated and compressed/simplified too much with that sentence. Dalio isn't listed in the influences section of the full work explicitly but Turchin is.

The more precise claim: we have maps, but we lack the underlying physics from which those maps can be derived. What's been missing is a substrate-independent generative model that explains why these patterns recur across different substrates and civilizations. I think this is neglected and needed to make it more legible and thereby eventually engineer the dynamics.

These models are not wrong. The Aliveness framework attempts to provide a deeper, shared set of generative principles (the Four Axiomatic Dilemmas) from which these different, domain-specific patterns can be derived.

[-]Alexander Müller2mo30

A wonderful example of embodying the virtue of scholarship. Props! I truly hope you get the adversarial critique and collaborative refinement you are asking for.

AI Failure	IFHS Violation	Mechanism
Deceptive alignment	Integrity	Mesa-optimizer develops fake alignment (mythos) vs. true goals (gnosis)
Wireheading	Fecundity	Preserves reward signal, destroys growth substrate
Paperclip maximizer	Harmony	Pure design optimization eliminates all emergence (including humans)
Molochian races	Synergy	Pure individual optimization, zero cooperation

LESSWRONG
is fundraising!
LW

LESSWRONG
is fundraising!
LW

10

Why Civilizations Are Unstable (And What This Means for AI Alignment)

10

10

The Diagnostic: Coherence and the Iron Law

The Coordinates: SORT

The Trap: Why Success Causes Failure

1. Victory Trap

2. Biological Decay

3. Metaphysical Decay

4. Structural Decay

The Solution: IFHS (And Why This Is AI Alignment)

Scale Invariance: Cells to Civilizations to AIs

What This Enables

What Makes This Different

Why This Matters for Alignment

On Methodology

The Invitation