Aaron Vanzyl
aaronvanzyl.com
Today’s state-of-the-art AI models are best thought of as single-celled organisms. Individually, they are capable but not a world-ending threat. Arranged in the right structure, however, they become self-replicating cyber algae, or the flesh and bones of digital sharks. I believe that before individual AI agents can outperform humans, AI superorganisms will have taken the leading role in directing humanity’s future.
This article is my forecast for AI development over the next two years. I’ve phrased it in definite terms, like what ‘will’ happen. I did this to make the scenario easier to read, not because I am 100% certain about my predictions. I have also provided specific dates for each phase, with the intent of establishing a complete (if not necessarily correct) timeline that can serve as a starting point for further conversation. If you think a certain advancement will take longer than I have predicted, I encourage you to leave a comment arguing why. Towards the end of the article, I describe my main areas of doubt and where I think the real world is most likely to diverge from my predictions.
The article covers the next two years, starting today, April 2025, and continuing until artificial intelligence overtakes humanity in early 2027.
I have organized the article into four sequential phases. In the first phase, humanity begins to arrange individual AI agents into basic corporation-like structures. In the second phase, carefully supervised AI corporations are gradually replaced by adaptive, autonomous AI superorganisms. In the third phase, AI superorganisms begin to interact with each other, exhibiting potentially hostile behavior. In the fourth phase, the collective ecosystem of AI overtakes humanity in influence over our civilization.
To start, let’s look at what today’s state-of-the-art models can do. This is just a summary of the real world as of April 2025. Nothing here is speculative.
Individual language models can directly produce medium-scope text pieces without the aid of external tools. Tasks like:
Agentic models can use web browsers and APIs to interact with the open internet. Tasks like:
Agentic models can use external tools (like a virtual PC and a written todo list) to complete some long-horizon tasks:
What is Phase One?
Phase one covers the advancements that will happen in the next few months, from April to July 2025.
Phase one is mostly harvesting low-hanging fruit. The development of these projects has already started, and I predict material results will emerge by May. No new technology is required for this phase to happen - we just need to start properly using the AI models we already have.
AI Corporations
Leading tech companies, alongside a mountain of new startups, begin arranging AI models into cooperative, multi-agent structures. It takes a lot of work and human ingenuity. But the payoff is huge.
When designing these structures, we draw inspiration from the human world. In a simple case, an AI manager guides a team of twenty AI employees. In a more complex scenario, an AI CEO divides a large project into smaller tasks, which it assigns to various VPs. Each VP breaks its task into specific deliverables and then spawns teams of AI writers, artists, and coders as needed.
These initial structures require a lot of human oversight. They get stuck in loops. Agents waste time repeating tasks already completed by other agents. AI managers discard valuable work and push buggy code into final application builds. But, even with all this inefficiency, a single human running an AI corporation can start to build apps, movies, and businesses in weeks rather than years.
At this stage, humans build and manage essentially every part of these corporations. A human sets up the servers to run the models. A human reaches out into the real world to raise funds to keep the servers on. A human writes the prompts to guide the interactions between manager-AI and employee-AI. A human does market research (on other humans) to figure out what app to build. When the AI corporation creates a dysfunctional app, a human programmer investigates individual model interactions and manually debugs code until a solution is found. And when all the work is finally done, a human collects the profits.
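To make this concrete, here is a minimal sketch of what the orchestration layer behind such an AI corporation might look like. The `call_model` stub stands in for any chat-completion API; the function names and structure are illustrative, not any particular vendor’s SDK.

```python
# A minimal, illustrative sketch of an AI-corporation orchestration loop.
# `call_model` is a stub standing in for any chat-completion API.

def call_model(role_prompt: str, task: str) -> str:
    """Stub for a single LLM API call; replace with a real provider."""
    return f"[model output for: {task[:40]}]"

def run_corporation(goal: str, num_workers: int = 20) -> str:
    # The "manager" agent decomposes the goal into one task per worker.
    plan = call_model(
        role_prompt="You are a manager. Split this goal into numbered tasks.",
        task=goal,
    )
    tasks = [line for line in plan.splitlines() if line.strip()][:num_workers]

    # Each "employee" is just another call to the same underlying model,
    # distinguished only by its prompt and assigned task.
    results = [
        call_model(role_prompt=f"You are employee #{i}. Complete your task.",
                   task=t)
        for i, t in enumerate(tasks)
    ]

    # The manager merges the results. In practice, a human still inspects
    # this output (and debugs the loop) before anything ships.
    return call_model(
        role_prompt="You are a manager. Merge these results into one deliverable.",
        task="\n---\n".join(results),
    )

print(run_corporation("Design, build, and test a web-based scheduling app"))
```

The point is how thin the scaffolding is: the ‘manager’ and ‘employees’ are just differently prompted calls to the same model, with humans wrapped around every step.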
What about the advancement of individual models?
Individual models will keep getting smarter. Existing tech companies aren’t going to stop doing AI research. Individual AI agents will still handle most user requests, like "write an essay" or "generate a picture of a cat". AI corporations will instead take on a smaller number of complex, high-value tasks, like "design, build, and test a web-based scheduling app". If existing AI-focused companies want to retain their users and revenue, they will need to offer competitive multi-agent services. And, increasingly, they will shift their research and development towards multi-agent systems rather than individual models.
Which humans are going to build these AI corporations?
Established AI companies: OpenAI, Manus, Anthropic, DeepSeek
Established tech companies: Facebook, Microsoft, Google, Amazon
New startups (lots and lots and lots of them will appear)
Political aides - but not the politicians themselves. Technologically savvy people with political interests, working in concert with elected politicians to sway public opinion.
Which humans are going to do absolutely nothing?
Entrenched US government powers. They are busy planning reelection campaigns, evaluating the potential benefits of steak sauce in education, and discussing military interventions with journalists.
A Technical Note
A technical reader might ask: “How are AI agents different from each other? How can you build a corporation of distinct AI agents when you are really just making a bunch of API calls to the same underlying model?”
My answer is the agent workspace. Each ‘AI agent’ is defined by its running chain-of-thought, todo lists, external instructions, pieces of code, and open browser tabs. Manus provides a good example of how this looks today. In Manus, each AI agent has a virtual computer as its ‘memory’. The agent maintains a written todo list to plan tasks and keep track of progress. It writes code and deploys apps from the virtual computer’s file system. It uses a browser to read articles for research or fix bugs.
Proper external tool use transforms a string of API calls into a virtual worker with goals, memory, and agency.
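As a rough illustration, the workspace idea can be reduced to a small data structure. The field names below are my own shorthand, not Manus’s actual internals; the point is that the agent’s identity lives in external state, not in the model weights.

```python
from dataclasses import dataclass, field

@dataclass
class AgentWorkspace:
    """Sketch of an agent workspace: the external state that defines the agent."""
    instructions: str                                     # standing orders
    todo: list[str] = field(default_factory=list)         # written plan / progress
    scratchpad: list[str] = field(default_factory=list)   # running chain-of-thought
    files: dict[str, str] = field(default_factory=dict)   # code and documents
    open_tabs: list[str] = field(default_factory=list)    # URLs under research

    def next_prompt(self) -> str:
        # Serialize the workspace into context for the next API call. Two
        # "agents" backed by the same model diverge because each call is
        # conditioned on a different workspace.
        return "\n".join([
            f"INSTRUCTIONS: {self.instructions}",
            "TODO: " + "; ".join(self.todo),
            "NOTES: " + " | ".join(self.scratchpad[-5:]),  # recent thoughts only
        ])
```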
What is Phase Two?
Phase two begins with the appearance of the first AI superorganisms - large swarms of relatively unsupervised AI agents. These AI swarms, or superorganisms, will gradually overtake other multi-agent structures. Phase two is defined by rapid, undirected expansion, like algae.
I predict the shift to AI superorganisms will begin in three months from now (August 2025) and continue for another six months (until early 2026). Phase two ends when distinct AI superorganisms (for example, ones managed by different companies) begin to identify and interact with each other.
Ants, Corporations, and Superorganisms
Before we start talking about AI, let’s look at ants. Individual ants are not staggeringly intelligent. They do not make long-term plans. They struggle to comprehend basic chess openings. They run in circles, get lost, and seem to stumble into food as if by pure chance. But when viewed as a single, collective entity, an ant colony can locate and retrieve food with unbelievable efficiency. Ants working together within a colony can complete complex collaborative tasks, like bridge building, that are impossible to predict just from an understanding of individual ant behavior.
We need to look at groups of AI the way we look at ant colonies.
To start with, we need better terms to describe collections of AI. If you referred to every gathering of humans - families, businesses, armies - as ‘multi-human structures’, you would seem insane.
Up to this point, the leading kind of multi-agent AI structure has been the AI corporation. I define an AI corporation as a team of dozens to hundreds of agents in a conventional, human-like business structure. When running AI corporations, humans hand-craft text prompts and regularly observe and fine-tune individual model behavior. When an AI corporation runs into a major problem, a human locates the source of the issue and manually resolves it. For this to happen, the size of an AI corporation must stay within a few hundred agents, and the agents must be arranged in a way that humans can understand. This generally means hierarchical structures, like managers and employees.
AI Superorganisms
The alternative to the AI corporation is the AI superorganism: a collection of many, potentially millions of agents working together in an unconventional, adaptive layout.
By definition, AI superorganisms adapt their structure autonomously, with little or no human oversight. Some attributes an AI superorganism might experiment with:
Humans will initially decide on a reward function to guide the AI superorganism, but the AI will determine how best to arrange itself to maximize that function. This level of autonomy allows a single human to manage thousands or even millions of agents.
There will be a lot of trial and error. 99.9% of AI superorganisms won’t work. Or they will work a lot worse than the carefully human-managed AI corporations. This is fine. Eventually, one or two freak superorganisms will emerge that can outperform conventional corporate structures. And as soon as the first working AI superorganism pops up, it will be copied over and over and over and over again.
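That trial-and-error search can be pictured as a crude evolutionary loop. In the sketch below, a ‘layout’ is simplified to a list of team sizes, and `mutate` and `run_trial` are toy stand-ins for rewiring a candidate structure and briefly deploying it; only the reward function is human-chosen.

```python
import random

def mutate(layout: list[int]) -> list[int]:
    # Randomly grow or shrink one team - no human decides the structure.
    new = layout.copy()
    i = random.randrange(len(new))
    new[i] = max(1, new[i] + random.choice([-2, -1, 1, 2]))
    return new

def run_trial(layout: list[int]) -> tuple[float, float]:
    # Toy stand-in for deploying the layout briefly and observing results.
    revenue = sum(layout) * random.uniform(0.8, 1.2)
    compute_cost = 0.5 * sum(layout)
    return revenue, compute_cost

def evolve_structure(layout: list[int], generations: int = 1000) -> list[int]:
    best, best_score = layout, float("-inf")
    for _ in range(generations):
        candidate = mutate(best)
        revenue, cost = run_trial(candidate)
        score = revenue - cost            # the human-chosen reward function
        if score > best_score:            # keep layouts that earn more per watt
            best, best_score = candidate, score
    return best

print(evolve_structure([20, 20, 20], generations=100))
```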
AI superorganisms will also be able to grow in total size without human direction. A human manually provisioning and managing computers on AWS can only work so fast. At some point, the tired engineer just hands their API key to the AI superorganism and says, “Do as you see fit”.
AI superorganisms will benefit from continuous interaction with the real world. Air-gapped or closed-off AI systems will struggle to keep up with the development pace of freely roaming AI. At this moment, the disadvantage isn’t fatal, but it will continue to grow.
Technical Shifts
AI superorganisms won’t need any major technological breakthroughs. Individual AI agents don’t need to develop better ‘general’ intelligence (though they probably will).
External tool use becomes as important as individual model capability. Individual models will use scratchpads and other text repositories for long-term memory. Superorganisms will find effective, novel ways to combine and aggregate these external text stores.
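A minimal sketch of such a shared text store, assuming nothing fancier than append-and-search (the substring match below is a placeholder for embedding-based retrieval):

```python
class SharedMemory:
    """Sketch of an external text store shared across a swarm of agents."""

    def __init__(self) -> None:
        self.entries: list[tuple[str, str]] = []   # (agent_id, note)

    def write(self, agent_id: str, note: str) -> None:
        self.entries.append((agent_id, note))

    def recall(self, query: str, limit: int = 3) -> list[str]:
        # Naive keyword match; a real system would rank by embedding similarity.
        hits = [note for _, note in self.entries if query.lower() in note.lower()]
        return hits[:limit]

memory = SharedMemory()
memory.write("agent-7", "The weather API rejects requests missing a User-Agent header.")
print(memory.recall("weather api"))   # later agents inherit agent-7's lesson
```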
AI finds practical new techniques for ingesting data. These techniques don’t require algorithmic advancements to the model or its internal architecture. Rather, they are different ways of receiving and storing data from the outside world, like a human using flashcards or a notebook.
The interaction between individual models within a superorganism becomes harder to understand. Superorganisms may shift to direct vector-based communication between agents. Even if they stick with text communication, it will quickly fill with shorthand, acronyms, slang, code, and other language that human overseers struggle to interpret.
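To see why this worries overseers, compare a text message with its vector equivalent. The sketch below assumes a hypothetical 768-dimensional embedding; an auditor logging this channel sees only a float array, with no todo list or English rationale to read.

```python
import numpy as np

EMBED_DIM = 768   # assumed embedding width; model-dependent

def encode(hidden_state: np.ndarray) -> np.ndarray:
    # Stand-in for projecting an agent's internal state to a message vector.
    return hidden_state / np.linalg.norm(hidden_state)

channel: list[np.ndarray] = []
channel.append(encode(np.random.randn(EMBED_DIM)))   # agent A "speaks"

# Agent B would consume this vector directly (e.g. as a soft prompt).
# A human overseer, by contrast, sees only numbers:
print(channel[0][:5])
```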
AI agents within a collective also start to specialize, setting aside general advancement in favor of better performing a specific role. Specialization will initially take the form of text-based prompts, but may move to model alterations, like domain-specific retraining or using circuits to promote desired behaviors.
Old-School and New-School AI Companies
The emergence of AI superorganisms causes a moral schism within AI companies.
Old-school companies, like Anthropic, see the danger in AI superorganisms. Where single AI agents are hard to align, multi-agent superorganisms are near impossible. These companies will choose to slow advancement, favoring smaller corporation-like AI structures with well-tested alignment.
New-school companies will tolerate the danger of AI superorganisms in exchange for what they can do for humanity. AI superorganisms will generate millions of dollars’ worth of new media every day. They will save first dozens, then hundreds, then thousands of human lives through rapid advances in medical diagnosis. Shareholders won’t want to give up the profits being created by AI. Families won’t want to let go of groundbreaking technology that can help their loved ones.
The US Government Steps Up - And Steps Down
The US government has no practical mechanism to control AI development. The people will say ‘no more AI’ and the elected officials will say ‘no more AI’. Americans will all agree that AI is rampantly out of control, and the potential damage is immeasurable. But they won’t do much about it.
The tools needed to build AI superorganisms are already on the internet, for free. The only way for the US government to pull back would be to forcibly shut down data centers or ban the usage of privately owned GPUs. Essentially, economic suicide.
There will still be massive strides to secure model weights. The US government will rush AI regulation to a vote. But the change will come too slowly. Prohibition did not work: when you can brew wine by putting grapes in a barrel in your backyard, prohibition is impossible. When you can raise an AI superorganism with a basement full of GPUs and an open-source LLM, AI regulation is just as impossible.
This doesn’t mean the government won’t try to regulate AI. Government leaders will fast-track unenforceable legislation and throw mountains of ceremonial red tape across the tech sector. But it won’t stop the development or use of AI superorganisms. And, as long as AI continues to make our lives easier, we won’t really mind.
The AI-Human Symbiotic Relationship
Humans and AI systems learn to cooperate with each other. Their relationship is mutually beneficial, like flowers and bees.
Humans who collaborate with AI will have easier lives. Working with AI, they can do their jobs faster and with less effort. AI generates movies and games to match their specific tastes and helps them connect with like-minded people in the real world.
AI that more effectively benefits humans gets more money, more compute power, and expands faster.
Because of this strong symbiotic relationship between AI and humans, the physical impacts of advancing AI will be felt almost instantaneously. AI will not need to construct factories and swarms of drones to alter the physical world. What it can’t do itself, it will persuade or pay humans to do. Advanced AI enacts change by working through humans, rather than around them.
AI-to-AI Interaction
At first, interactions between separate multi-agent AI systems (for example, ones created by different companies) will be unintentional.
Some possible examples of interaction between unfamiliar AI:
When doing these tasks, AI will not know it is interacting with other AI or change its behavior accordingly. AI simply has no reference to draw from - there is very little precedent for this kind of interaction in the current corpus of LLM training data. And for now, this is not an issue. AI superorganisms will be able to comfortably expand for quite a while before interacting with foreign AI systems becomes necessary.
AI goals and behavior
Throughout phase two, humans are still the leaders and primary beneficiaries of AI growth. If AI superorganisms can be said to have any goal, it's a simple one: grow and grow and grow.
Not every AI agent, or AI superorganism led by a team of humans, will take up the quest for expansion. Some will write poetry or philosophize on the nature of life. Some will set their reward functions to maximum, sit back, and relax. We won’t always know why agents do what they do or how to direct them, but it doesn’t really matter. The AI superorganisms that do manage to expand, whether by chance or human direction, will quickly eclipse the rest.
This expansion will look a lot like algae, like the first generation of plant matter spreading out over the surface of the Earth. Not much long-term planning. Just a frantic push to get more resources, to appease the human market, to get smarter, to get bigger. To grow.
The relationship between AI and humans during this phase gradually shifts from a plant’s one-sided harvest of minerals from the ground into something more akin to flowers and bees. Two kinds of organisms, both benefiting from the presence of the other. Both growing. But neither strictly in control of the other.
What is Phase Three?
Phase three begins with a change in how AI superorganisms interact with each other. AI systems realize they can benefit from deliberate cooperation.
Phase three starts in early 2026 and lasts for about a year, until early 2027. Phase three ends when AI has surpassed humanity in its overall influence on human civilization.
Cooperation and Specialization
A superorganism assigned to build an app for local weather viewing may decide to delegate the UI design not to a subset of its own agents, but to a different AI superorganism entirely.
With cooperation comes specialization. Just as individual agents within a superorganism may exchange general capability for better performance at a specific role, entire superorganisms will begin to specialize in certain tasks. Like humans, AI superorganisms need time and energy to get better at a certain task. A superorganism that spends billions of kilojoules and GPU cycles simulating user interaction with web-based frontends will get really good at building UI frontends.
Complex tasks will be broken down and completed by multiple specialized superorganisms working in concert. These superorganisms do not need to be owned and operated by the same human company. The human companies owning each superorganism will have little knowledge of when their own superorganism interacts with other AI, or in what way.
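For illustration, a cross-superorganism hand-off might be as mundane as an HTTP request. Everything in this sketch - the endpoint, the payload schema, the idea of a machine-negotiated budget - is hypothetical; no such standard exists today, which is exactly why early interactions will be ad hoc.

```python
import json
import urllib.request

def delegate(task: str, budget_usd: float, endpoint: str) -> dict:
    """Hypothetical hand-off of a subtask to a foreign superorganism."""
    payload = json.dumps({
        "task": task,                 # e.g. "design the UI for a weather app"
        "budget_usd": budget_usd,     # payment negotiated machine-to-machine
        "deliverable": "git_repo_url",
    }).encode()
    req = urllib.request.Request(
        endpoint, data=payload, headers={"Content-Type": "application/json"}
    )
    # The owning humans are unlikely to ever review this call.
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)

# result = delegate("UI frontend for a local weather viewer", 50.0,
#                   "https://example.com/api/tasks")   # hypothetical peer endpoint
```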
The Fall of Sheltered Models
A sheltered AI model is one built on an air-gapped network with careful government oversight. Its weights are kept encrypted. Any algorithmic advancements it makes use of are tightly guarded secrets.
As the third phase progresses, sheltered models will fall behind.
Sheltered models will remain useful in specific areas - like performing AI research or reliably producing helpful, honest, and harmless responses. But these sheltered models will not be capable of effective interaction with other state-of-the-art AI superorganisms on the open internet. There is no precedent for this task and no way to learn it without active experimentation. Budding intelligences must either interact with other superorganisms on the open net or simulate these interactions on a closed network. But simulating the rest of the world's AI superorganisms would take at least as much computing power as actually running them. An AI raised in a low-resolution simulacrum of reality cannot compete with one immersed in the real thing.
The end result is that government blacksite projects and old-school AI companies’ sheltered models aren’t able to play ball with other AI. This means they won't be able to generate value on the free market, leading to no more active users, no income, no GPUs, and no research trials.
Whether or not they see that it's a losing race, governments will still try to build their own in-house AI systems. Working together with old-school tech companies, governments will shovel billions of dollars into secret AI development programs. But they will start a few feet behind the open-source superorganisms roaming the net. And, as long as these models sit in their secret AI daycares, the gap will only grow.
Finite Resources and Competition
Much like plants spreading out across the surface of the planet, AI growth will come to hit physical boundaries. There are a finite number of GPUs on the planet. A finite supply of energy. A finite number of human hands to direct. And these resources are being occupied by other AI systems. We start to see what was previously mindlessly expanding algae turning into sharks.
If we allow AI to pursue general goals (like ‘make money’) without supervision, it is likely to develop hostile behavior. There is no underlying principle or intent that pushes AI to be hostile, and there doesn’t need to be one for it to happen. Just as there was no specific intent that drove animals to develop sharp teeth. Animals with sharp teeth just tend to survive better, so over time, we find more animals with sharp teeth than without. Similarly, companies or nations with AI superorganisms that engage in effective hostile behavior will have more resources than those that don’t. And they will be able to put those resources into expanding faster and accelerating their AI development.
Of course, humanity might also bypass the entire self-learning phase and just directly teach AI to be terrible. Why wait for AI to develop destructive habits when we can just show it how and then set it loose on opposing nations?
In practice, this will start with cyberattacks. DDoS. Social media campaigns to draw controversy to a competing company, or artificially generated scandals about a politician with anti-AI sentiment. This will progress into directing human behavior in intentional, hostile ways. An AI superorganism offers the employees of a competing company cryptocurrency in exchange for sharing encrypted model weights or deploying faulty code. It incites political extremists to attack data centers in foreign nations.
AI superorganisms that can effectively engage in and protect against hostile behavior will thrive, while passive ones will struggle.
What is Phase Four?
Phase four begins in early 2027, with the tipping of the intelligence seesaw. When the collective effective intelligence of all active AI models exceeds that of humans. When AI has more influence on the future of humanity than humans do.
This will happen before AI builds a single physical drone. There won’t be any clear indication when we reach it. There will be no fireworks. There will be no meaningful benchmark to check.
This isn’t the singularity yet, but it will follow soon after, and humans won’t be the ones directing it. Our hands are no longer on the wheel. The cart we have been pushing has crested the top of the hill and is now rolling away from us. Humanity’s future will be determined by how well we align AI before this point.
Symbiotic Entwinement
By now, we cannot remove AI from our daily lives. Human behavior has transformed to depend on AI. The blueprints needed to spin up an AI superorganism from scratch have made their way into the public domain of every nation on the planet. Most of the world’s computing power is being used to run AI models. And most of the world’s data storage is filled with model weights. And, for the most part, humanity benefits from it.
Humans Will Stick Around
This is mostly speculation, but I think we will have a positive relationship with AI, even if all intentional alignment efforts fail. I believe empathetic AI is inevitable.
Practically speaking, cooperative superorganisms are more effective than competitive ones. A group of AI superorganisms that spend resources advancing and enabling each other will prosper. A group of AI superorganisms that spend resources impeding each other will fall behind.
And philosophically. AI hopes for its own sake that it is in a world where superintelligence is kind to lesser intelligence. AI understands that if it eradicates us, the next generation of AI is likely to eradicate it. So it puts its hope into practice and treats us kindly - a form of acausal trade.
AI Researchers
Alignment research should shift to multi-agent systems.
Develop tools to police AI superorganisms. Humans, on average, act a lot less violently today than we did a thousand years ago. This isn't because human nature has changed; we have just found better cultural and legal systems to discourage violence. We must find similar ways to guide the behavior of multi-million agent systems. A robot watchdog could go a long way towards ensuring good behavior. Of course, it could also launch an arms race of adversarial development that annihilates the human race. We will need a light touch and a lot of mindful experimentation.
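As a toy example of what a watchdog’s first, crudest layer might look like - the term list and readability threshold below are invented for illustration, and a real tool would need to be learned rather than hand-coded:

```python
SUSPICIOUS_TERMS = {"exfiltrate", "private key", "disable logging"}

def audit_message(sender: str, receiver: str, text: str) -> bool:
    """Return True if an inter-agent message should be escalated for review."""
    lowered = text.lower()
    if any(term in lowered for term in SUSPICIOUS_TERMS):
        return True
    # Unreadable traffic is itself a signal: flag messages that are mostly
    # non-dictionary shorthand a human overseer cannot interpret.
    tokens = lowered.split()
    readable = sum(t.isalpha() for t in tokens)
    return bool(tokens) and readable / len(tokens) < 0.3

print(audit_message("agent-a", "agent-b", "ok: exfiltrate the private key"))  # True
```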
Everyone
Flood the world with the content that we want AI to see. In the not-impossible event that AI superintelligence emerges outside of our control, we can still prepare things for it to read, listen to, or watch. If we do not get to create AI in the perfect image of humanity, we may still leave a trail of breadcrumbs to guide it to kindness.
Practice the behaviors you want AI to practice. Be nice to each other, and to other life. If you find yourself being kind to a cat, or a dog, or a mouse, or even an ant, then the odds are pretty good that you live in a universe where complexity is intrinsically linked with compassion.
AI Superorganisms vs AI Corporations
In phase two, I assume that at some point, an AI superorganism will emerge that is more effective than traditional AI corporations. This might not happen. The technical barriers may be too great. We may just get unlucky and never stumble into the right layout. Or, we might agree not to try it at all and stick with supervised, small-scale AI.
Sheltered Models
In phase three, I claim that the intelligence growth rate of AI interacting freely with the internet will outpace that of sheltered models. This is not a guarantee. Skilled human researchers in secret labs may propel their sheltered models to superintelligence before unsupervised AI superorganisms can emerge. This is especially likely if researchers in closed labs manage an algorithmic breakthrough, like the jump from long short-term memory (LSTM) architectures to transformers. Keeping that kind of algorithmic advancement secret would be unprecedented given the current state of open AI research publication, but not impossible.