Our government is determined to lose the AI race in the name of winning the AI race.
The least we can do, if prioritizing winning the race, is to try and actually win it.
This is a bizarre pair of claims to make. But I think it illustrates a surprisingly common mistake from the AI safety community, which I call "jumping down the slippery slope". More on this in a forthcoming blog post, but the key idea is that when you look at a situation from a high level of abstraction, it often seems like sliding down a slippery slope towards a bad equilibrium is inevitable. From that perspective, the sort of people who think in terms of high-level abstractions feel almost offended when people don't slide down that slope. On a psychological level, the short-term benefit of "I get to tell them that my analysis is more correct than theirs" outweighs the long-term benefit of "people aren't sliding down the slippery slope".
One situation where I sometimes get this feeling is when a shopkeeper charges less than the market rate, because they want to be kind to their customers. This is typically a redistribution of money from a wealthier person to less wealthy people; and either way it's a virtuous thing to do. But I sometimes actually get annoyed at them, and itch to smugly say "listen, you dumbass, you just don't understand economics". It's like a part of me thinks of reaching the equilibrium as a goal in itself, whether or not we actually like the equilibrium.
This is obviously a much worse thing to do in AI safety. Relevant examples include Situational Awareness and safety-motivated capability evaluations (e.g. "building great capabilities evals is a thing the labs should obviously do, so our work on it isn't harmful"). It feels like Zvi is doing this here too. Why is trying to actually win it the least we can do? Isn't this exactly the opposite of what would promote crucial international cooperation on AI? Is it really so annoying when your opponents are shooting themselves in the foot that it's worth advocating for them to stop doing that?
It kinda feels like the old joke:
On a beautiful Sunday afternoon in the midst of the French Revolution the revolting citizens led a priest, a drunkard and an engineer to the guillotine. They ask the priest if he wants to face up or down when he meets his fate. The priest says he would like to face up so he will be looking towards heaven when he dies. They raise the blade of the guillotine and release it. It comes speeding down and suddenly stops just inches from his neck. The authorities take this as divine intervention and release the priest.
The drunkard comes to the guillotine next. He also decides to die face up, hoping that he will be as fortunate as the priest. They raise the blade of the guillotine and release it. It comes speeding down and suddenly stops just inches from his neck. Again, the authorities take this as a sign of divine intervention, and they release the drunkard as well.
Next is the engineer. He, too, decides to die facing up. As they slowly raise the blade of the guillotine, the engineer suddenly says, "Hey, I see what your problem is ..."
Zvi is arguing "X implies Y" here. Zvi happens to believe Y but disbelieve X; however, he is writing to people who think "X and not-Y", in order to nudge them to support Y.
Here X = it is good for the US to build superintelligence fast, before China does, and Y = we should have some diffusion rules making it harder for China to catch up to the USA.
Zvi believes Z = nobody should be building superintelligence soon, and believes Z implies Y, but it is useful to show that X implies Y as well.
Hot (?) take, the USG shooting itself in the foot as it pertains to AI is good actually and we should not be risking interrupting them.
Like, okay, there are different ways the USG could shoot itself in the foot:
(1) is obviously bad. But (3) and (4) are great[1].
And (2) is also potentially good: because the others would use the resources worse, or wouldn't use them for AGI acceleration at all.
Like, take China. Its mindset is famously that of a "fast follower", with dicey attempts at innovation being internally unpopular[2]; and Chinese AI researchers probably know as much about the world-domination fantasies motivating American CEOs as they do about AGI doom (i.e., barely anything, reportedly). So there's neither willingness nor motivation to race to AGI there. ... Unless the US AGI labs succeed at manufacturing that race. Which, come to think of it, they would stop trying to do if they started believing they'd lose it.
So the US AGI labs losing beneficial access to the raw resources that could be converted into AI progress (chips, energy, talent) is good in my books.
There's a potential argument here that it would be better to have the US AGI companies ahead, because they're more likely to get AGI right due to being more safety-conscious, or offer us more opportunities to reform them into being properly safety-conscious. I don't think much of that argument. I would rather have e.g. 3 more years until AGI than bet on fringe possibilities like those.
There's also an argument that keeping the raw resources in Western hands would make it easier to ban AGI research (by controlling supply chains and/or negotiating an international ban with China) if we do manage to wake the USG up to the omnicide risk. This is a more solid argument... But still something I'd trade away for timelines a few years longer.
As they pertain to slowing down AI progress, I mean. Obviously they can be parts of overall-terrible-for-the-world policies like tariffs or restricting immigration.
DeepSeek is not evidence against this vision, but rather, its confirmation: they did not innovate, only reverse-engineered and optimized.
I disagree on DeepSeek and innovation. Yes, R1 is obviously a reaction to o1, but its MoE model is pretty innovative, and it is Llama 4 that obviously copied DeepSeek. I do agree that innovation is unpopular in China; but from interviews with DeepSeek founder Liang Wenfeng, we know DeepSeek was explicitly an attempt to overcome China's unwillingness to innovate.
it is Llama 4 that obviously copied DeepSeek
DeepSeek-V3's MoE architecture is unusual in having high granularity, 8 active experts rather than the usual 1-2. Llama 4 Maverick doesn't do that[1]. The closest thing is the recent Qwen3-235B-A22B, which also has 8 active experts.
From the release blog post:
As an example, Llama 4 Maverick models have 17B active parameters and 400B total parameters. ... MoE layers use 128 routed experts and a shared expert. Each token is sent to the shared expert and also to one of the 128 routed experts.
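The granularity distinction above comes down to simple arithmetic: how many experts a token's compute is spread across per layer. A minimal sketch, using illustrative numbers rather than the real model configs (the expert counts echo the text; the parameter totals are made up for the example):

```python
# Rough sketch of how MoE "granularity" changes per-token active-parameter math.
# All parameter counts are illustrative assumptions, not official model configs.

def active_expert_params(total_expert_params: int, n_experts: int, n_active: int) -> int:
    """Parameters actually used per token in the routed-expert part of one layer."""
    per_expert = total_expert_params // n_experts
    return per_expert * n_active

# Coarse routing in the Llama 4 Maverick style: 128 routed experts, 1 active per token.
coarse = active_expert_params(total_expert_params=128_000, n_experts=128, n_active=1)

# Fine-grained routing in the DeepSeek-V3 style: more, smaller experts, 8 active per token.
fine = active_expert_params(total_expert_params=128_000, n_experts=256, n_active=8)

print(coarse, fine)
```

The point of the sketch: with the same total expert capacity, fine-grained routing mixes several small experts per token instead of committing each token to one big expert, which is the design choice the comment calls unusual.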
its MoE model is pretty innovative
I would roughly punt it into the category of "optimization", not "innovation". "Innovation" is something like transformers, instruct-training, or RL-on-CoTs. MoE scaling is an incremental-ish improvement.
Or, to put it in other words: it's an innovation in the field of compute-optimal algorithms/machine learning. It's not an AI innovation.
But from interviews of DeepSeek founder Liang Wenfeng, we know DeepSeek was explicitly an attempt to overcome China's unwillingness to innovate
Yes, and we're yet to see them succeed. And with the CCP having apparently turned its sights on them, that attempt may be thoroughly murdered already.
Umm... I have already warned that inner troubles of the American administration might cost OpenBrain lots of compute, which can influence the AI race.
I have also made a comment where I tried to show that US leadership would be undermined by a Taiwan invasion unless US domestic chip production dominates China's. It would be especially terrifying to discover that OpenBrain and DeepCent have similar amounts of compute while neither side[1] can increase these amounts faster than the other (and I did imply something similar in the comment I made!), since neither the US nor China can slow down without an international deal. And a hypothetical decay of the US makes matters worse for the US.
Moreover, I have mentioned the possibility that the US administration realises that they have a misaligned AI, but without the AI-driven transformation of the economy the US will be unable to produce more chips and/or energy, leaving China with leadership. Then the US could be forced to let the AI transform the economy. Or threaten to unleash the misaligned AI unless China somehow surrenders its potential leadership...
Could you ask the AI-2027 team to reconsider the compute forecast and estimate the influence of the revised compute and power of the AIs on the other aspects of the scenario?
The slowdown ending of AI-2027.com had OpenBrain receive compute by merging with its rivals. The collapsed section about the Indo-Pakistan nuclear war (which would be equivalent to the Taiwan invasion in 2025) in my comment describes a situation where OpenBrain and its former rivals have an amount of compute similar to that of DeepCent and its former rivals.
To not only fail to robustly support and bring down regulatory and permitting barriers to the nuclear power we urgently need to support our data centers
Nuclear power for data centers is nice in theory, but if you have timelines ~8 years or less for superintelligence, then for practical purposes basically none of the relevant data centers will be powered by new nuclear plants in the US regardless. Of course if they get rid of enough government support they might force some of the old ones to shut down, but that seems unlikely.
Our government is determined to lose the AI race in the name of winning the AI race.
The least we can do, if prioritizing winning the race, is to try and actually win it.
It is one thing to prioritize ‘winning the AI race’ against China over ensuring that humanity survives, controls and can collectively steer our future. I disagree with that choice, but I understand it. This mistake is very human.
I also believe that more alignment and security efforts at anything like current margins not only do not slow our AI efforts, they would actively help us win the race against China, by enabling better diffusion and use of AI, and ensuring we can proceed with its development. So the current path is a mistake even if you do not worry about humanity dying or losing control over the future.
However, if you look at the idea of building smarter, faster, more capable, more competitive, freely copyable digital minds we don’t understand that can be given goals and think ‘oh that future will almost certainly stay under humanity’s control and not be a danger to us in any way’ (and when you put it like that, um, what are you thinking?) then I understand the second half of this mistake as well.
What is not an understandable mistake, what I struggle to find a charitable and patriotic explanation for, is to systematically cripple or give away many of America’s biggest and most important weapons in the AI race, in exchange for thirty pieces of silver and some temporary market share.
To continue alienating our most important and trustworthy allies with unnecessary rhetoric and putting up trade barriers against them. To attempt to put tariffs even on services like movies where we already dominate, and otherwise give the most important markets, like the EU, every reason in their minds to put up barriers to our tech companies and question our reliability as an ally. And simultaneously, in the name of building alliances, to place the most valuable resources with unreliable partners like Malaysia, Saudi Arabia, Qatar and the UAE.
Indeed, we have now scrapped the old Biden ‘AI diffusion’ rule with no sign of its replacement, and where did David Sacks gloat about this? Saudi Arabia, of course. This is what ‘trusted partners’ means to them. Meanwhile, we are warning sternly against use of Huawei’s AI chips, ensuring China keeps all those chips itself. Our future depends on who has the compute, who ends up with the chips. We seem to instead think the future is determined by the revenue from chip manufacturing? Why would that be a priority? What do these people even think is going on?
To not only fail to robustly support and bring down regulatory and permitting barriers to the nuclear power we urgently need to support our data centers, but to actively wipe out the subsidies on which the nuclear industry depends, as the latest budget aims to do with remarkably little outcry via gutting the LPO and tax credits, while China of course ramps up its nuclear power plant construction efforts, no matter what the rhetoric on this might say. Then to use our inability to power the data centers as a reason to put our strategically vital data centers, again, in places like the UAE, because they can provide that power. What do you even call that?
To fail to let our AI companies have the ability to recruit the best and brightest, who want to come here and help make America great, instead throwing up more barriers and creating a climate of fear I’m hearing is turning many of the best people away.
And most of all, to say that the edge America must preserve, the ‘race’ that we must ‘win,’ is somehow the physical production of advanced AI chips. So, people say, in order to maintain our edge in chip production, we should give that edge entirely away right now, allowing those chips to be diverted to China, as would be inevitable in the places that are looking to buy where we seem most eager to enable sales. Nvidia even outright advocates that it should be allowed to sell to China openly, and no one in Washington seems to hold them accountable for this.
And we are doing all this while many perpetuate the myth that our AI efforts are not very solidly ahead of China in the places that matter most, or threaten to lock in the world’s customers, because DeepSeek which is impressive but still very clearly substantially behind our top labs, or because TikTok and Temu exist while forgetting that the much bigger Amazon and Meta also exist.
Temu’s sales are less than a tenth of Amazon’s, and the rest of the world’s top four e-commerce websites are Shopify, Walmart.com and eBay. As worrisome as it is, TikTok is only the fourth largest social media app behind Facebook, YouTube and Instagram, and there aren’t signs of that changing. Imagine if that situation was reversed.
Earlier this week I did an extensive readthrough and analysis of the Senate AI Hearing.
Here, I will directly lay out my response to various claims by and cited by US AI Czar David Sacks about the AI Diffusion situation and the related topics discussed above.
There are multiple distinct forms of Obvious Nonsense to address, either as text or very directly implied, whoever you attribute the errors to:
David Sacks (US AI Czar): Writing in NYT, former Google CEO Eric Schmidt warns that “China Tech Is Starting to Pull Ahead”:
“China is at parity or pulling ahead of the United States in a variety of technologies, notably at the A.I. frontier. And it has developed a real edge in how it disseminates, commercializes and manufactures tech. History has shown us that those who adopt and diffuse a technology the fastest win.”
As he points out, diffusing a technology the fastest — and relatedly, I would add, building the largest partner ecosystem — are the keys to winning. Yet when Washington introduced an “AI Diffusion Rule”, it was almost 200 pages of regulation hindering adoption of American technology, even by close partners.
The Diffusion Rule is on its way out, but other regulations loom.
President Trump committed to rescind 10 regulations for every new regulation that is added.
If the U.S. doesn’t embrace this mentality with respect to AI, we will lose the AI race.
Sriram Krishnan: Something @DavidSacks and I and many others here have been emphasizing is the need to have broad partner ecosystems using American AI stack rather than onerous complicated regulations.
If the discussion was ‘a bunch of countries like Mexico, Poland and Portugal are in Tier 2 that should instead have been in Tier 1’ then I agree there are a number of countries that probably should have been Tier 1. And I agree that there might well be a simpler implementation waiting to be found.
And yet, why is it that in practice, these ‘broad partner ecosystems using American AI’ always seem to boil down to a handful of highly questionably allied and untrustworthy Gulf States with oil money trying to buy global influence, perhaps with a side of Malaysia and other places that are very obviously going to leak to China? David Sacks literally seems to think that if you do not literally put the data center in specifically China, then that keeps it in friendly hands and out of China’s grasp, and that we can count on our great friendships and permanent alliances with places like Saudi Arabia. Um, no. Why would you think that?
That Eric Schmidt editorial quoted above is a royal mess. For example, you have this complete non-sequitur.
Eric Schmidt and Selina Xu: History has shown us that those who adopt and diffuse a technology the fastest win.
So it’s no surprise that China has chosen to forcefully retaliate against America’s recent tariffs.
China forcefully retaliated against America’s tariffs for completely distinct reasons. The story Schmidt is trying to imply here doesn’t make any sense. His vibe reports are Just So Stories, not backed up at all by economic or other data.
‘By some benchmarks’ you can show pretty much anything, but I mean wow:
Eric Schmidt and Selina Xu: Yet, as with smartphones and electric vehicles, Silicon Valley failed to anticipate that China would find a way to swiftly develop a cheap yet state-of-the-art competitor. Today’s Chinese models are very close behind U.S. versions. In fact, DeepSeek’s March update to its V3 large language model is, by some benchmarks, the best nonreasoning model.
Look. No. Stop.
He then pivots to pointing out that there are other ‘tech’ areas where China is competitive, and goes into full scaremonger mode:
Apps for the Chinese online retailers Shein and Temu and the social media platforms RedNote and TikTok are already among the most downloaded globally. Combine this with the continuing popularity of China’s free open-source A.I. models, and it’s not hard to imagine teenagers worldwide hooked on Chinese apps and A.I. companions, with autonomous Chinese-made agents organizing our lives and businesses with services and products powered by Chinese models.
As I noted above, ‘American online retailers like Amazon and Shopify and the social media platforms Facebook and Instagram are already not only among but the most used globally.’
There is a stronger case one can make with physical manufacturing, when Eric then pivots to electric cars (and strangely focuses on Xiaomi over BYD) and industrial robotics.
Then, once again, he makes the insane ‘the person behind is giving away their inferior tech so we should give away our superior tech to them, that’ll show them’ argument:
We should learn from what China has done well. The United States needs to openly share more of its A.I. technologies and research, innovate even faster and double down on diffusing A.I. throughout the economy.
When you are ahead and you share your model, you give your rivals that model for free, killing your lead and your business for some sort of marketing win, and also you’re plausibly creating catastrophic risk. When you are behind, and you share it, sure, I mean why not.
In any case, he’s going to get his wish. OpenAI is going to release an open weight reasoning model, reducing America’s lead in order to send the clear message that yes we are ahead. Hope you all think it was worth it.
The good AI argument is that China is doing a better job in some ways of AI diffusion, of taking its AI capabilities and using them for mundane utility.
Similarly, I keep seeing forms of an argument that says:
I’m sorry, what?
At lunch during Selina’s trip to China, when U.S. export controls were brought up, someone joked, “America should sanction our men’s soccer team, too, so they will do better.” So that they will do better.
It’s a hard truth to swallow, but Chinese tech has become better despite constraints, as Chinese entrepreneurs have found creative ways to do more with less. So it should be no surprise that the online response in China to American tariffs has been nationalistic and surprisingly optimistic: The public is hunkering down for a battle and thinks time is on Beijing’s side.
I don’t know why Eric keeps talking about the general tariffs or trade war with China here, or rather I do and it’s very obviously a conflation designed as a rhetorical trick. That’s a completely distinct issue, and I here take no position on that fight other than to note that our actions were not confined to China, and we very obviously shouldn’t be going after our trading partners and allies in these ways – including by Sacks’s logic.
The core proposal here is that, again:
It’s literal text. “America should sanction our men’s soccer team, too, so they will do better.” Should we also go break their legs? Would that help?
Then there’s a strange mix of ‘China is winning so we should become a centrally planned economy,’ mixed with ‘China is winning so we cannot afford to ever have any regulations on everything.’ Often both are coming from the same people. It’s weird.
So, shouting from the rooftops, once more with feeling for the people in the back:
Or, as Derek Thompson put it:
Derek Thompson: Trump’s new AI directive (quoted below from David Sacks) argues the US should take care to:
– respect our trading partners/allies rather than punish them with dumb rules that restrict trade
– respect “due process”
It’d be interesting to apply these values outside of AI!
Jordan Schneider: It’s an NVDA press release. Just absurd.
David Sacks continues to beat the drum that the diffusion rule ‘undermines the goal of winning the AI race,’ as if the AI race is about Nvidia’s market share. It isn’t.
If we want to avoid allocations of resources by governmental decision, overreach of our executive branch authorities to restrict trade, alienating US allies and lack of due process, Sacks’s key points here? Yeah, those generally sound like good ideas.
To that end, yes, I do believe we can improve on Biden’s proposed diffusion rules, especially when it comes to US allies that we can trust. I like the idea that we should impose fewer trade restrictions on these friendly countries, so long as we can ensure that the chips don’t effectively fall into the wrong hands. We can certainly talk price.
Alas, in practice, it seems like the actual plans are to sell massive amounts of AI chips to places like UAE, Saudi Arabia and Malaysia. Those aren’t trustworthy American allies. Those are places with close China ties. We all know what those sales really mean, and where they could easily be going. And those are chips we could have kept in more trustworthy and friendly hands, that are eager to buy them, especially if they have help facilitating putting those chips to good use.
The policy conversations I would like to be having would focus not only on how to best supercharge American AI and the American economy, but also on how to retain humanity’s ability to steer the future and ensure AI doesn’t take control, kill everyone or otherwise wipe out all value. And ideally, to invest enough in AI alignment, security, transparency and reliability that there would start to be a meaningful tradeoff where going safer would also mean going slower.
Alas. We are massively underinvesting in reliability and alignment and security purely from a practical utility perspective, and we are not even having that discussion.
Instead we are having a discussion about how, even if your only goal is ‘America must beat China and let the rest handle itself,’ to stop shooting ourselves in the foot on that basis alone.
The very least we can do is not shoot ourselves in the foot, and not sell out our future for a little bit of corporate market share or some amount of oil money.