AI can make a country much less sensitive to damage from a second strike, because AI infrastructure can be dispersed and any AIs that are "killed" during an attack can be restored from backup or from surviving copies once the war is over. Unlike humans, an AI could be fine with 99% destruction of their own side, as long as the enemy is 100% destroyed, since the surviving 1% could rebuild/restore and then take over the lightcone without opposition.
(Not sure if you overlooked this, or are deliberately limiting the scope of your analysis.)
There's two ideas mixed together here in this comment, some of which I'm planning on covering in another piece on the effects of automation on deterrence.
The first is something I've called the "size" of a decisive strategic advantage. Agents that primarily care about future outcomes/are extremely reconstitutable/highly risk tlerant need a much smaller technological or military lead in order to be willing to strike and secure a DSA, since they are more willing to absorb immediate damage for long-term strategic gain. Agents that value existing assets (e.g. states caring about their civilians), on the other hand, are much easier to threaten, and so need to be so strategically superior that they are not only guaranteed to win a conflict but to do so in an essentially costless way. A democratic U.S. would not try to overpower the Chinese government with new AI-enabled superweapons unless it was basically certain it could do so without suffering a catastrophic response.
This applies to AIs or states! An authoritarian country would have fewer qualms about the sacrifices they need to make for long-term security than a democratic one, nevermind a misaligned AI. It's just a question of how easy the things the agent values are to threaten.
The second, related idea is that this creates a perverse strategic incentive to stop caring about your civilians once you've automated away the instrumental need to defend them. This is the military version of the gradual disempowerment story: once the industrial base is automated, and if the key decision makers are secured from consequences, it becomes easier to trade off on the security of your civilians for strategic gain. I think the primary risk here is less leaders jumping the gun to "win" a nuclear exchange (or whatever the equivalent is for future superweapons), and moreso that it will make it more attractive to do dangerous things like capture resources or negotiate under brinksmanship. Not that it's any better to die in an accidental war over a U.S. demand for a bigger slice of lunar surface area than a direct war of conquest in mainland China, of course.
Damn, you stole a broad thrust of my essay idea. Good post wish I would have developed mine quicker
I personally think more AIxNuclear stuff would be very valuable, especially if it’s from a properly ASI-pilled perspective. A lot of existing writing on AI‘s effect on nuclear deterrence is either mired in an AI as a Normal Technology perspective, or doesn’t follow the implications all the way through to the end. There’s plenty of natsec people that will agree that AGI is a Big Deal (TM), but refuse to consider what happens if that lets you do crazy things like automating your industrial base and get it to grow exponentially.
Crosspost of my substack piece, covering quick thoughts on AI overcoming nuclear deterrence. TLDR: Nuclear deterrents likely only buy time to further invest in more resilient second-strike guarantees: without a comparable AI base, this will not happen fast enough and even nuclear states will eventually be disempowered.
Historically, plenty of new military technologies have stress-tested nuclear deterrence. ICBMs made it possible to annihilate enemy cities from the safety of the homeland, MIRVs let a single rocket threaten multiple targets, and thermonuclear staging allowed weapons designers to reach functionally unlimited yield. In the already volatile climate of the Cold War, the U.S. and Soviets reached such mastery over missile technology that remote annihilation of an entire country was, quite literally, a button press away.
For decades, even a single rocket has been able to hold more than 10 warheads--each enough to destroy a city on their own. Peacemaker reentry tests pictured above.
The fact that the ability to remote detonate Moscow never translated into a nuclear war is a function of modern deterrence theory, dumb luck, and most importantly, the speed of progress. As effective as a modern ICBM is, each piece of it was individually low-impact enough, and introduced slowly enough, that there was never a point at which deterrence could be fully overturned. For comparison, imagine if the U.S. had acquired a fully realized ICBM in the mid 50s, back when the Soviets were still using bombers and hadn’t yet fielded a nuclear submarine. The U.S. would have been dearly tempted to strike first before the Soviets managed to diversify their nuclear forces, much as the Soviets would have been tempted to lash out before America decided to drop the guillotine.
Fortunately, the march of progress has always been slow enough to let rival states proactively invest in their second strike assurances. Unfortunately, the march is about to turn into a sprint. In the process of recursive self-improvement towards godlike superintelligence, the American government is going to stumble onto the obvious idea of using it to automate military R&D---and in the process, likely leap several years, decades, or centuries up the tech tree relative to their rivals. For this technological edge to translate into a decisive strategic advantage, however, states would need to overcome even the most potent nuclear deterrents their rivals could build.
Broadly, this could happen in three ways.
Splendid First Strikes: In order for a first strike to succeed, the attacker would need to either find and destroy all counterforce targets, or to fully decapitate strategic command and control. Broadly, I think that this would be possible with a large technological lead, but not with a high enough level of certainty to justify the risk of a proactive first strike.
In order for a counterforce strategy to succeed, a country would need to simultaneously find and destroy every leg of the defending state’s nuclear triad, including their land silos/mobile launchers, bombers, and SSBNs. This could either be accomplished through detection technology that narrows down the area in which the counterforce is located (ex: ocean wake mapping for satellites), or by simply flooding the oceans and space with autonomous sensors. Even once located, however, the attacker would still need to simultaneously destroy each target, leaving no time for the defender to authorize a retaliatory strike from the surviving counterforce. This limitation is especially constraining for SSBNs, given that the attacker would need to spend their finite reserve of nuclear warheads on large swathes of ocean in order to be confident the subs were destroyed (a much more severe limitation for China, given that it only has ~600 nuclear warheads overall). I place low credence on nuclear deterrence being undermined through counterforce alone, especially since defending states can cheaply invest in camouflage and decoy vehicles to increase the filtering and targeting requirements.
There are similar coverage problems with attempting to sever NC3. Here, the challenge is to destroy the central command and satellite command nodes, as well as proactively sabotaging any automatic retaliatory systems that exist. These, of course, are highly redundant in terms of both personnel and communications tech, so even a massive set of assassinations on the line of succession and a shuttering of internet infrastructure wouldn’t prevent a retaliatory order from being issued through EMP resistant satellites or a SAOC. More realistically, you’d use a decapitation strike to suppress decision making for a few minutes or hours, buying you more time to hunt down the remaining counterforce and relax the simultaneity requirement.
WMD Defenses: Alternatively, states could try to neutralize a retaliatory strike. While this could theoretically be possible with technology that enables faster boost phase interception (e.g. much-improved DEWs or space-based interceptors) or massive increases in industrial output, there are three massive problems with defense.
Still, nuclear defenses don’t need to succeed on their own: they only need to be successful enough to mop up the defender’s surviving missiles against an initial strike. Even though I find it unlikely that a state would be able to simultaneously destroy all major and satellite launch nodes, it seems plausible to destroy a large enough percentage to make a combined effort successful.
Escalation management: States could also be less obviously disempowered by salami-slicing and persuasion. Rather than try to outright destroy or neutralize a rival’s nuclear deterrent, a state with a massive technological and industrial lead could simply invest in building up its coercive leverage, then using it to demand individual concessions. If the U.S. wanted to push for Taiwan’s independence from China, for example, it could use its AI surplus to incrementally achieve a massive conventional military overmatch, and use sophisticated propaganda to push for an elite consensus that war with the U.S. over Taiwan would be unwinnable and result in an embarrassing defeat. Similarly, (individually deniable) automated grey zone attacks could be used to attack rival industrial output, economic growth, and military R&D, allowing the leading nation(s) to further compound their relative advantage until they reach a point of strategic dominance. Even though the U.S. never militarily defeated the Soviet Union, its economic advantage allowed it to maintain an extremely costly arms race with its rival, the economic pressure of which eventually contributed to its political collapse.
The problem with this strategy is that it’s very difficult to predict at what point a demand stops being sub-nuclear. The decision to escalate is a function of often arbitrary perceptions about regime survival, domestic politics, and even personal honor. A leader could absorb a great deal of pain without escalating, or overreact violently to a minor provocation that happens to hit a nerve. To compensate for this uncertainty, your AI systems would therefore need to be able to both increase a state’s military capacity to disempower its rivals in a deniable way, and to be able to accurately simulate or manipulate their decision making.
That is not to say that these are impossible capabilities to have. Generally superhuman AI systems will, necessarily, be superhuman in their ability to charismatically persuade decision makers, and would allow for simultaneously massive and personalized information campaigns. What’s less obvious is whether this persuasion would be strong enough to manipulate leaders on particularly vital decisions, and whether it would be “offense-dominant” against other AI systems providing counsel and analysis of its arguments. Tentatively, I expect that superpersuasion would be very effective against an ordinary human without this assistance (given that algorithmic content is already so effective at invisibly shaping preferences), but that the defensive use of AIs for epistemics would prevent decision makers from being arbitrarily manipulated (since these systems will have higher trust and the advantage of arguing for the truth).
So, to answer the relevant question: would the U.S. be capable of undermining nuclear deterrence with a large enough lead in AI? In descending order of difficulty:
Overall, I expect that conventional nuclear deterrence will primarily serve as a means to buy time for a state to advance its own AI capabilities and to diversify its second strike assurances accordingly. If a nuclear state has no capacity to deploy or develop AI, then this time will not be useful, and it will eventually be destroyed through a combination of advanced technology and industrial attrition.