Reflections on "Making the Atomic Bomb"

[-]Daniel Kokotajlo2y70

Do you know, Josef Vassarionovich, what main argument has been advanced against uranium? “It would be too good if the problem could be solved. Nature seldom proves favorable to man.” — Letter from Georgi Flerov to Joseph Stalin, April 1942.

Huh, this reminds me of the current discourse around AGI and R&D acceleration and so forth. A lot of people seem to think something like this, they just aren't willing to say it so bluntly. "No way does scaling up current systems just work, it can't be that easy, more breakthroughs must be needed." "Surely there are all sorts of bottlenecks and things, no way is Dyson Swarm possible in our lifetimes even for a society of superintelligences."

[-]Daniel Kokotajlo2y62

First, it fetishizes intelligence.

What does that mean?

...would you not complain if the quote was "Every eighteen months, the probability that a randomly selected human could, if they wanted to, destroy the world, doubles."

[-]Daniel Kokotajlo2y40

It is much more complex.

I agree.

Worth noting that this is a reason to think that it's more likely for the people building AGI to mess up on alignment/safety/control/etc., due to having less understanding of how it works.

[-]Noosphere892y3-1

I think the most important implication, assuming that this assumption is right below, is that it would be better to regulate use of AI and superintelligence, rather than assuming that we realistically can slow it down significantly or stop it.

This implies that AI governance should focus less on slowdowns and more on how to make civilization thrive even under progress in AI intelligence.

In particular, I think this is going to cause a stir in the AI governance people because one of it's conclusions go against a central belief of AI governance people: That slowing down AI is the best course of action.

AI is like the bomb: the impossible vs. inevitable phenomenon.

Like the case of the bomb, I think building AI systems, including AGI, will become significantly easier with time, and the plot above will also hold for AI. Hardware, engineering, and algorithmic insights will make AIs more capable and cheaper to build and deploy. In particular, I believe that we will discover more efficient ways to train AI systems than the current “brute force” approach of using O(N²) operations to train an N-sized model on O(N) pieces of data; since each data point, on average, contributes O(1) bits of information to the model, it should not require Ω(N) time to process it.

Being the first mover will require significant expenses but might also confer significant advantages. This does mean that probably any type of “pauses” or “delays” are unlikely to make a long-term difference but may create more of an “overhang” effect, where multiple parties (companies/countries) can achieve similar capabilities at about the same time. For example, if the Manhattan Project didn’t exist, the Atomic bomb would have been delayed by several years, but when it was built, it would have likely been done by multiple countries. Whether such a “multipolar” scenario is good or bad is hard to predict in advance. Ultimately what can be built will be built, and I believe regulations and policies can have a minimal impact on the capabilities of AI systems in the long run. But like in the nuclear case, the decisions of first-movers, regulations, and research investments we make today can profoundly impact humanity’s future trajectory. It was not pre-ordained that we would live in a world with more than 3,000 thermonuclear missiles ready to launch at a second's notice. It was also not pre-ordained that 75 years after Hiroshima and Nagasaki, only nine countries would have nuclear weapons, and no such weapon has been used in war since. Technology might be inevitable, but the way we use it isn’t.

[-]boazbarak2y10

I indeed believe that regulation should focus on deployment rather than on training.

[-]Daniel Kokotajlo2y20

People have been destroying large parts of the world since the days of Genghis Khan. The Manhattan Project did indeed require the work of highly intelligent scientists, but they were only one component of the project and were successful not just because of their intelligence.

Yudkowsky is no doubt aware of these facts

[-]Daniel Kokotajlo2y20

apparently any war-making device that can be built will be built, even if its only possible use is the self-destruction of our race.

There are now several important counterexamples to that claim, listed in decreasing order of importance (due to decreasing plausibility that they'd actually be militarily effective)
--Various bioweapons
--Various nuclear weapons even more powerful than the H-bombs we built so far
--Various exotic weapons like the nuclear-powered radiation-spewing cruise missile
--Militarization of space (the project orion space dreadnaught proposal)
--Weaponization of genetic engineering

Are there lessons for AI?

We are currently in another time of great scientific advances that promise to change human society. What are the similarities and differences between the Atomic bomb and Artificial Intelligence?

AI is not the bomb 1: AI is a dual-use technology

The uncontrolled release of vast quantities of energy cannot serve any purpose other than destruction. Thus the atomic bomb is an instrument of death. The more powerful the bomb is, the more dangerous it is. (Nuclear fission generally is, of course, a dual-use technology.) In contrast, AI is a general technology that can be used for many purposes. In fact, it’s hard to think of a goal that AI cannot benefit. This implies a fundamental difference in the risk profile of the bomb and AI. Unlike what is sometimes suggested, AI safety and capabilities are not fundamentally opposed to one another. In some cases, such as self-driving cars, it is clear that making AI systems more capable also means making them safer. In other settings, the relation is more complicated, where AI capabilities improvements help both the “attacker” and “defender,” and it is unclear which side is helped the most:

In cyber warfare, hackers can use AI systems to find vulnerabilities and hack software systems. But, by the same token, software vendors can use AI to patch vulnerabilities in advance before they are discovered and exploited. It is unclear which side is intrinsically more favored (assuming they both have access to AIs of equal strength). One major advantage AI offers for the defender side is coverage and timeliness. Currently, software vendors can spare limited effort toward securing systems, and this effort is often proportional to the amount that the system or individual component is used. Thus often, it is the more esoteric features (e.g., TLS heartbeat extension) that lack attention from developers and can have security vulnerabilities survive for an extended time.

A similar dynamic exists in biowarfare: AI can be used to design bioweapons, but also to prepare in advance vaccines, as well as accelerate vaccine and therapeutic developments for both natural and designed pandemics.

Disinformation is another setting in which AI can help both those who create or spread disinformation as well as those who try to prevent it. AI can make “deep fakes” and can also create highly personalized misleading messages. However, AI can also be used by social media companies to monitor misleading messages, whether on servers or on the client’s device. Hence, AI can detect deepfakes (whether by watermarking or using cryptography to establish a “chain of custody” and authenticity of the media) and warn users of influence attempts. Once again, it is unclear which side will be favored, and regulations that ensure the right set of incentives might play a considerable role.

Eliezer Yudkowsky likes to say that “every eighteen months, the minimum IQ necessary to destroy the world drops by one point.” I believe this quote is wrong on several levels. First, it fetishizes intelligence. People have been destroying large parts of the world since the days of Genghis Khan. The Manhattan Project did indeed require the work of highly intelligent scientists, but they were only one component of the project and were successful not just because of their intelligence. (One passage in Rhodes’ book describes how Oppenheimer was so successful as a lab director also because he was attuned to the emotional needs of the scientists, with one example being him taking the time for weekly meetings with Teller.) As the infamous Oppenheimer-Truman meeting showed (not to mention the actions of Hitler and Stalin), ultimately, it was not the most intelligent people responsible for the most destruction. Second, as I discuss above, computational capabilities have a dual impact, and it is unclear, to say the least, if technological improvements make the world safer or more dangerous.

AI is not the bomb 2: It is much more complex.

One of the striking facts in the book is the contrast between how hard it was to build the atomic bomb and how (reasonably) easy it was to predict it in advance and get order-of-magnitude estimates of its capabilities. Multiple physicists made pen and paper calculations of the critical mass of U235 and the size of a bomb. Before Plutonium was ever created, physicists could roughly predict its potential for fission. In contrast, our ability to predict the future capabilities of AI systems is very restricted. (See this post for my crude attempts at back-of-envelope calculations, sticking with qualitative predictions only.) We can predict near-term advances on particular benchmarks fairly reliably. But extending these predictions further out or trying to map out what these benchmarks will mean for real-world impact is still highly problematic. One worrying trend is that people sometimes replace principled analysis (as we had with fission) with opinion polls of scientists or other dubious calculations, which can give a false impression of quantifiability (“there is X% likelihood of outcome Y”) but are really “cargo cult science.”

AI is like the bomb: the impossible vs. inevitable phenomenon.

Like the case of the bomb, I think building AI systems, including AGI, will become significantly easier with time, and the plot above will also hold for AI. Hardware, engineering, and algorithmic insights will make AIs more capable and cheaper to build and deploy. In particular, I believe that we will discover more efficient ways to train AI systems than the current “brute force” approach of using O(N²) operations to train an N-sized model on O(N) pieces of data; since each data point, on average, contributes O(1) bits of information to the model, it should not require Ω(N) time to process it.

Being the first mover will require significant expenses but might also confer significant advantages. This does mean that probably any type of “pauses” or “delays” are unlikely to make a long-term difference but may create more of an “overhang” effect, where multiple parties (companies/countries) can achieve similar capabilities at about the same time. For example, if the Manhattan Project didn’t exist, the Atomic bomb would have been delayed by several years, but when it was built, it would have likely been done by multiple countries. Whether such a “multipolar” scenario is good or bad is hard to predict in advance. Ultimately what can be built will be built, and I believe regulations and policies can have a minimal impact on the capabilities of AI systems in the long run. But like in the nuclear case, the decisions of first-movers, regulations, and research investments we make today can profoundly impact humanity’s future trajectory. It was not pre-ordained that we would live in a world with more than 3,000 thermonuclear missiles ready to launch at a second's notice. It was also not pre-ordained that 75 years after Hiroshima and Nagasaki, only nine countries would have nuclear weapons, and no such weapon has been used in war since. Technology might be inevitable, but the way we use it isn’t.

LESSWRONG
is fundraising!
LW

LESSWRONG
is fundraising!
LW

51

Reflections on "Making the Atomic Bomb"

51

51

The level of investment in the Manhattan Project was truly staggering.

Much of this cost was due to the compressed schedule.

The government fully trusted the scientists on the science.

Many scientists independently realized the possibility of an atomic bomb.

The project was secret to the public but public to the US’ adversaries.

How long was the road from theoretical possibility to actually building the bomb.

How terrible of a weapon the bomb is.

How little sense the H bomb makes.

Are there lessons for AI?

AI is not the bomb 1: AI is a dual-use technology

AI is not the bomb 2: It is much more complex.

AI is like the bomb: the impossible vs. inevitable phenomenon.