Discontinuous progress in history: an update

Wow. This is some dope research. I'm blown away. (I've curated this post.)

I have no idea how you guys came up with these areas to explore. "Let us attempt to track the size of ships in the 1800s". "Let us research the temperature at which superconduction worked in the 1900s". This is an impressive number of trends explored and data gathered, and a fascinating set of results for thinking about AI but also for understanding history and the world generally.

Regarding curating the post, I have to mention that very rarely does historical research like this ever get written in such a simple, concise and readable way, so thank you very much for that on top of the research itself.

And above all, I'll remember to look out for discontinuities when interacting with product features, objects that have size, or Isambard Kingdom Brunel.

[-]Daniel Kokotajlo6y230

IIRC we had a bounty out, to crowdsource suggestions for metrics and specific inventions to investigate. We also just thought of a bunch ourselves. In fact one major limitation of our research is that there are probably all sorts of selection biases in what we chose to investigate; it would be great to have a random sample instead of a brainstormed sample, because then we could say much more substantial things about the probability of discontinuities.

IMO Isambard Kingdom Brunel and Elon Musk seem pretty similar. For example, they both have weird names. I'd predict a heightened chance of discontinuities on metrics his companies are working on. Maybe also a heightened chance of something really big being built. :)

[-]NoSignalNoNoise6y20

IMO Isambard Kingdom Brunel and Elon Musk seem pretty similar. For example, they both have weird names. I'd predict a heightened chance of discontinuities on metrics his companies are working on. Maybe also a heightened chance of something really big being built. :)

Are there any major recent discontinuities in the cost, range, or number of electric cars, cost or energy density of batteries, or cost of putting stuff into orbit?

[-]Daniel Kokotajlo6y50

I don't know; I haven't investigated. My guess is that the Falcon 9 is the most impressive of all the things Musk has done so far, but cost per kilogram to orbit is a weird metric because of how corrupt and ossified the industry is (other than SpaceX.) When companies like Boeing, ULA, etc. manage to lobby politicians to throw huge wads of cash at them for inferior products, and then SpaceX comes along and undercuts them with a superior product, who knows what the "real" costs-per-kilogram are? Both SpaceX and its competitors are probably charging substantially more than they need to.

[-]Thomas Kwa6y10

Cost, range, and number of electric cars are rather artificial metrics-- I think measuring the cost, range, and number of cars in total would be much more in the spirit of the post, which haven't seen any surprising departures from trends. As for batteries, a ~15% decrease in cost per doubling in cumulative number of lithium-ion batteries produced combined with roughly exponential growth in the market has meant a smooth trend over the past few years: see this source.

I believe it's also easily checked that there's no significant discontinuity in cost to orbit. Starship could be promising, and Musk's goal is an extremely ambitious $2M/1000t to LEO, but even that is only 30 years or so on the current trend line.

[-]Daniel Kokotajlo5y50

On the contrary, the graph of launch costs you link seems to depict Falcon 9 as a 15-ish-year discontinuity in cost to orbit; I think you are misled by the projection, which is based on hypothetical future systems rather than on extrapolating from actual existing systems.

[-]Thomas Kwa5y50

This seems right, thanks.

[-]bfinn6y10

On the last point, I wonder if there might be a slight bias in general of history remembering people with distinctive names, or indeed other irrelevant characteristics. (Cf Robert Hooke's disputes with other scientists led to centuries of underestimation of his importance.)

[-]bfinn6y*110

Very interesting work indeed. A bunch of different observations:

The invention of the shipping container seems a likely candidate for a discontinuity in shipping speed, shipping costs and global trade. Though economic data on it is patchy, the go-to book on the subject is The Box by Marc Levinson.

Re invention of the telegraph, the book A Farewell to Alms by Gregory Clark accounts (pp. 305-7) via a series of clever inferences how from Roman times to 1800 the speed of long-distance travel of important information, regardless of method, was constant at 1 mph. This increased slightly in the first half of the 19th century, until the discontinuity from the telegraph. The book has lots of other data on historical innovation that may well be useful to you.

Re product features, and this recent LessWrong article (which no doubt you've seen) about the crucial difference between a prototype and a practical invention, I repeat my comment there that a high-quality implementation, e.g. usability & user-friendliness, rather than specific features often seems to be the crucial breakthrough. As shown by Apple: various inventions of theirs - the Apple Mac, iPod, smartphone, iPad - had little innovation as such. Desktop computers, GUIs, mice; digital music players; mobile phones, personal digital assistants; touch screens, tablet computers - these all already existed. But in each case Apple's breakthrough was to take them from being commercial but mediocre implementations, to very good implementations. And only when that happened did mass adoption occur, which is a crucial step in the impact of the invention.

Finally, I should make the obvious remark that though looking at the history of past inventions is very interesting, applying this to the future, particularly to AI, would be an extrapolation, which may or may not be valid at all, particularly if superhuman AGI is a quite different phenomenon from any previous invention (which it may well be).

[-]Roko6y90

It would be interesting to find some more discontinuities that are unrelated to Western Civilization.

For example, what about Zheng He's treasure voyages or the Great Wall of China? Or Mesoamerican civilizations?

Could you systematically contact relevant historians to farm this work out?

https://en.m.wikipedia.org/wiki/Zheng_He

[-]Charlie Steiner6y70

American civilizations are more recent than one might expect - the big iconic pyramids at Chichen Itza date from ~1000 CE. Plus, a lot of the historical data is vague or hard to get at - not just because of spanish missionaries, but also because a lot of the big North American civilizations left monuments in the form of huge earthworks, not stone, and their written records, if they existed, are lost.

There are definitely upheavals and discontinuities in the archaeological record (the one that really comes to mind is the introduction of corn to north america), but I'm not so sure about our ability to reconstruct nontrivial engineering-type metrics.

[-]Richard Korzekwa6y60

I agree that it would be interesting to look at evidence from further in the past or from non-Western progress.

Unfortunately, we found researching progress from before roughly 1700-1800 (and sometimes even later) to often be quite difficult. Most sources are vague, disagree with each other, or have clear signs of unreliability. Even when we have good accounts of what the state of the art was at some particular time, it was difficult to establish a progress trend leading up to it.

You're probably right that professional historians would be good at sorting some of these problems out. Usually when we did contact subject matter experts during the investigation, they could at help us to reality check out findings, but we did not try to get them to actually do work for us.

[-]Roko6y70

You should definitely try harder to connect with academic historians. They justify their existence by telling us they can help us learn from the past - this is a golden opportunity for it!

[-]Aryeh Englander6y70

One thing that jumped out at me when reading this is that you were counting something as a discontinuity (a relative rate of change) by looking at how many years it jumped ahead (an absolute rate of change). This effectively rules out most recent technologies because the rate of technological progress is already quite high, so you'd have a much harder time jumping 100 years ahead of schedule now than you would have in the past.

I would think that a better metric would be to use some measure of general technological progress as a a base (the x-axis) instead of absolute number of years. I strongly suspect that you would find quite a few more discontinuities this way which were otherwise ruled out because they didn't "jump far enough ahead". For example, I suspect that AlexNet would be a discontinuity on this metric.

[-]Raemon4y60Review for 2020 Review

I was surprised that I had misremembered this post significantly. Over the past two years somehow my brain summarized this as "discontinuities barely happen at all, maybe nukes, and even that's questionable." I'm not sure where I got that impression.

Looking back here I am surprised at the number of discontinuities discovered, even if there are weird sampling issues of what trendlines got selected to investigate.

Rereading this, I'm excited by... the sort of sheer amount of details here. I like that there's a bunch of different domains being explored, which helps fill in a mosaic of how the broader world fits together.

It's an interesting question how much any of this should directly bear on AI timeline forecasts. The more recent debates between Eliezer and Paul dig into some differences into how to apply this. Is AI going to be like past technological jumps, or an entirely new one?

I appreciate Katja et all flagging various potential issues with the methodology in the original post, and noting some possible other questions you could research. If I had infinite researchers I'd probably still want those questions explored, but I'm not sure how many of current researchers I'd be excited to delve into those followup questions. I feel like the approach of "investigate past trends" has passed the 80/20 point of informing our AI timelines, and I'd probably prefer those researchers to orient to new questions that illuminate different facets of the AI strategic landscape.

[-]Jsevillamol4y41

I feel like the approach of "investigate past trends" has passed the 80/20 point of informing our AI timelines, and I'd probably prefer those researchers to orient to new questions that illuminate different facets of the AI strategic landscape.

I specialise in researching this topic. My impression is that barely anyone has looked at past technological trends, neither in academia nor in the LW/EA community. I am generally quite excited about more people looking into this space, because it seems neglected and the kind of topic where EA/LW type of people have a significant edge.

[-]romeostevensit6y60

This is incredibly interesting. Very much looking forward to Hanson's commentary.

[-]dspeyer6y50

What were the 38 trends you studied? How did you select them? How confident are you that the other 28 don't have discontinuities that you missed?

[-]teradimich6y40

How about paying attention to discontinuous progress in tasks that are related to DL? It is very easy to track with https://paperswithcode.com/sota . And https://sotabench.com/ is showing diminishing returns.

[-]Yandong Zhang6y40

Such a great article! I thought the AlexNet that led to the recent AI break through could be viewed as a discontinuity too. The background and some statistics result are well summarized in below link.

https://qz.com/1034972/the-data-that-changed-the-direction-of-ai-research-and-possibly-the-world/

[-]Robert Vroman5y20

Is railroad not discontinuous for land speed travel, particularly long distance?

[-]Greg van Paassen6y20

Some questions to ask yourselves. I don't want answers to them, but I think you need solid answers to them for yourselves.

1. What negative discontinuities did you find - situations where a trend was held in check for a long period, before resuming? What insights do you draw from them?

2. Why use the same timescale for all activities?

3. Why the binary threshold for significance?

4. Science is all about finding good questions to ask. I have a feeling that this isn't a very good question, nor is the object of study well defined. Why do you believe that this question/this cabinet of curiosities tells you something useful?

[-]ESRogs6y20

With planes and ICBMs crossing the ocean, there seemed to be a pattern where incremental progress had to pass a threshold on some dimension before incremental progress on a dimension of interest mattered, which gave rise to discontinuity. Is that a common pattern? (Is that a correct way to think about what was going on?)

This reminds me of Clayten Christensen's idea of disruptive innovation, where a new approach to a product may at first only be suitable for niche use cases, but has a higher growth rate of improvement than the traditional approach, and so eventually crosses some threshold and dominates the market.

(Disruptive innovations wouldn't necessarily produce discontinuous progress, but the underlying phenomenon of different rates of progress for different approaches to a problem seems to be the same as what's observed here.)

[-]spqr0a16y10

Interested to see a historical analysis of luminous efficacy. Spans 3 orders of magnitude, similar timeframe to other topics covered, and also like other topics here includes many sequential innovations as opposed to mere iteration on a particular technology.

[-]Basil Marte6y10

https://en.wikipedia.org/wiki/Optical_telegraph preexisted the electric telegraph for speed of information over land. (Although it has some related systems, such as heliographs.)

I'd guess materials science as a field with several discontinuous leaps. Bessemer process, duraluminium, carbon fiber reinforced plastics: I think these are the most famous candidates. (It's hard to put it into metrics, but nearly all non-immobile things were structurally built out of wood until Bessemer/Martin steel came around.)

Rocketry is intimately related to nuclear bombs. The impetus to develop it came from the fact that now a small payload could destroy a city. (In WW2, a V2 occasionally leveled an apartment block or two. This is not a performance that justifies investing in an ICBM.) The early space race was largely a demonstration of this capability, as a rocket capable of accelerating a multiple-ton payload into near-circular orbit required to hit the other side of the earth is necessarily capable of accelerating a few-hundred-kg payload into low earth orbit, and vice versa.

https://en.wikipedia.org/wiki/Duga_radar for power used in an active sensor (mostly searchlights and radars)?