The Track Record of Futurists Seems ... Fine

There's a lot of room for debate on the correctness of the resolutions of these predictions:

e.g. Heinlein in 1949:

Space travel we will have, not fifty years from now, but much sooner. It's breathing down our necks.

This is marked as incorrect, due to the marker assuming that this meant mass space travel, but I wouldn't interpret this as mass space travel unless there's some relevant context I'm missing here - keep in mind that this was from 1949, 8 years before Sputnik.^[1]

On the other hand:

All aircraft will be controlled by a giant radar net run on a continent-wide basis by a multiple electronic “brain.”

This is marked as correct, apparently due to autopilot and the "USAF Airborne Command Post"? But I would interpret it as active control of the planes by a centralized computer and mark it as incorrect.^[2]

Edited to add: there were a bunch I could have mentioned, but I want to remark on this one, where my interpretation was especially different from the marker's:

Interplanetary travel is waiting at your front door — C.O.D. It’s yours when you pay for it.

This is also from 1949. The marker interprets this as a prediction of "Commercial interplanetary travel". I see it rather as a conditional prediction of interplanetary travel (not necessarily commercial), given the willingness to fund it, i.e. a prediction that the necessary technology would be available but not necessarily that it would be funded. If this is the right interpretation, it seems correct to me. Again, I could be completely wrong depending on the context. ^[3]

[1] Edited to add: I realized I actually have a copy of Heinlein's "Expanded Universe" which includes "Where To?" and followup 1965 and 1980 comments. In context, this statement comes right in the middle of a discussion of hospitals for old people on the moon, which considerably shifts the interpretation towards it being intended to refer to mass space travel, though if Heinlein were still here he could argue it literally meant any space travel.
[2] In context, it's not 100% clear that he meant a single computer, though I still think so. But he definitely meant full automation outside of emergency or unusual situations; from his 1980 followup: "But that totally automated traffic control system ought to be built. ... all routine (99.9%+) takeoffs and landings should be made by computer."
[3] And now seeing the context, I stand by this interpretation: It's a standalone comment from the original, but Heinlein's 1965 followup includes "and now we are paying for it and the cost is high", confirming that government space travel counted in his view...but, given that he did assert we were paying for it, and interplanetary space travel has not occurred (I interpret the prediction as meaning human space travel), this actually might cut against counting this as a correct prediction.

[-]technicalities3y126

Data collector here. Strongly agree with your general point: most of these entries are extremely far from modern "clairvoyant" (cleanly resolving) forecasting questions.

Space travel. Disagree. In context he means mass space travel. The relevant lead-up is this:

"According to her, the Moon is a great place and she wants us to come visit her."
"Not likely!" his wife answers. "Imagine being shut up in an air-conditioned cave."
"When you are Aunt Jane's age, my honey lamb, and as frail as she is, with a bad heart thrown in, you'll go to the Moon and like it."

Re: footnote 1. He was a dishonest bugger in his old age so I don't doubt he would argue that.

Central piloting. Yep, you're right. We caught this before, but changed it in the wrong branch of the data. Going to make it 'ambiguous'; let me know if that seems wrong.

Commercial interplanetary travel. Disagree - "C.O.D." is an old-timey word meaning something so normal and cheap that you don't even need to pay for your ticket upfront - which implies that "you" is a consumer, not a government. (But again I see what you're saying.)

DM me for your bounty ($10)! I've linked to your comment in the changelog. Thanks!

[-]simon3y30

Central piloting. Yep, you're right. We caught this before, but changed it in the wrong branch of the data. Going to make it 'ambiguous'; let me know if that seems wrong.

I would call it a full miss myself.

I still strongly disagree on the commercial interplanetary travel meaning.

If "Cash on Delivery" has that old-timey meaning, it could push a bit to your interpretation, but not enough IMO.

My reasoning:

Interplanetary travel is waiting at your front door —

Actual interplanetary travel, or say a trip on a spaceship, cannot literally be waiting at your front door. So clearly, a metaphorical meaning is intended.

C.O.D. It’s yours when you pay for it.

Here he extends the metaphor.

But, in your view, that means it's cheap. I disagree: if it were cheap, he wouldn't need to say "It's yours when you pay for it". Everything has to be paid for. If he meant it was cheap, he would just stop at C.O.D. and not say "It’s yours when you pay for it."

IMO, the "It's yours when you pay for it" clearly means that he expected it to cost enough that it would be a significant barrier to progress (and the prediction is that it is in effect the only barrier to interplanetary travel). I do suspect though that he did intend the reader to pick up your connotation first, for the shock value, and the "It's yours when you pay for it" is intended to shift the reader to the correct interpretation of what he means by C.O.D, i.e., it's meant to be taken literally within the metaphorical context (and by Gricean implicature a large cost is meant) and not as an additional layer of metaphor.

I suppose the 1965 comments could have been written to retroactively support an interpretation that would make the prediction correct, but I would bet most 1950 readers would have interpreted it as I did.

Also, I note that John C. Wright agrees with my interpretation (in your link to support Heinlein being a "dishonest bugger") (I didn't notice anything in that link about him being a dishonest bugger, though - could you elaborate?). Wright also agrees with me on the central piloting prediction; looking briefly through Wright's comments I didn't see any interpretation of Wright's that I disagreed with (I might quibble with some of Wright's scoring, though probably mostly agree with that too). Unfortunately Wright doesn't comment on whether he thinks Heinlein meant mass space travel as that was a side comment in the lunar retirement discussion and not presented specifically as a separated prediction in Heinlein's original text.

[-]johnswentworth3y200

Contrast this situation with my summary of the different lines of reasoning forecasting transformative AI. The latter includes:
Systematic surveys aggregating opinions from hundreds of AI researchers.
Reports that Open Philanthropy employees spent thousands of hours on, systematically presenting evidence and considering arguments and counterarguments.
A serious attempt to take advantage of the nascent literature on how to make good predictions; e.g., the authors (and I) have generally done calibration training,⁸ and have tried to use the language of probability to be specific about our uncertainty.
There's plenty of room for debate on how much these measures should be expected to improve our foresight, compared to what the "Big Three" were doing.

My guess would be these measures result in predictions somewhat worse than the Big Three. If you want a reference class for "more serious" forecasting, I'd say go look for forecasts by fancy consulting agencies or thinktanks. My guess would be that they do somewhat worse, mainly because their authors are optimizing to Look Respectable rather than just optimizing purely for accuracy. And the AI researcher surveys and OpenPhil reports also sure do look like they're optimizing a significant amount for Looking Respectable.

[-]technicalities3y60

Is the point that 1) AGI specifically is too weird for normal forecasting to work, or 2) that you don't trust judgmental forecasting in general, or 3) that respectability bias swamps the gains from aggregating a heavily selected crowd, spending more time, and debiasing in other ways?

The OpenPhil longtermists' respectability bias seems fairly small to me; their weirder stuff is comparable to Asimov (but not Clarke, who wrote a whole book about cryptids).

And against this, you have to factor in the Big Three's huge bias towards being entertaining instead of accurate (as well as e.g. Heinlein's inability to admit error).

Can you point at examples? (Bio anchors?)

[-]johnswentworth3y7-1

Is the point that 1) AGI specifically is too weird for normal forecasting to work, or 2) that you don't trust judgmental forecasting in general, or 3) that respectability bias swamps the gains from aggregating a heavily selected crowd, spending more time, and debiasing in other ways?

The third: respectability bias easily swamps the gains. (I'm not going to try to argue that case here, just give a couple examples of what such tradeoffs look like.)

This is much more about the style of analysis/reasoning than about the topics; OpenPhil is certainly willing to explore weird topics.

As an example, let's look at the nanotech risk project you linked to. The very first thing in that write-up is:

According to the definition set by the U.S. National Nanotechnology Initiative:
Nanotechnology is...

So right at the very beginning, we're giving an explicit definition. That's almost always an epistemically bad move. It makes the reasoning about "nanotech" seem more legible, but in actual fact the reasoning in the write-up was based on an intuitive notion of "nanotech", not on this supposed definition. If the author actually wanted to rely on this definition, and not drag in intuitions about nanotech which don't follow from the supposed definition, then the obvious thing to do would be to make up a new word - like "flgurgle" - and give "flgurgle" the definition. And then the whole report could talk about risks from flgurgle, and not have to worry about accidentally dragging in unjustified intuitions about "nanotech".

... of course that would be dumb, and not actually result in a good report, because using explicit definitions is usually a bad idea. Explicit definitions just don't match the way the human brain actually uses words.

But a definition does sound very Official and Respectable and Defendable. It's even from an Official Government Source. Starting with a definition is a fine example of making a report more Respectable in a way which makes its epistemics worse.

(The actual thing one should usually do instead of give an explicit definition is say "we're trying to point to a vague cluster of stuff like <list of examples>". And, in fairness, the definition used for nanotech in the report does do that to some extent; it does actually do a decent job avoiding the standard pitfalls of "definitions". But the US National Nanotechnology Initiative's definition is still, presumably, optimized more for academic politics than for accurately conveying the intuitive notion of "nanotech".)

The explanation of "Atomically Precise Manufacturing" two sections later is better, though it's mostly just summarizing Drexler.

Fast forward to the section on "Will it eventually be possible to develop APM?". Most of the space in this section is spent summarizing two reports:

The feasibility of atomically precise manufacturing has been reviewed in a report published by the US National Academy of Sciences (NAS). The NAS report was initiated in response to a Congressional request, and the result was included in the first triennial review of the U.S. National Nanotechnology Initiative. [...]

and

A Royal Society report was dismissive of the feasibility of ‘molecular manufacturing,’ [...]

Ok, so here we have two reports which absolutely scream "academic politics" and are very obviously optimized for Respectability (Congressional request! Triennial review! Institutional acronyms (IA)!) rather than accuracy. Given that the author of the OpenPhil piece went looking for stuff like that, we can make some inferences about the relative prioritization of Respectability and accuracy for the person writing this report.

So that's two examples of Respectability/accuracy tradeoff (definitions and looking for Official Institutional Reports).

[-]Ben Pace3y2023

Correct: "the screen [of a phone] can be used not only to see the people you call but also for studying documents and photographs and reading passages from books." I feel like this would've been an impressive prediction in 2004.

This is an excellent prediction for 1964, and I respect Asimov a great deal for this.

[-]Chris-Lons3y109

Thanks for another thought-provoking post. This is quite timely for me, as I've been thinking a lot about the difference between the work of futurists and that of forecasters.

These are people who thought a lot about science and the future, and made lots of predictions about future technologies - but they're famous for how entertaining their fiction was at the time, not how good their nonfiction predictions look in hindsight. I selected them by vaguely remembering that "the Big Three of science fiction" is a thing people say sometimes, googling it, and going with who came up - no hunting around for lots of sci-fi authors and picking the best or worst.

I think this is a clever way to try to avoid hindsight bias in selecting your futurists, but I think it's at least plausible that only reasonably good futurists could rise to the status of "the Big Three of science fiction". I'm assuming that the status is granted only several decades after the main corpus has been written and that reasonably good predictions (within the fiction) would help enormously in attaining it. On the other hand, imagine writers whose fiction became increasingly ridiculous as the future progressed because they did not make good predictions.^[1] Surely it would be very difficult for such authors to become part of the science fiction elite.

I'm not at all certain of this argument and would like to understand more about how cultural works move from "popular at release" to "classic" status.

At any rate, I think we should be at least moderately concerned that there could still be significant selection bias in the group being analyzed.

[1] For example, I would put C.S. Lewis' space trilogy in this category. They were good books and a forceful argument against the worst sorts of consequentialism, but imo they were not great science fiction, primarily because the way he imagines space and life on other planets seems completely ridiculous now.

[-]Liam Donovan3y102

Minor curiosity: What was the context behind Asimov predicting in 1990 that permanent space cities would be built within 10 years? It seems like a much wilder leap than any of his other predictions.

[-]technicalities3y51

Good catch! The book is generally written as the history of the world leading up to 2000, and most of its predictions are about that year. But this is clearly an exception and the section offers nothing more precise than "By the year 3000, then, it may well be that Earth will be only a small part of the human realm." I've moved it to the "nonresolved" tab.

DM me for your bounty ($10)! I added your comment to the changelog. Thanks!

[-]Bezzi3y*71

Asimov may not have been a professional forecaster, but he was still someone who had thought a lot about the future in the most realistic way possible (and he got invited quite often on TV to talk about it, if I remember correctly), especially considering that he also wrote a crazy amount of scientific nonfiction. Maybe he's more famous as a science fiction author, but he was also a very well-known futurologist, not just some random smart guy who happened to make some predictions. I would be quite surprised to hear about anyone else from the 60s with a better futurology record than him.

That said, I am still quite convinced that the average smart person would make terrible predictions about the long-term future. The best example I can offer is this: a rare set of illustrations printed in 1899 France to imagine what France would look like in the year 2000. Of course, the vast majority of these predictions were comically bad.

It is worth noting that we mainly know about these postcards because Asimov himself published a book about them in the 80s (this is not a coincidence because nothing is ever a coincidence).

[-]Lalartu23y0-3

I disagree that "France in the Year 2000" predictions were wrong. If judged by function rather than aesthetics they are more than half accurate.

[-]Jiro3y41

Asimov's laser beams for communication deserves to be a 1, assuming that 1 means ambiguous/near miss. Fiber optics are a thing, even if they don't actually use lasers. 3D TV was a thing around 2014 as well, and probably deserves a 1, even if it's not in cubes.

[-]technicalities3y33

From context I think he meant not fibre laser but "free-space optics", a then-hyped application of lasers to replace radio. I get this from him mentioning it in the same sentence as satellites and then comparing lasers to radio: "A continuing advance of communications satellites, and the use of laser beams for communication in place of electric currents and radio waves. A laser beam of visible light is made up of waves that are millions of times shorter than those of radio waves". So I don't think this rises above the background radiation (ha) of Asimov's vagueness.

As for 3D TV, if I expand the context you see it's an explicit replacement for screens: "wall screens will have replaced the ordinary set; but transparent cubes will be making their appearance in which three-dimensional viewing will be possible. In fact, one popular exhibit at the 2014 World's Fair will be such a 3-D TV, built life-size, in which ballet performances will be seen. The cube will slowly revolve for viewing from all angles." Also my understanding is that our 3D TVs don't allow any varying POV, let alone all angles.

Thanks! Added these to the changelog.

[-]simon3y10

"free-space optics"

While it's not our main communications method, infrared communication is a thing, and it's a lot closer to visible than radio.

Also, Elon Musk claims that SpaceX is going to enable laser links for inter-satellite communications between Starlink satellites soon (admittedly, not within the 2020 target year, but this is still pretty close!)

As for 3D TV, if I expand the context you see it's an explicit replacement for screens

My reading of the context is that screens are supposed to be the predominant form, and cube 3d is a prototype. This seems to be a correct prediction: see "crystal cube" here.

[-]Unnamed3y41

I recall hearing a claim that a lot of Kurzweil's predictions for 2009 had come true by 2019, including many that hadn't happened yet in 2009. If true, that supports the picture of Kurzweil as an insightful but overly aggressive futurist. But I don't know how well that claim is backed up by the data, or if there has even been a careful look at the data to try to evaluate that claim.

[-]yagudin3y10

See:

[-][anonymous]3y30

I might have missed mention of this somewhere, but I think that some kind of analysis that provides some context on "what did the skeptics at the time say—especially for forecasts that resolved incorrectly vs. correctly" would be quite nice: I think it's potentially helpful to get a model of "(how often/when) were skeptics on the right side of the forecast, and were they accurate for reasons that ended up proving true?" Additionally, some case studies of examples to determine "were they justified for thinking the way they did" while excluding hindsight bias might be difficult, but similarly helpful.

Suppose hypothetically that the findings were something like "When futurists were on the right side of 50% but many of their contemporaries were skeptical at the time, it often was the case that the skepticism was not very engaged/persuasive/grounded (e.g., it was largely based on initial objections to which the futurists provided responses that went unaddressed by the skeptics; making assumptions that were verifiably wrong given available information at the time)." It seems quite improbable that you would get such a neat finding, but if the findings did vaguely resemble this—or if there were at least some not-misrepresentative anecdotes to this effect—then that could be a useful thing to highlight when discussing skepticism towards AGI predictions.

[-][anonymous]3y20

Another long-term forecast evaluation study which I don't think was mentioned (but I might have simply missed it): "Long-term forecasts of military technologies for a 20–30 year horizon: An empirical assessment of accuracy" ( https://www.sciencedirect.com/science/article/abs/pii/S0040162518304438?via%3Dihub ).

Forecast evaluation is often a messy endeavor, as I learned trying to do research on forecasting for S&T last summer (which is what led me to that article).

[-]HoldenKarnofsky3y31

This is included! It's linked from the second-to-last paragraph.

[-][anonymous]3y10

Incorrect: "transparent cubes will be making their appearance in which three-dimensional viewing will be possible. In fact, one popular exhibit at the 2014 World's Fair will be such a 3-D TV, built life-size, in which ballet performances will be seen. The cube will slowly revolve for viewing from all angles." Doesn't seem ridiculous, but doesn't seem right. Of course, a side point here is that he refers to the 2014 World's Fair, which didn't happen.

Yes, but...2014 was the second year of the second VR craze. The Oculus Rift dev kit 2 was shipping, and it was easily capable of showing a life-size 3D ballerina.

As for that kind of cubic 3d display, those have existed for decades. https://en.wikipedia.org/wiki/Spinning_mirror_system / Swept-volume display

So the prediction was wrong, but, we found a better way to give people 3d displays.

[-]RationalActor3y10

Thank you for the interesting post, Holden!

The two key predictions that you make in your “Wild Century” blog series, as I understand it, are:

an AI that will be able to do the process of scientific inquiry (much) better and faster than humans, leading to a productivity explosion, and super-fundamental societal change
digital people/mind-uploading

…this century (plausibly).

These feel much bigger/wilder/more dramatic/more fundamental than the examples given here. This makes me a bit sceptical of how useful the evidence (entertainingly and fascinatingly) assembled in this post is.

[-]Flaglandbase3y10

Aerospace predictions were too optimistic:

Clarke predicted intercontinental hypersonic airliners in the 1970s ("Death and the Senator", 1961). Heinlein predicted a base on Pluto established in the year 2000. Asimov only predicted suborbital space flights at very low acceleration that casual day tourists would line up to take from New York in the 1990s, but also sentient non-mobile talking robots and non-talking sentient mobile robots by that decade. Robert Forward predicted in the novel Rocheworld (1984) that the first unmanned space probe would return pictures from Barnard's Star in 2022 (though the images wouldn't arrive back on Earth till 2028).

On the flip side:

Clarke predicted in "Childhood's End" that it would take extensive searching through a specialized library (where you had to make an appointment through your university and show up in person) just to identify an astronomical catalog number in the 21st century. It would also take VERY expensive computer time with a worldwide waiting list to analyze the trajectory of a comet-like object in the novel "Rendezvous with Rama". That's because comets follow complex hyperbolic trajectories that require calculus far too difficult for humans to solve with pen and paper.

[-]concernedcitizen643y-42

isaac asimov was a snacc tbh

[-]Ben Pace3y60

I'm amused that we have action in the agree-disagree voting here.
