Strong upvote, even though I think you're wrong about some important claims here, because you're being detailed enough for me to reply to.
... which I will do properly (ie, with citations into your post) tomorrow, if it still seems useful to be more specific. But the gist of what I'll defend better if needed is: while it's quite possible that the predictions function as OpenAI propaganda, that's separate from whether they do so because they are valuable - if someone had come up with these predictions in a box, isolated from OpenAI, they'd have similar effects. So the question splits into the upstream causality of why people say these things (credit and blame assignment) versus the downstream causality of what these things will do (and what to do about it now). The upstream causality seems like a distraction, except inasmuch as it's relevant to downstream causality (eg, because properly assigned credit or blame might change the landscape of the present). IMO the main concern here is that these predictions, which were already being made by many people around the tech but not so specifically or with such careful argumentation, seem to be somehow being used by OpenAI to further their purposes. If that's because the predictions turn out correct, that maybe seems worse than if they were wrong, because they're pretty scary predictions - but either way, it's not good news that, in my view, there doesn't seem to be such a thing as bad publicity for AGI, and I still don't know for sure why that's happening. And that seems like where most of the value is in figuring out this discussion, to me, at least. Though the view you initially appeared to be writing down - that the predictions themselves are functioning as a propaganda piece in an upstream-causality, intentional sort of way - does seem to be a common one, so having a good and solid debate about it, where we try to figure out and confirm the who-did-what-why a bit, might well be worth the attention.
in my view, there doesn't seem to be such a thing as bad publicity for AGI, and I still don't know for sure why that's happening. And that seems like where most of the value is in figuring out this discussion, to me, at least.
It's an incentive problem.
There is no way to discuss something being dangerous that does not also render it valuable. People are incentivized to seek out value; our entire economy is based on it. It works beautifully, but it is terrible at mitigating externalities. We only dial back from dangerous or bad things after the disaster; so long as doing things is profitable, rational economic actors seek out high-risk activities as far as permitted, because they alone get the profit and the majority of the risk is to other people.
In my view Yudkowsky's body of work has had two main effects, which run in opposite directions:
Daniel is a thoughtful, strategic person who understands and thinks about AI strategy. He presumably wrote AI 2027 to try to influence strategy around AI. His perspective is going to be for playing as OpenAI. He will have used this perspective for years, totaling thousands of hours. He will have spent all of that time seeing AI research as a race, and trying to figure out how OpenAI can win. This is a generating function for OpenAI's investor pitch, and is also the perspective that AI 2027 takes.
S.K.'s comment: I would like to repeat the quote[1] from the AI-2027 forecast, which I first mentioned in another comment. "The scenario itself was written iteratively: we wrote the first period (up to mid-2025), then the following period, etc. until we reached the ending. We then scrapped this and did it again.
We weren’t trying to reach any particular ending. After we finished the first ending—which is now colored red—we wrote a new alternative branch because we wanted to also depict a more hopeful way things could end, starting from roughly the same premises. This went through several iterations".[2]
S.K.'s comment continues: In the unlikely event that it were DeepCent who aligned its AI and Consensus-1 ended up aligned, that would also be "a more hopeful way things could end". However, the story has both OpenBrain AND DeepCent choose training environments that misalign their AIs. The Slowdown Ending has OpenBrain retry solving alignment, this time with OOMs more effort. DeepCent, on the other hand, cannot retry without falling further behind.
Second: what information is available, and what information do you see a lot?
I think this is the main source of skew.
S.K.'s comment: the AI-2027 forecast relies on the following five pillars: the AI goals forecast, the compute forecast, the security forecast, the timelines forecast and the takeoff forecast.
The forecast related to AI goals is unlikely to be skewed. Sections 1, 2 and 5 of the compute forecast don't actually rely[3] on the existence of powerful AIs. The security forecast is harder to grade, since it relies on humans deciding to guard the secrets against other humans, but it also has benchmark-based estimates. What remains are the timelines forecast and the takeoff forecast.
The former is so unreliable that even the authors acknowledged as much in April 2025, when Eli forecast the median date of superhuman coders' appearance as 2027 (2025 to 2039), 2028 (2025 to >2050) or 2030 (2026 to >2050).
The takeoff forecast rests on the assumption that superhuman coders and AI researchers will appear and will greatly accelerate AI research. The exact rates of acceleration are the most vulnerable to being skewed, especially if the AIs develop high-level neuralese before becoming superhuman coders. But I don't think that we even have better ways to forecast the acceleration.
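To make concrete why the assumed acceleration rates carry so much weight, here is a minimal sketch (in Python, with entirely made-up milestone gaps and speedup multipliers rather than the forecast's own numbers) of how such multipliers compress a research timeline:

```python
# Minimal sketch: how assumed AI R&D speedup multipliers compress a timeline.
# The milestones, human-only effort estimates, and multipliers below are
# made-up illustrative numbers, not values from the AI 2027 takeoff forecast.

milestones = [
    # (milestone, human-only years of work to reach it, assumed speedup while closing that gap)
    ("superhuman coder", 2.0, 1.5),
    ("superhuman AI researcher", 3.0, 5.0),
    ("fully automated AI R&D", 5.0, 25.0),
]

elapsed_calendar_years = 0.0
for name, human_only_years, speedup in milestones:
    elapsed_calendar_years += human_only_years / speedup
    print(f"{name}: reached after ~{elapsed_calendar_years:.2f} calendar years")
```

Doubling any one multiplier roughly halves that gap's calendar time, which is why skew in the assumed rates propagates directly into the headline dates.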
For a concrete example of this that I didn't dig into in my review, take this from the AI 2027 timelines forecast:
We first show Method 1: time-horizon-extension, a relatively simple model which forecasts when SC will arrive by extending the trend established by METR’s report of AIs accomplishing tasks that take humans increasing amounts of time.
We then present Method 2: benchmarks-and-gaps, a more complex model starting from a forecast saturation of an AI R&D benchmark (RE-Bench), and then how long it will take to go from that system to one that can handle real-world tasks at the best AGI company.
Finally we then provide an “all-things-considered” forecast that takes into account these two models, as well as other possible influences such as geopolitics and macroeconomics.
Are either RE-Bench or the METR time horizon[4] metrics good metrics, as-is? Will they continue to extrapolate? Will a model that saturates them accelerate research a lot?
S.K.'s comment: the authors start "from a forecast saturation of an AI R&D benchmark (RE-Bench), and then [estimate] how long it will take to go from that system to one that can handle real-world tasks at the best AGI company". Saturating RE-Bench, unlike reaching a METR-like time horizon of a month, is, of course, NOT enough to accelerate AI research.
AI 2027’s “Vice President” (read: JD Vance) election subplot is long and also almost totally irrelevant to the plot. It is so conspicuously strange that I had trouble figuring out why it would even be there. I didn’t learn until after I’d written my take that JD Vance had read AI 2027 and mentioned it in an interview, which also seems like a very odd thing to happen. I went looking for the simplest explanation I could.
S.K.'s comment: the election-related subplot and mentions of Vance are there because 2028 is an election year in the USA. The 22nd Amendment to the US Constitution prohibits Trump from being elected president again, so Americans will have to choose between another Republican and a Democrat. The Republican candidate is most likely to be Vance.
Similarly, the line about Thiel getting the flying car is likely a reference to a popular quip coined by Thiel: "We wanted flying cars, instead we got 140 characters."
S.K.'s footnote: The quote is found in the collapsible section "How did we write it?" on the forecast's main page.
S.K.'s footnote: the authors also claim that "It was overall more difficult, because unlike with the first ending, we were trying to get it to reach a good outcome starting from a rather difficult situation."
S.K.'s footnote: Sections 3 and 4 have powerful AIs used for automating research, but they require only 5% of OpenBrain's compute.
S.K.'s footnote: The METR benchmark has already run into issues with spurious failures and Grok 4's failure on fast tasks.
This is taken from a comment I wrote; I'm posting it separately because it ended up being very long and it addresses objections I have heard from multiple people. I include Neel's previous comment for context. Previous post here.[1]
Neel Nanda:
I haven't read the whole post, but the claims that this can be largely dismissed because of implicit bias towards the pro-OpenAI narrative are completely ridiculous and ignorant of the background context of the authors. Most of the main authors of the piece have never worked at OpenAI or any other AGI lab. Daniel held broadly similar views to this many years ago, before he joined OpenAI. I know because he has both written about them and I had conversations with him before he joined OpenAI where he expressed broadly similar views. I don't fully agree with these views, but they were detailed and well thought out and were a better prediction of the future than mine at the time. And he also was willing to sign away millions of dollars of equity in order to preserve his integrity - implying that him having OpenAI stock is causing him to warp his underlying beliefs seems an enormous stretch. And to my knowledge, AI 2027 did not receive any OpenPhil funding.
I find it frustrating and arrogant when people assume without good reason that disagreement is because of some background bias in the other person - often people disagree with you because of actual reasons!
These issues specifically have been a sticking point for a number of people, so I should clarify some things separately. This is probably also because I didn't see this comment earlier, so it's been a while, and because I know who you are.
I do not think AI 2027 is, effectively, OpenAI's propaganda merely because it is about a recursively self-improving AI and OpenAI is also about RSI. There are a lot of versions (and possible versions) of a recursively self-improving AI thesis. Daniel Kokotajlo has been around long enough that he was definitely familiar with the territory before he worked at OpenAI. I think it is effectively OpenAI propaganda because it assumes a very specific path to a recursively self-improving AI, with a very specific technical, social and business environment, and the story is about a company that appears to closely resemble OpenAI[2] and is pursuing something very similar to OpenAI's current strategy. It seems unlikely that Daniel had these very specific views before he started at OpenAI in 2022.
Daniel is a thoughtful, strategic person who understands and thinks about AI strategy. He presumably wrote AI 2027 to try to influence strategy around AI. His perspective is going to be for playing as OpenAI. He will have used this perspective for years, totaling thousands of hours. He will have spent all of that time seeing AI research as a race, and trying to figure out how OpenAI can win. This is a generating function for OpenAI's investor pitch, and is also the perspective that AI 2027 takes.
Working at OpenAI means spending years of your professional life completely immersed in an information environment sponsored by, and meant to increase the value of, OpenAI. Having done that is a relevant factor for what information you think is true and what assumptions you think are reasonable. Even if you started off with few opinions about them, and you very critically examined and rejected most of what OpenAI said about itself internally, you would still have skewed perspective about OpenAI and things concerning OpenAI.
I think of industries I have worked in from the perspective of the company I worked for when I was in that industry. I expect that when he worked at OpenAI he was doing his best to figure out how OpenAI comes out ahead, and so was everyone around him. This would have been true whether or not he was being explicitly told to do it, and whether or not he was on the clock. It is simpler to expect that this did influence him than to expect that it did not.
Quitting OpenAI loudly doesn't really change this picture, because you generally only quit loudly if you have a specific bone to pick. If you've got a bone to pick while quitting OpenAI, that bone is, presumably, with OpenAI. Whatever story you tell after you do that is probably about OpenAI.
I think the part about financial incentives is getting dismissed sometimes because a lot of ill-informed people have tried to talk about finance in AI. This seems to have become sort of a thought-terminating cliche, where any question about the financial incentives around AI is assumed to be from uninformed people. I will try to explain what I meant about the financial influence in a little more detail.
In this specific case, I think that the authors are probably well-intentioned. However, most of their shaky assumptions just happen to be things which would be worth at least a hundred billion dollars to OpenAI specifically if they were true. If you were writing a pitch to try to get funding for OpenAI or a similar company, you would have billions of reasons to be as persuasive as possible about these things. Given the power of that financial incentive, it's not surprising that people have come up with compelling stories that just happen to make good investor pitches. Well-intentioned people can be so immersed in them that they cannot see past them.
It is worth noting that the lead author of AI 2027 is a former OpenAI employee. He is mostly famous outside OpenAI for having refused to sign their non-disparagement agreement and for advocating for stricter oversight of AI businesses. I do not think it is very credible that he is deliberately shilling for OpenAI here. I do think it is likely that he is completely unable to see outside their narrative, which they have an intense financial interest in sustaining.
There are a lot of different ways for a viewpoint to be skewed by money.
First is to just be paid to say things.
I don't think anyone was paid anything by OpenAI for writing AI 2027. I thought I made enough of a point of that in the article, but the second block above is towards the end of the relevant section, and I should maybe have put it towards the top. If I write something like this again, I will remember to do that, and maybe write at least an extra paragraph or two about it.
I do not think Daniel is deliberately shilling for OpenAI. That's not an accusation I think is even remotely supportable, and in fact there's a lot of credible evidence running the other way. He's got a very long track record and he made a massive point of publicly dissenting from their non-disparagement agreement. It would take a lot of counter-evidence to convince me of his insincerity.
You didn't bring him up, but I also don't think Scott, who I think is responsible for most of the style of the piece, is being paid by anyone in particular to say anything in particular. I doubt such a thing is possible even in principle. Scott has a pretty solid track record of saying whatever he wants to say.
Second: what information is available, and what information do you see a lot?
I think this is the main source of skew.
If it's valuable to convince people something is true, you will probably look for facts and arguments which make it seem true. You will be less likely to look for facts and arguments which make it seem false. You will then make sure that as many people are aware of all the facts and arguments that make the thing seem true as possible.
At a corporate level this doesn't even have to be a specific person. People who are pursuing things that look promising for the company will be given time and space to pursue what they are doing, and people who are not will be more likely to be told to find something else to do. You will choose to promote favorable facts and not promote unfavorable ones. You get the same effect as if a single person had deliberately chosen to only look for good facts.
It would be weird if this wasn't true of OpenAI given how much money is involved. As in, positively anomalous. You do not raise money by seeking out reasons why your technology is maybe not worth money, or by making sure everyone knows those things. Why would you do that? You are getting money, directly, because people think the technology you are working on is worth a lot of money, and everyone knows as much as you can give them about why what you're doing is worth a lot of money.
Tangentially, this type of narrative lets companies convince staff to take compensation that is more heavily weighted towards stock, which tends to benefit existing shareholders when that is what they prefer: either they expect employees to sell the stock back to them well below its value at a public sale or acquisition, or they know the stock is worth less than the equivalent salary would be.
For a concrete example of this that I didn't dig into in my review, take this from the AI 2027 timelines forecast:
We first show Method 1: time-horizon-extension, a relatively simple model which forecasts when SC will arrive by extending the trend established by METR’s report of AIs accomplishing tasks that take humans increasing amounts of time.
We then present Method 2: benchmarks-and-gaps, a more complex model starting from a forecast saturation of an AI R&D benchmark (RE-Bench), and then how long it will take to go from that system to one that can handle real-world tasks at the best AGI company.
Finally we then provide an “all-things-considered” forecast that takes into account these two models, as well as other possible influences such as geopolitics and macroeconomics.
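To illustrate mechanically what Method 1 is doing, here is a minimal sketch: fit an exponential trend to time-horizon data points and solve for when the fitted trend crosses a "superhuman coder" threshold. The data points, the threshold, and therefore the dates it prints are my own illustrative assumptions, not METR's measurements or the AI 2027 model.

```python
import math

# Minimal sketch of a Method-1-style extrapolation: fit an exponential trend to
# hypothetical METR-style time-horizon data and solve for when the horizon would
# cross a chosen "superhuman coder" (SC) threshold. All numbers are illustrative.

# (year, 50%-success time horizon in human-hours) -- made-up points, not METR data
observations = [
    (2023.0, 0.1),
    (2024.0, 0.5),
    (2025.0, 2.0),
]

# Fit log(horizon) = a + b * year by ordinary least squares.
xs = [year for year, _ in observations]
ys = [math.log(horizon) for _, horizon in observations]
x_mean = sum(xs) / len(xs)
y_mean = sum(ys) / len(ys)
b = sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys)) / sum(
    (x - x_mean) ** 2 for x in xs
)
a = y_mean - b * x_mean

# Solve a + b * year = log(threshold) for the crossing year.
sc_threshold_hours = 160.0  # assumed ~1 work-month horizon as an SC proxy
year_of_sc = (math.log(sc_threshold_hours) - a) / b

print(f"Fitted doubling time: {math.log(2) / b:.2f} years")
print(f"Trend crosses a {sc_threshold_hours:.0f}-hour horizon around {year_of_sc:.1f}")
```

Everything interesting lives in the inputs: the choice of data points, whether the exponential trend holds, and where you set the SC threshold, which is exactly where the questions below come in.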
Are either RE-Bench or the METR time horizon metrics good metrics, as-is? Will they continue to extrapolate? Will a model that saturates them accelerate research a lot?
I think the answer to all of these is maybe. If you're OpenAI, it is pretty important that benchmarks are good metrics. It is worth a ton of money. So, institutionally, OpenAI has to believe in benchmarks, and vastly prefers if the answer is "yes" to all of these questions. And this is also what AI 2027 is assuming.
I made a point of running this into the ground in writing it up, but essentially every time a "maybe" question gets resolved in AI 2027, the answer seems to be the one that OpenAI is also likely to prefer. It's a very specific thing to happen, and it doesn't seem very likely to have happened by chance. The total effect is that, in my opinion, AI 2027 amounts to only a slight dissent from the OpenAI hype pitch.
This isn't even a problem entirely among OpenAI people. OpenAI has the loudest voice and is more or less setting the agenda for the industry. This is both because they were very clearly in the lead for a stretch, and because they've been very successful at acquiring users and raising money. There are probably more people who are bought into OpenAI's exact version of everything outside the company than inside of it. This is a considerable problem if you want a correct evaluation of the current trajectory.
I obviously cannot prove this, but I think if Daniel hadn't been a former OpenAI employee I probably would have basically the same criticism of the actual writing. It would be neater, even, because "this person has bought into OpenAI's hype" is a lot less complicated without the non-disparagement thing, which buys a lot of credibility. I honestly didn't want to mention who any of the authors were at all, but it seemed entirely too relevant to the case I was making to do it.
That's two: being paid and having skewed information.
The third thing, which is much smaller, is being slanted simply because you have a financial incentive. Maybe you're just optimistic; maybe you're hoping to sell soon.
Daniel probably still owns stock or options. I mentioned this in the piece. I don't think this is very relevant or is very likely to skew his perspective. It did seem like I would be failing to explain what was going on if I did not mention the possibility while discussing how he relates to OpenAI. I think it is incredibly weak evidence when stacked against his other history with the company, which strongly indicates that he's not inclined to lie for them or even be especially quiet when he disagrees with them.
I don't think it's disgraceful to mention that people have direct financial incentives. There is, I think, an implicit understanding that it's uncouth to mention this sort of thing, and I disagree with it. I think it causes severe problems in general. People who own significant stock in companies shouldn't be assumed to be unbiased when discussing those companies, and it shouldn't be taboo to mention the potential slant.
My last point is stranger, and is only sort of about money. If everyone you know is financially involved, is there some point where you might as well be?
JD Vance gets flattered anonymously, described only by his job title, but Peter Thiel gets flattered by name. Peter Thiel is, in fact, the only person who gets a shout-out by name. Maybe being an early investor in OpenAI is the only way to earn that. I didn't previously suspect that he was the sole or primary donor funding the think tank this came out of, but now I do. I am reminded that the second named author of this piece has a pretty funny post about how everyone doing something weird at all the parties he goes to is being bankrolled by Peter Thiel.
This is about Scott, mostly.
AI 2027’s “Vice President” (read: JD Vance) election subplot is long and also almost totally irrelevant to the plot. It is so conspicuously strange that I had trouble figuring out why it would even be there. I didn’t learn until after I’d written my take that JD Vance had read AI 2027 and mentioned it in an interview, which also seems like a very odd thing to happen. I went looking for the simplest explanation I could.
Scott says whatever he wants, but apparently by his own accounting half of his social circle is being bankrolled by Peter Thiel. This part of AI 2027 reads like his writing, and he seems to be deliberately flattering Vance. Vance is a well-known Thiel acolyte. In the relatively happy ending of AI 2027 they build an ASI surveillance system, and surveillance is a big Peter Thiel hobby horse.
I don't know what I'm really supposed to make of any of this. I definitely noticed it. It raises a lot of questions. It definitely seems to suggest strongly that if you spend a decade or more bankrolling all of Scott's friends to do weird things they think are interesting, you are likely to see Scott flatter you and your opinions in writing. It also seems to suggest that Scott's deliberately acting to lobby JD Vance. If it weren't for Peter Thiel bankrolling his friends so much that Scott makes a joke out of it, I would think it just looked like Scott had a very Thiel-adjacent friend group.
In pretty much the same way that OpenAI will tend to generate pro-OpenAI facts and arguments, and not generate anti-OpenAI facts and arguments, I would expect that if enough people around you are being bankrolled by someone for long enough they will tend to produce information that person likes and not produce information that person doesn't like.
I cannot find a simpler explanation than Thiel influence for why you would have a reasonably long subplot about JD Vance, world domination, and mass surveillance and then mention Peter Thiel in the finale.
I don't think pointing out this specific type of connection should be taboo for basically the same reason I don't think pointing out who owns what stock should be. I like knowing things, and being correct about them, and so I like knowing if people are offering good information or if there is an obvious reason their information or arguments would be bad.
If making a proper post out of a very long comment like this is considered poor form, I claim ignorance.
A few people have said that it could be DeepMind. I think it could be but pretty clearly isn't. Among other things, DeepMind would not want or need to sell products they considered dangerous or to be possibly close to allowing RSI, because they are extremely cash-rich. If the forecast were about DeepMind, it would probably consider this, but it isn't, so it doesn't.