I note up front that at least Daniel Kokotajlo has indeed adjusted his estimates, and has moved his median from ‘AI 2027’ to ‘AI 2028’ based on events since publication
This is not quite right. My timelines moved to a median of 2028 before we published AI 2027, actually, based on a variety of factors including iteratively updating our models. But it was too late to rewrite the whole thing to happen a year later, so we just published it anyway. I tweeted about this a while ago iirc.
First footnote, right on front page: "We disagree somewhat amongst ourselves about AI timelines; our median AGI arrival date is somewhat longer than what this scenario depicts. This scenario depicts something like our mode. See our timelines forecast for more details."
...But no, I guess it doesn't say that Daniel K's median is 2028, iirc. (haven't checked, going on memory) The timelines forecast has Eli and Nikola's medians because they were the ones who wrote it.
I mean, yes, if the goal of the post was to lower the status and prestige of AI 2027 and to do so through people reading the title and updating in that way, rather than to offer a helpful critique, then it is true that the title was the best local way to achieve that objective, epistemic commons be damned. I would hope for a different goal?
Come on, this is such an isolated demand for rigor. AI 2027 clearly had the goal of raising the status and prestige of belief in AI risk and short timelines. They employed tons of symmetric weapons in the pursuit of this goal. I'm maybe 95% sure you didn't substantially critique them for that.[1] Why start with this much less viral post?
(To be clear I'm not against the use of symmetric weapons. I'm against censure of the side you disagree with that masquerades as being impartial, whether or not that was deliberate.)
I don't think AI 2027 did anything even close to as crude as calling the thing you are arguing against just "bad" in your title.
Indeed, I think overall AI 2027 is really doing remarkably well at being asymmetric in a huge number of its choices (I am of course biased, having been involved in many of those choices, but I currently would say that AI 2027 is as close to the very top of the intersection of "accessible" and "trying to make itself succeed and be compelling only if its claims are indeed true" as any piece of media out there).
(I don't have a super considered take on whether I think the title is a fine title, but it seems clear to me that there is a spectrum of trying to make your thing asymmetrically successful and that titotal's critique is far away from AI 2027 on that spectrum, in the direction of symmetry instead of asymmetry)
AI 2027 was less crude in its use of symmetric weapons (which can often itself be a good symmetric weapon when the goal is to influence elites)
AI 2027 made lots of asymmetric choices (but so did titotal)
AI 2027 is doing better than "piece[s] of media" (but that bar is so incredibly low)
I disagree that titotal's critique is far away from AI 2027 on the relevant spectrum. For example, titotal's critique was posted on the EA Forum / LessWrong, and focused on technical disagreements, rather than going through a huge amplification / social media push, and focusing on storytelling.
(I'd agree that AI 2027 put in more effort / are more obviously "trying" relative to titotal, so they're far away as judged by intent, but I mostly care about outcomes rather than intent.)
You might say that obviously AI 2027 needed to do the amplification / social media push + storytelling in order to achieve its goals of influencing the discourse, and I would agree with you. But "influence the discourse" is ultimately going to be about status and prestige (given how discourse works in practice). If you're taking a stance against goals around status and prestige that trade off against epistemic commons, I think you also need to take a stance against AI 2027. (To be clear, I don't take that stance! I'm just arguing for consistency.)
Before AI 2027 was posted with a big amplification / media push, it underwent, as far as I can tell, the single most intense set of review and feedback requests of any big writing project I've seen so far. I don't know whether it was literally posted on LessWrong, but I've seen comments from many dozens if not hundreds of people over the many dozens of revisions that the scenario underwent.
Like, I am quite into public discourse being better than private Google Doc systems, but AI 2027 was so widely circulated pre-publication in Google Doc format, with lots of focus on technical disagreements, that this seems easily much superior to what is going on with this post.
I don't see how this is responding to anything I've said? What in my comment are you disagreeing with or adding color to?
Again, my position is not "AI 2027 did something bad". My position is "stop critiquing people for having goals around status and prestige rather than epistemics, or at least do so consistently".
(Incidentally, I suspect bio anchors did better on the axis of getting good reviews / feedback, but that isn't particularly central to anything I'm claiming.)
For example, titotal's critique was posted on the EA Forum / LessWrong, and focused on technical disagreements
And I was saying that this is also true for the early drafts of AI 2027. Only after a long discussion of the technical disagreements did it go on to a huge amplification thing. This seems directly relevant to that section.
I am responding to the part about consistent standards. I don't really understand what you believe here; clearly you care a lot about people not using lots of rhetorical tricks and adversarial persuasion tactics all the time, and we've talked about that in the past, so I am just straightforwardly arguing that on those dimensions titotal's post was much worse compared to AI 2027.
We don’t need to come to agreement on this part, it does seem kind of hard to evaluate. But in as much as your top level comment is arguing some kind of asymmetric standard is being applied, that just seems super wrong to me. I don’t know where I would put the line of encourage/discourage, but I don’t see any inconsistency in being unhappy with what titotal is doing and happy about what AI 2027 is doing.
I don’t see any inconsistency in being unhappy with what titotal is doing and happy about what AI 2027 is doing.
I agree with this. I was responding pretty specifically to Zvi's critique in particular, which is focusing on things like the use of the word "bad" and the notion that there could be a goal to lower the status and prestige of AI 2027. If instead the critique was about e.g. norms of intellectual discourse I'd be on board.
That said, your defense doesn't feel all that strong to me? I'm happy to take your word for it that there was lots of review of AI 2027, but my understanding is that titotal also engaged quite a lot with the authors of AI 2027 before publishing the post? (I definitely expect it was much lower engagement / review in an absolute sense, but everything about it is going to be much lower in an absolute sense, since it is not as big a project.)
If I had to guess at the difference between us, it would be that I primarily see emotionally gripping storytelling as a symmetric weapon to be regarded with suspicion by default, whereas you primarily view it as an important and valuable way to get people to really engage with a topic. (Though admittedly on this view I can't quite see why you'd object to describing a model as "bad", since that also seems like a way to get people to better engage with a topic.) Or possibly it's more salient to me how the storytelling in the finished AI 2027 product comes across since I wasn't involved in its creation, whereas to you the research and analysis is more salient.
Anyway it doesn't seem super worth digging to the bottom of this, seems reasonable to leave it here (though I would be interested in any reactions you have if you felt like writing them).
EDIT: Actually looking at the other comments here I think it's plausible that a lot of the difference is in creators thinking the point of AI 2027 was the scenario whereas the public reception was much more about timelines. I feel like it was very predictable that public reception would focus a lot on the timeline, but perhaps this would have been less clear in advance. Though looking at Scott's post, the timeline is really quite central to the presentation, so I don't feel like this can really be a surprise.
To clarify, by AI 2027 do you include the timeline model? If so, I'd be interested to know if the reviews caught and/or discussed any of the primary criticisms that titotal has brought up here, particularly the "model is insensitive to starting conditions" bits.
(I recognize I'm butting into a conversation so feel absolutely free to ignore this.)
I don't know! I would have to look through all the Google Docs comments and like 10 different versions.
In general though, I seem to have a very different relationship to all the supplements than some other people reading AI 2027, and I kind of wonder whether it would just be better to not have the supplements at all.
From my perspective the key thing is the scenario and the associated expandable boxes and explanations. And then I view most of the supplements as kind of helpful essays for trying to understand and explain some of the intuitions that generated the scenario, but the process for the whole thing is very much not "there is an externally validatable scientific model that was built, then that model was used to generate a scenario". The key engagement I am interested in is people arguing against the scenario, not doing some kind of weird "oh, but your models aren't externally validatable and actually in order to say anything about the future of AI your models need to be conceptually perfect".
I really don't think the graph-fitting described in the timelines supplement was that causally upstream of the beliefs of almost any of the people involved, and I kind of view it more as a single individual sanity-check on whether the basic premise of the scenario checks out. When people try to forecast things as complicated as this, they don't create nice formal models; they have a model in their head that handles a huge number of edge-cases and is trying to be consistent with many more things than the formal model could ever represent. Ideally the research supplements would say something like that at the top, though it's plausible that some of the AI Futures Project team relate to their epistemic process differently (though if they do, I think they are just kind of confused).
I don't even think the Timelines Forecast supplement says anything like "this timelines forecast is the basis of the timeline of the mainline scenario". It's just like, a semi-random methodology for forecasting a transformative AI timeline that vaguely informed the main scenario. Conceptually, it feels similar to just doing a random Fermi estimate in the middle of a blog post to sanity-check that the thing I am thinking about isn't completely crazy.
I think it's still good to engage with it on its own terms, and think there is value in that, but it's really not what seems remotely most productive to me when thinking about all of AI 2027.
In general though, I seem to have a very different relationship to all the supplements than some other people reading AI 2027, and I kind of wonder whether it would just be better to not have the supplements at all.
I think this is likely to be true, yes. FWIW, most of the non-AI-researcher people I have talked to about AI 2027 are extremely surprised to hear that the story was not generated in any meaningful sense by the model supplements. It may not explicitly say this - I agree that if folks parse the language on the website very carefully they can plausibly come to that conclusion - but it seems like a pretty crucial thing to be explicit about, just so folks know how to interpret things.
Thanks for the correction! I'm guessing you don't want to, but I would appreciate an elaboration on your part; is @habryka's description below inaccurate, or did I misinterpret it?
It's just like, a semi-random methodology for forecasting a transformative AI timeline that vaguely informed the main scenario. Conceptually, it feels similar to just doing a random Fermi estimate in the middle of a blog post to sanity-check that the thing I am thinking about isn't completely crazy.
OK I just had a chat with Eli to try to trace the causal history as best we can remember. At a high level, we were working on the scenario and the supplementary research in parallel, and went back and forth making edits to both for months, and our views evolved somewhat over the course of that time.
Timelines: We initially set AGI in 2027 based on my AGI median, which was informed by a combination of arguments regarding gains from scaling up agency training, as well as a very crude, handwavy version of what later became the benchmarks and gaps model. Later timelines modeling (the stuff that actually went on the website), along with some additional evidence that came out, pushed my median back to 2028. We noted this in a footnote on the site (footnote #1, in fact) and I posted a shortform about it (plus a tweet or two). tl;dr is that 2027 was my mode, not my median, after the update. We considered rewriting the scenario to happen about one year later because of this, but decided against it since that would have taken a lot of extra time and didn't really change any of the implications. If the timelines model had given very different results which changed our views against 2027 being plausible, then we would have re-written the scenario. I also mentioned this to Kevin Roose in my interview with him (my somewhat later timelines, the difference between median and mode). I didn't expect people to make such a big deal of this.
Takeoff: The takeoff model for our first scenario, the "practice scenario" which we basically scrapped, was basically a simplified version of Davidson's takeoff speeds model (takeoffspeeds.com). Later takeoff modeling informed which milestones to focus on in the scenario (superhuman coder, superhuman AI researcher, etc.) and what AI R&D progress multiplier they should have. Our memory isn't clear on the extent to which they also resulted in changes to the speed of the milestone progression. We think an early crude version of our takeoff model might have resulted in significant changes, but we aren't sure. We were also working on our takeoff model up until the last minute, and similar to the timelines model mostly used it as a sanity check.
Compute: The first version of this was done in early 2024, and the result of it and future versions were directly imported into the scenario.
AI Goals: Early versions of this supplement were basically responsible for our decision to go with instrumentally convergent goals as the AIs' ultimate goals in the scenario.
Security: This one was in between a sanity check and directly feeding into the scenario. It didn't result in large changes but confirmed the likelihood of the weight theft and informed various decisions about e.g. cyberattacks.
So.... Habryka's description is somewhat accurate, certainly more accurate than your description ("no meaningful sense"). But I think it still undersells it. That said, it's definitely not the case that we wrote all the supplements first and then wrote the scenario based on the outputs of those calculations; instead, we wrote them in parallel, had various shitty early versions, etc.
If you want to know more about the evidence & modelling that shaped our views in early 2024 when we were starting the project, I could try to compile a list. I've already mentioned takeoffspeeds.com for example. There's lots of other writing I've put on LessWrong on the subject as well.
My guess is there is no confusion about this, but to be clear, I didn't intend to speak on behalf of the AI 2027 team. Indeed, it's plausible to me they disagree with it, though my honest belief in that case is that they are confused about the sources of their own beliefs, not that my statement is wrong. I.e. I said:
Ideally the research supplements would say something like that at the top, though it's plausible that some of the AI Futures Project team relate to their epistemic process differently (though if they do, I think they are just kind of confused).
The timelines model didn't get nearly as many reviews as the scenario. We shared the timelines writeup with all of the people who we shared the later drafts of the scenario with, but I think almost none of them looked at the timelines writeup.
We also asked a few people to specifically review the timelines forecasts, most notably a few FutureSearch forecasters who we then added as a final author. However, we mainly wanted them to estimate the parameter values and didn't specifically ask them for feedback on the underlying modeling choices (though they did form some opinions; for example they liked benchmarks and gaps much more than time horizon extension; also btw the superexponential plays a much smaller role in benchmarks and gaps). No one brought up the criticisms that titotal did.
In general the timelines model certainly got way less effort than the scenario, probably about 5% as much effort. Our main focus was the scenario as we think that it's a much higher value add.
I've been pretty surprised at how much quality-weighted criticism has focused on the timelines model relative to the scenario, and wish that it was more tilted toward the scenario (and also toward the takeoff model, which IMO is more important than the timelines model but has gotten much less attention). To be clear I'm still very glad that these critiques exist if the alternative is that they didn't exist and nothing replaced them.
I suspect part of the reason the quality-weighted criticism has focused on the timelines rather than the scenario:
If it is the case that you put far less effort into the timelines model than the scenario, then the timelines model is probably just worse. Some of the more obvious mistakes that titotal points out probably don't have analogies in your scenario, so it's just easier to criticise the timelines model, as there is more to criticise there.
In many ways, the timelines model is pretty key to the headline claim of your scenario. The other parts (scenario and takeoff) are useful, high quality contributions but in many ways are less meaningfully novel than the very aggressive timelines. Your takeoff model, for example, is well within the range of speeds considered in the community for years; indeed, it is far slower than a Yudkowskian takeoff. This isn't to denigrate it: the level of detail in the scenario is commendable and the quality in that respect is genuinely novel. But in terms of the media coverage and impact of the work, it's the timelines that I suspect are the most significant.
(FWIW in this comment I am largely just repeating things already said in the longer thread... I wrote this mostly to clarify my own thinking.)
I think the conflict here is that, within intellectual online writing circles, using the title of a post to directly attempt to set a bottom line on the status of something is defecting on a norm, but this is not so in the 'internet of beefs' rest of the world, where titles are readily used as cudgels in status fights.
Within the intellectual online writing circles, this is not a good goal for a title, and it's not something that AI 2027 did (or, like, something that ~any ACX post or ~any LW curated post does)[1]. This is not the same as "not putting your bottom line in the title", it's "don't attempt to directly write the bottom line about the status of something in your title".
I agree you're narrowly correct that it's acceptable to have goals for changing the status of various things, and it's good to push back on implying that that isn't allowed by any method. But I think Zvi's point was that the critique post attempted to do this with its title itself, which is not something AI 2027 did, and which is IMO defecting on a worthy truce in the intellectual online circles.
In any case, it does seem LW curated posts and ACX posts both usually have neutral titles, especially given the occasionally contentious nature of their contents.
"Moldbug sold out" is definitely an attack on someone's status. I still prefer it, because it makes a concrete claim about why. For instance, if the AI 2027 critique post title was "AI 2027's Graphs Are Made Up And Unjustified" this would feel to me much better than something only about status like "AI 2027's Timeline Forecasts Are Bad".
For instance, if the AI 2027 critique post title was "AI 2027's Graphs Are Made Up And Unjustified" this would feel to me much better than something only about status like "AI 2027's Timeline Forecasts Are Bad".
But then that wouldn't be an accurate description of what titotal's post is about.
"AI 2027's authors' arguments for superexponential growth curves are conceptually flawed, and their exponential model is neither exponential nor well-justified, and their graphs are made up and unjustified, and their projections don't take into account many important variables, and benchmark+gaps is a worse model than the simplified one [for technical reasons], and these kinds of forecasts should be viewed with inherent skepticism for the following reasons" would be a proper summary of what is going on... but obviously it's not suitable as a title.
I mean... the reason the AI 2027 critique isn't titled "AI 2027's Graphs Are Made Up And Unjustified" is obviously because the critique is about so much more than just some graphs on ACX and Twitter, right? That's just one small part of the criticism, regardless of how much post-publication public discourse has focused on that one aspect.
The post is ultimately about why the timeline forecasts are (according to the author) bad, so it seems quite hard to title it something direct and concrete when it's a compilation of many separate issues titotal has with AI 2027.
Hmm, interesting. I was surprised by the claim so I did look back through ACX and posts from the LW review, and it does seem to back up your claim (the closest I saw was "Sorry, I Still Think MR Is Wrong About USAID", note I didn't look very hard). EDIT: Actually I agree with sunwillrise that "Moldbug sold out" meets the bar (and in general my felt sense is that ACX does do this).
I'd dispute the characterization of this norm as operating "within intellectual online writing circles". I think it's a rationalist norm if anything. For example I went to Slow Boring and the sixth post title is "Tema Okun's "White Supremacy Culture" work is bad".
This norm seems like it both (1) creates incentives against outside critique and (2) lessens the extremes of a bad thing (e.g. like a norm that even if you have fistfights you won't use knives). I think on balance I support it but still feel pretty meh about its application in this case. Still, this did change my mind somewhat, thanks.
I am both surprised and glad my comment led to an update :)
FWIW I never expect the political blogs to be playing by the good rules of the rest of the intellectual writing circles, I view them more as soldiers. Not central examples of soldiers, but enough so that I'd repeatedly be disappointed by them if I expected them to hold themselves to the same standards.
(As an example, in my mind I confidently-but-vaguely recall some Matt Yglesias tweets where he endorsed dishonesty for his side of the political divide on some meta-level, in order to win political conflicts; interested if anyone else recalls this / has a link.)
(If this also doesn't count as "intellectual writing circles", consider renaming your category, since I clearly do not understand what you mean, except inasmuch as it is "rationalist or rationalist-adjacent circles".)
The Gelman post in question is importantly not arguing for the linked post being bad/stupid; it takes that fully as a given. I actually think that's an importantly different dynamic, because if you are in a context where you can actually presume with your audience that something is bad, then writing it in a title isn't actually influencing the status landscape very much (though it's tricky).
Similarly, I think writing a title which presumes the falsity of the existence of a Christian god would in other contexts be a pretty bad thing to do, but on LessWrong be totally fine, for similar reasons.
As an example, in my mind I confidently-but-vaguely recall some Matt Yglesias tweets where he endorsed dishonesty for his side of the political divide on some meta-level, in order to win political conflicts; interested if anyone else recalls this / has a link
I won't say I would necessarily be surprised, per se, if he had written something along these lines, at least on Twitter, but as a general matter Matt believes misinformation mostly confuses your own side; see his post "Misinformation mostly confuses your own side," where he wrote:
My bottom line on this is that saying things that are true is underrated and saying things that are false is overrated.
We’re all acutely aware of the false or misleading things our political opponents say, and it’s easy to convince yourself in the spirit of “turnabout is fair play” that the key to victory is to play dirty, too. The real problem, though, is that not only does your side already say more false and misleading things than you’d like to admit, but they are almost certainly saying more false and misleading things than you realize. That’s because your side is much better at misleading you than they are at misleading people outside of your ideological camp, and this kind of own-team deception creates huge tactical and strategic problems.
I do believe Matt's support of truth-telling in political fights is instrumental rather than a terminal value for him, so perhaps him articulating this is what you were thinking of?
Titotal wraps up by showing you could draw a lot of very distinct graphs that 'fit the data,' where 'the data' is METR's results. And yes, of course, we know this, but that's not the point of the exercise. No, reality doesn't 'follow neat curves' all that often, but AI progress remarkably often has so far.
I think this is true from a compute-centric perspective over the last few years, yet I'm still suspicious about whether this reflects the actual territory. Since Ajeya's bio-anchors work, most serious timeline forecasting has built on similar foundations, getting increasingly sophisticated within this frame. Yet if I channel my inner Taleb, I might think that mathematical rigor within a potentially narrow conceptual space might be giving us false confidence.
I'm going to ask a bunch of questions without providing answers to illustrate what I mean about alternative modeling approaches:
Where does your outside view start taking in information? Why that specific date? Why not in the 1960s with logic based AI? Why not in the 90s when NNs first came out?
Why not see this as a continuation of better parallelisation techniques and dynamic programming? There's a theoretical CS view of this that says something about the potential complexity of computer systems based on existing speedups, one that could be used as the basis of prediction; why not use that?
Why not take something like a more artificial-life-based view on this, looking at something like the average amount of information compression you get over time in computational systems?
One of the most amazing things about life is that it has remarkable compression of past events into future action plans based on a small sliver of working memory. One can measure this over time, why is this not the basis of prediction?
Why are we choosing the frame of compute power? It seems like a continuation of the bio-anchors frame, and a more sophisticated model of it, which seems to be the general prediction direction over the last 4 years. Yet I worry that as a consequence the modelling gets fragile with respect to errors in the frame itself. Don't get me wrong, physical resources are always a great thing to condition on, but the resource quantity doesn't have to be compute.
Rather than building increasingly sophisticated models within the same conceptual frame, we might be better served by having multiple simpler models from fundamentally different frames? Five basic models asking "what if the modelling frame is X?" where X comes from different fields (artificial life, economics, AI, macrohistory (e.g Energy and Civilization or similar), physics as examples) might give us more robust uncertainty estimates than one highly detailed compute-centric model?
Convergence without mentioning other models feels like a pattern we see when expert communities miss major developments: a consequence of mathematical sophistication built on top of frame assumptions that turn out to be incomplete. The models become impressively rigorous within a potentially narrow conceptual space.
I'm not saying compute-based models are wrong, but rather that our confidence in timelines predictions might be artificially inflated by the appearance of convergence when that convergence might just reflect shared assumptions about which variables matter most. If we're going to make major decisions based on these models, shouldn't we at least pressure-test them against fundamentally different ways of thinking about the underlying dynamics?
You wouldn’t put p(SC in 2025) at 5.8% if we were currently at fifteen nanoseconds. Changing the initial conditions a lot seems to break the model.
I think interpreting this is a bit more complicated than it seems. In some conditions I think I would actually do that. It depends on how long the trend had been going. It's reasonable to extrapolate a trend about as far as it's been going for, I think. All of this is a bit weird because it's not possible to have time horizons less than a second or so anyway, by definition, because the way the horizon length is computed is by comparing to how long it takes humans to do a task. Or, what would those tasks even look like?
If the current doubling time is T, and each subsequent doubling takes 10% less time, then you have infinite doublings (i.e. singularity) by time 10T. So with T = 4.5 months you get singularity by 45 months. This is completely insensitive to the initial conditions or to the trend in changes-in-doubling-time (unless the number "10%" was chosen based on trend extrapolation, but that doesn't seem to be the case).
(In practice the superexponential model predicts singularity even sooner than 45 months, because of the additional effect from automated AI R&D.)
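To spell out the arithmetic behind that claim: the doubling times form a geometric series, so all of the infinitely many doublings complete in finite time:

\[ T + 0.9\,T + 0.9^2\,T + \cdots = \frac{T}{1 - 0.9} = 10T. \]

With T = 4.5 months, that sum is 45 months.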
Of course, we are still missing METR-style evaluations measuring the ability to complete long tasks for all recent systems (Claude 4, newer versions of Gemini-2.5, and, importantly, for agentic frameworks such as OpenAI Codex, Claude Code and similar systems).
When we obtain those evaluations, we'll have a better understanding of the shape of the curve, whether the doubling period keeps shrinking, and if so, how rapidly it shrinks…
It is called 'A deep critique of AI 2027's bad timeline model.' One could simply not use the word 'bad' here and we would still know you have strong disagreements with it…
I think it was meant as a critique of 2027's models of bad timelines; was that not the case?
I don't think this interpretation can hold up: the body of titotal's post doesn't deal with the good vs bad timeline. It's just about the uncertainty of modelling AI progress, which applies to both the good and bad timelines.
I'd like to comment on your discussion of peer review.
'Tyler Cowen’s presentation of the criticism then compounds this, entitled ‘Modeling errors in AI doom circles’ (which is pejorative on multiple levels), calling the critique ‘excellent’ (the critique in its title calls the original ‘bad’), then presenting this as an argument for why this proves they should have… submitted AI 2027 to a journal? Huh?'
To me, this response in particular suggests you might misunderstand the point of submitting to journals and receiving peer review. The reason Tyler says they should have submitted it is not that the original model and publication being critiqued is good and especially worthy of publication; it is that it would have received this kind of careful review and feedback before publication, solicited by an editor independent of the authors, and given anonymously. The authors would then be able to improve their models accordingly, and the reviewers and editor would decide whether their changes were sufficient or request further revisions.
It is a lot of effort to engage with and critique this type of work, and it is unlikely titotal's review will be read as widely as the original piece, or the updated piece once these criticisms are taken into account. And I also found the responses to his critique slightly unsatisfying - only some of his points were taken on board by the authors, and I didn't see clear arguments why others were ignored.
Furthermore, it is not reasonable to expect most of the audience consuming AI 2027 and similar to have the necessary expertise and time to go through the methodology as carefully as titotal has done. Those readers are also particularly unlikely to read the critique and use it to shape their takeaways of the original article. However, they are likely to see that there are pages and pages of supplementary information and analysis that looks pretty serious and, based on that, assume the authors know what they are talking about.
You are right that AI research moves fast and tends not to bother waiting for the peer review process to finish, which can for sure be frustratingly time-consuming. However, realistically, a lot of ML research articles that are widely shared and hyped without going through peer review are really bad, don't replicate and don't even make an attempt to check the robustness of their findings. The incentive structure changes, leading researchers to overstate their findings in abstracts in order for articles to be picked up on social media, rather than expressing things more cautiously lest their statements be picked apart by the anonymous reviewers. Progress still gets made, and very quickly, and the rapid sharing of preprints is definitely really helpful for disseminating ideas early and widely, but this aspect of the field does come with costs and we can't ignore that.
Finally, going through peer review doesn't prevent people from performing additional critique and review, like titotal has done, once an article has been published. It is not either-or. In many journals, peer review reports and responses are also published once the article is accepted, so this is also public.
Peer review is by no means a perfect system and I myself think it should be significantly reworked. However, I think the strengths and weaknesses of the existing structures are often not very well understood by the members of this community who argue for it to be gotten rid of wholesale.
It is great to have thoughtful critiques like this. The way you get actual thoughtful critiques like this, of course, is to post the wrong answer (at length) on the internet, and then respond by listening to the feedback and by making your model less wrong.
This is high-effort, highly detailed, real engagement: it gives the original authors the opportunity to critique the critique, warns readers to beware errors, gives time to respond, shares the code used to generate the graphs, engages in detail, does a bunch of math work, and so on. That is The Way.
So, Titotal: Thank you.
I note up front that at least Daniel Kokotajlo has indeed adjusted his estimates, and has moved his median from ‘AI 2027’ to ‘AI 2028’ based on events since publication, and Eli’s revisions also push the estimates back a bit.
I also note up front that if you evaluated most statements made in the discourse (either non-worried AI forecasting, or AI in general, or more broadly) with this level of rigor, mostly you couldn't, because you'd hit 'I made it up' very quickly. In other cases, where someone is trying at least a little, in my experience the models fall apart a lot worse and a lot faster. No one has suggested 'here is a better attempt to forecast the future and take the whole thing seriously' that I consider to have a reasonable claim to that.
A lot of the disagreements come down to how much one should care about which calculations and graphs match past data, and how closely, in different contexts. Titotal demands very strong adherence throughout. I think it's good to challenge and poke at the gaps, but this seems in several places to go too far.
Note that this section is about discourse rather than the model, so many of you can skip it.
While I once again want to say up front that I am very much thankful for the substance of this critique, it would also be great to have an equally thoughtful headline presentation of such critiques. That, alas, (although again, thanks for writing this!) we did not get.
It is called 'A deep critique of AI 2027's bad timeline model.' One could simply not use the word 'bad' here and we would still know you have strong disagreements with it, and there is much similar talk throughout, starting with the title and then this, the first use of bold:
Titotal (formatting in original): The article is huge, so I focussed on one section alone: their “timelines forecast” code and accompanying methodology section. Not to mince words, I think it’s pretty bad.
Neel Nanda: I agree in general [to try and not call things bad], but think that titotal's specific use was fine. In my opinion, the main goal of that post was not to engage the AI 2027 team, which had already been done extensively in private, but rather to communicate their views to the broader community.
Titles in particular are extremely limited, many people only read the title, and titles are a key way people decide whether to read on, and efficiency of communication is extremely important.
The point they were trying to convey was that these models, which are treated as high status and prestigious, should not be, and I disagree that non-violent communication could have achieved a similar effect to that title (note, I don't particularly like how they framed the post, but I think this was perfectly reasonable from their perspective.)
I mean, yes, if the goal of the post was to lower the status and prestige of AI 2027 and to do so through people reading the title and updating in that way, rather than to offer a helpful critique, then it is true that the title was the best local way to achieve that objective, epistemic commons be damned. I would hope for a different goal?
There are more of these jabs, and a matching persistent attitude and framing, sprinkled throughout what is in its actual content an excellent set of critiques – I find much that I object to, but I think a good critique here should look like that. Most of your objections should be successfully answered. Others can be improved. This is all the system working as designed, and the assessments don’t match the content.
To skip ahead, the author is a physicist, which is great, except that they are effectively holding AI 2027 largely to the standards of a physics model before they would deem it fit for anyone to use it to make life decisions, even if this is 'what peak modeling performance looks like.'
Except that you don't get to punt the decisions, and Bayes' Rule is real. Sharing one's probability estimates and the reasons behind them is highly useful, and you can and should use that to help you make better decisions.
Tyler Cowen’s presentation of the criticism then compounds this, entitled ‘Modeling errors in AI doom circles’ (which is pejorative on multiple levels), calling the critique ‘excellent’ (the critique in its title calls the original ‘bad’), then presenting this as an argument for why this proves they should have… submitted AI 2027 to a journal? Huh?
Tyler Cowen: There is much more detail (and additional scenarios) at the link. For years now, I have been pushing the line of “AI doom talk needs traditional peer review and formal modeling,” and I view this episode as vindication of that view.
That was absurd years ago. It is equally absurd now, unless the goal of this communication is to lower the status of its subject.
This is the peer! This is the review! That is how all of this works! This is it working!
Classic ‘if you want the right answer, post the (ideally less) wrong one on the internet.’ The system works. Whereas traditional peer review is completely broken here.
Indeed, Titotal says it themselves.
Titotal: What makes AI 2027 different from other similar short stories is that it is presented as a forecast based on rigorous modelling and data analysis from forecasting experts. It is accompanied by five appendices of “detailed research supporting these predictions” and a codebase for simulations.
…
Now, I was originally happy to dismiss this work and just wait for their predictions to fail, but this thing just keeps spreading, including a youtube video with millions of views.
As in: I wasn't going to engage with any of this until I saw it getting those millions of views; only then did I actually look at any of it.
Which is tough but totally fair, a highly sensible decision algorithm, except for the part where Titotal dismissed the whole thing as bogus before actually looking.
The implications are clear. You want peer review? Earn it with views. Get peers.
It is strange to see these two juxtaposed. You get the detailed thoughtful critique for those who Read the Whole Thing. For those who don't, at the beginning and conclusion, you get vibes.
Also (I discovered this after I'd finished analyzing the post) it turns out this person's substack (called Timeline Topography Tales) is focused on, well, I'll let Titotal explain, by sharing, in order, the most recent headlines and relevant taglines that appear before you click 'see all':
15 Simple AI Image prompts that stump ChatGPT
Slopworld 2035: The dangers of mediocre AI. None of this was written with AI assistance.
AI is not taking over material science (for now): an analysis and conference report. Confidence check: This is my field of expertise, I work in the field and I have a PhD in the subject.
A nerds guide to dating: Disclaimer: this blog is usually about debunking singularity nerds. This is not a typical article, nor is it my area of expertise.
The walled marketplace of ideas: A statistical critique of SSC book reviews.
Is ‘superhuman’ AI forecasting BS? Some experiments on the “539” bot from the Centre for AI Safety.
Most smart and skilled people are outside of the EA/rationalist community: An analysis.
I’m not saying this is someone who has an axe and is grinding it, but it is what it is.
Okay, with that out of the way up top, who wants to stay and Do Forecasting?
An Explanation of Where Superexponentiality Is Coming From
This tripped me up initially, so it’s worth clarifying up front.
The AI 2027 model has two distinct sources of superexponentiality. That is why Titotal will later talk about there being an exponential model and a superexponential model, and then a superexponential effect applied to both.
The first source is AI automation of AI R&D. It should be clear why this effect is present.
The second source is a reduction in the difficulty of doubling the length or reliability of tasks, once the lengths in question pass basic thresholds. As in, at some point, it is a lot easier to go from reliably doing one-year tasks to two-year tasks than it is to go from one hour to two hours, or from one minute to two minutes. I think this is true in humans, and likely true for AIs in the circumstances in question as well. But you certainly could challenge this claim.
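To make that second mechanism concrete, here is a minimal sketch (my illustration, not the AI 2027 codebase; the starting horizon and doubling time are placeholder values) of how a fixed 10% shrink in doubling times differs from a plain exponential:

```python
# Minimal sketch (not the AI 2027 codebase): time horizon growth where each
# doubling takes 10% less time than the one before, versus a fixed doubling time.

H0 = 0.5       # placeholder starting time horizon, in hours
T0 = 4.5       # placeholder initial doubling time, in months
SHRINK = 0.10  # each subsequent doubling takes 10% less time

def horizon_trajectory(n_doublings: int, superexponential: bool):
    """Return (elapsed_months, horizon_hours) after each doubling."""
    t, horizon, dt = 0.0, H0, T0
    out = []
    for _ in range(n_doublings):
        t += dt
        horizon *= 2
        if superexponential:
            dt *= 1 - SHRINK  # doubling times form a shrinking geometric series
        out.append((t, horizon))
    return out

# Under the shrink rule, all doublings (however many) finish before
# 10 * T0 = 45 months; the plain exponential needs T0 per doubling forever.
for t, h in horizon_trajectory(12, superexponential=True):
    print(f"t = {t:5.1f} months, horizon = {h:8.1f} hours")
```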
Okay, that’s out of the way, on to the mainline explanation.
Three Methods
Summarizing the breakdown of the AI 2027 model:
The headline number is the time until development of ‘superhuman coders’ (SC), that can do an AI researcher job 30x as fast and 30x cheaper than a human.
Two methods are used, ‘time horizon extension’ and ‘benchmarks and gaps.’
There is also a general subjective ‘all things considered.’
Time Horizon Extension Method
Titotal (matching my understanding): The time horizon method is based on 80% time horizons from this report, where the team at METR tried to compare the performance of AI on various AI R&D tasks and quantify how difficult they are by comparing to human researchers. An 80% “time horizon” of 1 hour would mean that an AI has an overall success rate of 80% on a variety of selected tasks that would take a human AI researcher 1 hour to complete, presumably taking much less time than the humans (although I couldn’t find this statement explicitly).
The claim of the METR report is that the time horizon of tasks that AI can do has been increasing at an exponential rate. The following is one of the graphs showing this progress: note the logarithmic scale on the y-axis:
Titotal warns that this report is 'quite recent, not peer-reviewed and not replicated.' Okay. Sure. AI comes at you fast; the above graph is already out of date, and the o3 and Opus 4 (or even Sonnet 4) data points should further support the 'faster progress recently' hypothesis.
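For readers who want the mechanics of the metric itself: METR estimates a model's time horizon by fitting a curve relating success probability to how long each task takes a human, then reading off the task length where predicted success crosses the threshold. A hedged sketch of that idea (not METR's actual code; the data points are invented):

```python
# Sketch of the time-horizon idea (not METR's actual code): fit P(success) as a
# logistic function of log task length, then solve for the length where the
# predicted success rate equals 80%.
import math

# Invented (human_task_minutes, model_succeeded) data points for illustration.
data = [(1, 1), (2, 1), (5, 1), (15, 1), (30, 0), (60, 1), (120, 0), (480, 0)]

def fit_logistic(data, steps=20000, lr=0.05):
    """Crude gradient-descent fit of P(success) = sigmoid(a - b * log(t))."""
    a, b = 0.0, 1.0
    for _ in range(steps):
        ga = gb = 0.0
        for t, y in data:
            x = math.log(t)
            p = 1 / (1 + math.exp(-(a - b * x)))
            ga += p - y          # d(log-loss)/da
            gb += (p - y) * -x   # d(log-loss)/db
        a -= lr * ga / len(data)
        b -= lr * gb / len(data)
    return a, b

a, b = fit_logistic(data)
# Solve sigmoid(a - b * log(t)) = 0.8 for t:
horizon_80 = math.exp((a - math.log(0.8 / 0.2)) / b)
print(f"80% time horizon ≈ {horizon_80:.1f} human-minutes")
```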
The first complaint is that they don’t include uncertainty in current estimates, and this is framed (you see this a lot) as one-directional uncertainty: Maybe the result is accurate, maybe it’s too aggressive.
But we don't know whether or not this is the new normal, or just noise, or a temporary bump where we'll go back to the long term trend at some point. If you look at a graph of Moore's law, for example, there are many points where growth is temporarily higher or lower than the long term trend. It's the long term curve you are trying to estimate; you should be estimating the long term curve parameters, not the current day parameters.
This is already dangerously close to assuming the conclusion that there is a long term trend line (a ‘normal’), and we only have to find out what it is. This goes directly up against the central thesis being critiqued, which is that the curve bends when AI speeds up coding and AI R&D in a positive feedback loop.
There are three possibilities here:
We have a recent blip of faster than ‘normal’ progress and will go back to trend.
You could even suggest this is a last gasp of reasoning models and inference scaling, and that soon we'll stall out entirely. You never know.
We have a ‘new normal’ and will continue on the new trend.
We have a pattern of things accelerating, and they will keep accelerating.
That's where the whole 'superexponential' part comes in. I think the good critique here is that we should have a lot of uncertainty regarding which of these is true.
So what's up with that 'superexponential' curve? They choose to model this as 'each subsequent doubling time is 10% shorter than the one before.' Titotal does some transformational math (which I won't check) and draws curves.
Just like before, the initial time horizon H0 parameter is not subject to uncertainty analysis. What’s much more crazy here is that the rate of doubling growth, which we’ll call alpha, wasn’t subject to uncertainty either! (Note that this has been updated in Eli’s newest version). As we’ll see, the value of this alpha parameter is one of the most impactful parameters in the whole model, so it’s crazy that they didn’t model any uncertainty on it, and just pick a seemingly arbitrary value of 10% without explaining why they did so.
The central criticism here seems to be that there isn’t enough uncertainty, that essentially all the parameters here should be uncertain. I think that’s correct. I think it’s also a correct general critique of most timeline predictions, that people are acting far more certain than they should be. Note that this goes both ways – it makes it more likely things could be a lot slower, but also they could be faster.
What the AI 2027 forecast is doing is using the combination of different curve types to embody the uncertainty in general, rather than also trying to fully incorporate uncertainty in all individual parameters.
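To make concrete what uncertainty on alpha would buy, here is a minimal Monte Carlo sketch (my construction, not Eli's model; every number is a placeholder): sample the doubling-shrink parameter instead of fixing it at 10%, and look at the spread in implied time to SC.

```python
# Sketch (my construction, not the AI 2027 code): Monte Carlo over the
# doubling-shrink parameter alpha instead of fixing alpha = 0.10.
import random

H0 = 0.5         # placeholder current horizon, hours
TARGET = 2000.0  # placeholder horizon required for SC, hours
T0 = 4.5         # placeholder initial doubling time, months

def months_to_target(alpha: float) -> float:
    """Months until the horizon reaches TARGET, with each doubling
    taking (1 - alpha) times as long as the previous one."""
    t, horizon, dt = 0.0, H0, T0
    while horizon < TARGET:
        t += dt
        horizon *= 2
        dt *= 1 - alpha
    return t

random.seed(0)
samples = sorted(
    months_to_target(random.uniform(0.0, 0.2))  # uniform uncertainty on alpha
    for _ in range(10_000)
)
for q in (0.1, 0.5, 0.9):
    print(f"{int(q * 100)}th percentile: {samples[int(q * len(samples))]:.0f} months")
```

Even this toy version shows the qualitative point: once alpha is uncertain, the arrival distribution widens considerably, in both directions.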
I also agree that this experiment shows something was wrong, and a great way to fix a model is to play with it until it produces a stupid result in some hypothetical world, then figure out why that happened:
Very obviously, having to go through a bunch more doublings should matter more than this. You wouldn’t put p(SC in 2025) at 5.8% if we were currently at fifteen nanoseconds. Changing the initial conditions a lot seems to break the model.
If you think about why the model sets up the way it does, you can see why it breaks. The hypothesis is that as AI improves, it gains the ability to accelerate further AI R&D progress, and that this may be starting to happen, or things might otherwise still go superexponential.
Those probabilities are supposed to be forward looking from this point, whereas we know these effects didn't kick in before this point. It's not obvious when we should have had this effect kick in if we were modeling this 'in the past' without knowing what we know now, but it obviously shouldn't kick in before tasks of several minutes (as in, before the recent potential trend line changes), because the human has to be in the loop and you don't save much time.
Thus, yes, the model breaks if you start it before that point, and ideally you would force the superexponential effects to not kick in until H is at least minutes long (with some sort of gradual phase-in, presumably). Given that we were using a fixed H0, this wasn't relevant, but if you wanted to use the model on situations with lower H0s, you would have to fix that.
How much uncertainty do we have about current H0, at this point? I think it’s reasonable to argue something on the order of a few minutes is on the table if you hold high standards for what that means, but I think 15 seconds is very clearly off the table purely on the eyeball test.
Similarly, there is the argument that these equations start giving you crazy numbers if you extend them past some point. And I’d say, well, yeah, if you hit a singularity then your model outputting Obvious Nonsense is an acceptable failure mode. Fitting, even.
The next section asks why we are using superexponential curves in general, and this 'superexponential' curve in particular.
The Public Versus Internal Gap
So, what arguments do they provide for superexponentiality? Let’s take a look, in no particular order:
Argument 1: public vs internal:
“The trend would likely further tilt toward superexponentiality if we took into account that the public vs. internal gap has seemed to decrease over time.”
…
But even if we do accept this argument, this effect points to a slower growth rate, not a faster one.
I do think we should accept this argument, and also Titotal is correct on this one. The new curve suggests modestly slower progress.
The counterargument is that we used to be slowed down by this wait between models, in two ways.
Others couldn't know about, see, access, distill or otherwise follow your model while it wasn't released, which previously slowed down progress.
No one could use the model to directly accelerate progress during the wait.
The counterargument to the counterargument is that until recently direct acceleration via using the model wasn’t a thing, so that effect shouldn’t matter, and mostly the trendline is OpenAI models so that effect shouldn’t matter much either.
I can see effects in both directions, but overall I do think within this particular context the slower direction arguments are stronger. We only get to accelerate via recklessly releasing new models once, and we’ve used that up now.
Slightly off topic, but it is worth noting that in AI 2027, this gap opens up again. The top lab knows that its top model accelerates AI R&D, so it does not release an up-to-date version, not for safety reasons but to race ahead of the competition and to direct more compute towards further R&D.
The Difficulty Gap
This argument is that time doublings get easier. Going from being able to consistently string together an hour to a week is claimed to be a larger conceptual gap than a week to a year.
Titotal is skeptical of this for both AIs and humans, especially because we have a lot of short-term tutorials and few long-term ones.
I would say that learning how to do fixed short-term tasks, where you follow directions, is indeed far easier than generally 'doing tasks that are assigned,' but once you are past that phase I don't think the counterargument does much.
I agree with the generic 'more research is needed' style call here. Basically everywhere, more research is needed and better understanding would be good. Until then, better to go with what you have than to throw up one's hands and say variations on 'no evidence.' Of course, one is free to disagree with the magnitudes chosen.
In humans, I think the difficulty gap is clearly real if you were able to hold yourself intact, once you are past the ‘learn the basic components’ stage. You can see it in the extremes. If you can sustain an effort reliably for a year, you’ve solved most of the inherent difficulties of sustaining it for ten.
The main reason ten is harder (and a hundred is much, much harder!) is that life gets in the way, you age and change, and this alters your priorities and capabilities. At some point you're handing off to successors. There are a lot of tasks where humans essentially do get to infinite task length, if the human were an em that didn't age.
With AIs in this context, aging and related concepts are not an issue. If you can sustain a year, why couldn’t you sustain two? The answer presumably is ‘compounding error rates’ plus longer planning horizons, but if you can use system designs that recover from failures, that solves itself, and if you get non-recoverable error rates either down to zero or get them to correlate enough, you’re done.
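One simple way to formalize the compounding-errors point (my framing, assuming independent per-step failures): with n steps and an unrecoverable per-step failure probability \(\epsilon\),

\[ P(\text{success}) = (1 - \epsilon)^n \approx e^{-\epsilon n}, \]

which decays exponentially with task length. But if failures are detected and retried, the expected number of attempts per step is only \(1/(1-\epsilon)\), so total expected cost inflates by a constant factor, which is why error recovery dissolves the horizon limit.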
Recent Progress
A recent speedup is quite weak evidence for this specific type of superexponential curve. As I will show later, you can come up with lots of different superexponential equations; you have to argue for your specific one.
That leaves the "scaling up agency training". The METR report does say that this might be a cause of the recent speedup, but it doesn't say anything about "scaling up agency training" being a superexponential factor. If agency training only started recently, it could instead be evidence that the recent advances have just bumped us into a faster exponential regime.
Or, as the METR report notes, it could just be a blip as a result of recent advances: “But 2024–2025 agency training could also be a one-time boost from picking low-hanging fruit, in which case horizon growth will slow once these gains are exhausted”.
This seems like an argument that strictly exponential curves should have a very strong prior? So you need to argue hard if you want to claim more than that?
The argument that ‘agency training’ has led to a faster doubling curve seems strong. Of course we can’t ‘prove’ it, but the point of forecasting is to figure out our best projections and models in practice, not to pass some sort of theoretical robustness check, or to show strongly why things must be this exact curve.
Is it possible that this has 'only' kicked us into a new faster exponential? Absolutely, but that possibility is explicitly part of AI 2027's model, and indeed earlier Titotal was arguing that we shouldn't even think the exponential trend has permanently shifted, and they're not here admitting that the mechanisms involved make this shift likely to be real.
I mention the 'one time blip' possibility above as well, but it seems to me highly implausible that, if it is a 'blip,' we are close to done with it. There is obviously quite a lot of unhobbling left to do related to agency.
Infinite Time Horizons
Should superhuman AGIs have infinite time horizons? AI 2027 doesn’t fully endorse their argument on this, but I think it is rather obvious that at some point doublings are essentially free.
Titotal responds to say that an AI that could do extremely long time horizon CS tasks would be a superintelligence, to which I would tap the sign that says we are explicitly considering what would be true about a superintelligence. That’s the modeling task.
The other argument here, that given a Graham's number of years (and presumably immortality of some kind, as discussed earlier) a human can accomplish quite an awful lot, well, yes, even if you force them not to take the obviously correct path of first constructing a superintelligence to do it for them. But I do think there's an actual limit here if the human has to do all the verification too; an infinite number of monkeys on typewriters can write Shakespeare, but they can't figure out where they put it afterwards, and their fastest solution to this is essentially to evolve into humans.
Alternatively, all we’re saying is ‘the AI can complete arbitrary tasks so long as they are physically possible’ and at that point it doesn’t matter if humans can do them too, the metric is obviously not mapping to Reality in a useful way and the point is made.
Intermediate Speedups
Now if you read the justifications in the section above, you might be a little confused as to why they didn’t raise the most obvious justification for superexponentiality: the justification that as AI gets better, people will be able to use the AI for AI R&D, thus leading to a feedback loop of faster AI development.
The reason for this is that they explicitly assume this is true and apply it to every model, including the “exponential” and “subexponential” ones. The “exponential” model is, in fact, also superexponential in their model.
(Note: in Eli’s newest model this is substantially more complicated, I will touch on this later)
Titotal walks us through the calculation, which is essentially a smooth curve that speeds up progress based on feedback loops proportional to progress made towards a fully superhuman coder, implemented in a way to make it easily calculable and so it doesn’t go haywire on parameter changes.
Titotal’s first objection is that this projection implies (if you run the calculation backwards) AI algorithmic progress is currently 66% faster than it was in 2022, whereas Nikola (one of the forecasters) estimates current algorithmic progress is only 3%-30% faster, and the attempt to hardcode a different answer in doesn’t work, because relative speeds are what matters and they tried to change absolute speeds instead. That seems technically correct.
The question is, how much does this mismatch ultimately matter? It is certainly possible for the speedup factor from 2022 to 2025 to be 10% (1 → 1.1) and for progress to then accelerate far faster going forward as AI crosses into more universally useful territory.
As in, if you have an agent or virtual employee, it needs to cross some threshold to be useful at all, but after that it rapidly gets a lot more useful. But that’s not the way the model works here, so it needs to be reworked, and also yes I think we should be more skeptical about the amount of algorithmic progress speedup we can get in the transitional stages here, with the amount of progress required to get to SC, or both.
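For readers who want the shape of this ‘intermediate speedups’ mechanism, here is a stripped-down sketch. This is my own toy, not Eli’s actual implementation, and every parameter is made up: algorithmic progress runs at a multiplier that rises smoothly with the fraction of the way to a superhuman coder.

```python
def years_to_sc(base_years=5.0, mult_now=1.1, mult_at_sc=2.0, dt=0.001):
    # progress runs from 0 (today) to 1 (superhuman coder); the AI R&D
    # speedup multiplier interpolates log-linearly between the endpoints.
    progress, t = 0.0, 0.0
    while progress < 1.0:
        mult = mult_now * (mult_at_sc / mult_now) ** progress
        progress += (mult / base_years) * dt
        t += dt
    return t

print(round(years_to_sc(), 2), "years in this toy, versus 5.0 with no speedup")
```

Note that a curve like this, run backwards, also implies how much faster progress is today than it was in 2022; that implied ratio is fixed by the curve’s shape, which is why, as titotal notes, adjusting absolute speeds rather than relative ones does not fix a mismatch.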
After walking through the curves in detail, this summarizes the objection to the lack of good fit for the past parts of the curve:
I assume the real data would mostly be within the 80% CI of these curves, but I don’t think the actual data should be an edge case of your model.
So, to finish off the “superexponential”: the particular curve in their model does not match the empirical data, and as I argued earlier, it has very little conceptual justification either. I do not see the justification for assigning this curve 40% of the probability space.
I don’t think 75th percentile is an ‘edge case’ but I do agree that it is suspicious.
I think that the ‘superexponential’ curves are describing a future phenomenon, for reasons that everyone involved understands, that one would not expect to match backwards in time unless you went to the effort of designing equations to do that, which doesn’t seem worthwhile here.
Is There A Flawed Graph Still Up?
This is the graph in question; the issues with it are in the process of being addressed.
I agree that various aspects of this graph and how it was presented weren’t great, especially using a 15% easier-each-time doubling curve rather than the 10% that AI 2027 actually uses, and calling it ‘our projection.’ I do think it mostly serves the purpose of giving a rough idea what is being discussed, but more precision would have been better, and I am glad this is being fixed.
Some Skepticism About Projection
This objection is largely that there are only 11 data points (there are now a few more) on the METR curve, and you can fit them with curves that look essentially the same now but give radically different future outcomes. And yes, I agree, that is kind of the point, and if anything we are underrepresenting the uncertainty here. We can agree that even if we commit to using fully simplified, fully best-fit-to-the-past models, we get a range of outcomes that prominently includes 2028-2029 SCs.
I do think it is reasonable to say that the superexponential curve, the way AI 2027 set it up, has more free variables than you would like when fitting 11 data points, if that’s all you were looking to do, but a lot of these parameters are far from free and are not being chosen in order to fit the past curve data.
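A quick way to see the 11-data-point issue for yourself (synthetic stand-in numbers, not METR’s actual data): fit two curve families that are nearly indistinguishable in-sample and watch them disagree out of sample.

```python
import numpy as np

rng = np.random.default_rng(0)
t = np.linspace(0, 60, 11)                   # months of "data"
log_h = -5 + t / 7 + rng.normal(0, 0.3, 11)  # log2(horizon), roughly linear

exp_fit = np.polyfit(t, log_h, 1)   # exponential horizon growth
sup_fit = np.polyfit(t, log_h, 2)   # mildly superexponential

for name, coef in [("exp", exp_fit), ("superexp", sup_fit)]:
    rmse = float(np.sqrt(np.mean((log_h - np.polyval(coef, t)) ** 2)))
    h_96 = 2 ** np.polyval(coef, 96.0)  # extrapolate 3 years past the data
    print(f"{name:>8}: in-sample RMSE {rmse:.3f}, horizon at month 96 = {h_96:.1f}")
```

Both fits describe the past about equally well; the extrapolations differ anyway. That is the uncertainty the model is trying to represent, not a flaw you can fit your way out of.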
Part 2: Benchmarks and Gaps and Beyond
Benchmarks
We now move on to the second more complex model, which Titotal says in many ways is worse, because if you use a complicated model you have to justify the complications, and it doesn’t.
I think a better way to describe the second model is that it predicts a transition in the rate of progress around capabilities similar to saturation of RE-Bench, after which things will move at a faster pace, and uses the RE-Bench point as a practical way of simulating this.
Method 2 starts by predicting how long it would take to achieve a particular score (referred to as “saturation”) on Re-bench, a benchmark of AI skill on a group of ML research engineering tasks, also prepared by METR. After that, the time horizon extension model is used as with method 1, except that it starts later (when Re-bench saturates), and that it stops earlier (when a certain convoluted threshold is reached).
After that stopping point, 5 new gaps are estimated, which are just constants (as always, sampled from lognormal), and then the whole thing is run through an intermediate speedup model. So any critiques of model 1 will also apply to model 2; there will just be some dilution with all the constant gap estimates and the “re-bench” section.
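For concreteness on the ‘constants sampled from lognormals’ step, here is a sketch of how you turn a forecaster’s 80% CI for each gap into samples and a combined distribution. The CIs below are invented placeholders, not Eli’s or Nikola’s numbers.

```python
import numpy as np

rng = np.random.default_rng(1)

def lognormal_from_80ci(lo, hi, n):
    # Choose mu, sigma so the 10th/90th percentiles hit (lo, hi).
    z90 = 1.2816  # standard normal 90th percentile
    mu = (np.log(lo) + np.log(hi)) / 2
    sigma = (np.log(hi) - np.log(lo)) / (2 * z90)
    return rng.lognormal(mu, sigma, n)

gap_80cis_months = [(0.5, 6), (1, 12), (0.5, 4), (1, 8), (0.5, 3)]
total = sum(lognormal_from_80ci(lo, hi, 100_000) for lo, hi in gap_80cis_months)
print("median total gap:", round(float(np.median(total)), 1), "months")
print("80% CI:", np.round(np.percentile(total, [10, 90]), 1), "months")
```

The summed gaps add a moderately uncertain, roughly constant delay on top of whatever the time horizons machinery says, which is the dilution titotal refers to.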
The reason to start later is obvious, you can’t start actually using AI skill for ML research tasks until it can beat not using it. So what you actually have is a kind of ‘shadow curve’ that starts out super negative – if you tried to use AI to do your ML tasks in 2017 you’d very obviously do way worse than doing it yourself. Then at some point in the 2020s you cross that threshold.
We also need a top of the curve, because this is a benchmark and by its nature it saturates even if the underlying skills don’t. In some senses the top of the S-curve is artificial, in some it isn’t.
Titotal points out that you can’t meaningfully best-fit an S-curve until you know you’ve already hit the top, because you won’t know where the top is. The claim is that we have no idea where the benchmark saturates, and that projecting it to be 2 is arbitrary. To which I’d say, I mean, okay, weird but if true who cares? If the maximum is 3 and we approach that a bit after we hit 2, then that’s a truth about the benchmark, not about Reality, and nothing important changes. And it turns out Titotal noticed this too: as long as you’re above human performance it doesn’t change things substantially. So why are we having this conversation?
This is a general pattern here. It’s virtuous to nitpick, but you should know when you’re nitpicking and when you’re not.
When you’re doing forecasting or modeling, you have to justify your decisions if and only if those decisions matter to the outcome. If it does not matter, it does not matter.
Speaking of doesn’t matter, oh boy does it not matter.
Eli gives an 80% CI of saturation between September 2025 and January 2031, and Nikola gives an 80% CI of saturation between August 2025 and November 2026. Neither of these is the same as the 80% CI in the first of the two graphs, which is early 2026 to early 2027. Both distributions peak about half a year earlier than the actual Re-bench calculation, although Eli’s median value is substantially later.
Eli has told me that the final estimates for saturation time are “informed” by the logistic curve fitting, but if you look above they are very different estimates.
Those are indeed three very different curves. It seems that the calculation above is an intuition pump or baseline, and they instead go with the forecasters’ predictions, with Nikola expecting it to happen faster than the projection, and Eli having more uncertainty. I do think Nikola’s projection here seems unreasonably fast and I’d be surprised if he hasn’t updated by now?
Eli admits the website should have made the situation clear and he will fix it.
Titotal says we’ve ‘thrown out’ the RE-Bench part of the appendix. I say no, that’s not how this works. Yes, we’re not directly doing math with the output of the model above, but we are still projecting the RE-Bench results and using that to inform the broader model. That should have been made clear, and I am skeptical of Eli and Nikola’s graphs on this, especially the rapid sudden peak in Nikola’s, but the technique used is a thing you sometimes will want to do.
The Time Horizon Part of the Second Model
So basically we now do the same thing we did before except a lot starts in the future.
Titotal: Okay, so we’ve just thrown out the re-bench part of the appendix. What happens next? Well, next, we do another time horizons calculation, using basically the same methodology as in method 1. Except we are starting later now, so:
They guess the year that we hit re-bench saturation.
They guess the time horizon at the point we hit re-bench saturation.
They guess the doubling time at the point when we hit re-bench saturation.
They guess the velocity of R&D speedup at the point when we hit re-bench saturation.
Then, they use these parameters to do the time horizons calculation from part 1, with a lower cut-off threshold I will discuss in a minute.
And they don’t have a good basis for these guesses, either. I can see how saturating RE-bench could give you some information about the time horizon, but not things like the doubling time, which is one of the most crucial parameters that is inextricably tied to long term trends.
Setting aside the cutoff, yes, this is obviously how you would do it. Before, we estimated those variables as of now; if you start in the future, you want to know what they look like as you reach the pivot point.
Presumably you would solve this by running your model forward in the previous period, the same way you did in the first case? Except that this is correlated with the pace of RE-Bench progress, so that doesn’t work on its own. My guess is you would want to assign some percent weight to the date and some percent to what it would look like on your median pivot date.
And the estimation of doubling time is weird. The median estimate for doubling time at re-bench saturation is around 3 months, which is 33% lower than their current estimate for doubling time. Why do they lower it?
Well, partly because under the superexponential model there would have been speedups during the re-bench saturation period.
Titotal then repeats the concern about everything being superexponential, but I don’t see the issue on this one, although I would do a different calculation to decide on my expectation here.
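To see how a number like ‘33% lower’ could fall out of the mechanism Eli points to, a toy check with made-up parameters: if doubling time shrinks 10% per doubling under the superexponential model, roughly four doublings between now and RE-Bench saturation lands you there.

```python
d0, decay = 4.5, 0.10          # hypothetical current doubling time (months)
for k in range(1, 7):          # k = doublings before RE-Bench saturation
    d_k = d0 * (1 - decay) ** k
    print(f"{k} doublings: {d_k:.2f} months ({1 - d_k / d0:.0%} below current)")
```

Whether four-ish doublings is the right number is exactly the kind of thing you would want the forward simulation, rather than a separate judgment call, to pin down.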
I also don’t understand the ‘this simulation predicts AI progress to freeze in place for two years’ comment, as in I can’t parse why one would say that there.
Why The Thresholds?
And now here’s where we come to a place where I actually am more concerned than Titotal is:
The other main difference is that this time horizons model only goes to a lower threshold, corresponding to when AI hits the following requirement:
“Ability to develop a wide variety of software projects involved in the AI R&D process which involve modifying a maximum of 10,000 lines of code across files totaling up to 20,000 lines. Clear instructions, unit tests, and other forms of ground-truth feedback are provided. Do this for tasks that take humans about 1 month (as controlled by the “initial time horizon” parameter) with 80% reliability, at the same cost and speed as humans.”
Despite differing by two orders of magnitude on the time horizon required for SC in the first method, when it comes to meeting this benchmark they are in exact agreement on this threshold, which they both put at a median of half a month.
This is weird to me, but I won’t dwell on it.
I kind of want to dwell on this, and how they are selecting the first set of thresholds, somewhat more, since it seems rather important. I want to understand how these various disagreements interplay, and how they make sense together.
That’s central to how I look at things like this. You find something suspicious that looks like it won’t add up right. You challenge. They address it. Repeat.
The Gap Model
I think I basically agree with the core criticism here, that this consists of guessing things about future technologies in a way that seems hard to get usefully right. It really is mostly a bunch of guessing, and it’s not clear that this complexity is helping the model be better than making a more generalized guess, perhaps using this as an intuition pump. I’m not sure. I don’t think this is causing a major disagreement in the mainline results, though?
I don’t understand the perspective that this is a ‘bad response.’ It seems like exactly how all of this should work: they are fixing mistakes, addressing communication issues, responding to the rest, and even, unprompted, offering a $500 bounty payment.
Here is Eli’s response on the ‘most important disagreements’:
Whether to estimate and model dynamics for which we don’t have empirical data. e.g. titotal says there is “very little empirical validation of the model,” and especially criticizes the modeling of superexponentiality as having no empirical backing. We agree that it would be great to have more empirical validation of more of the model components, but unfortunately that’s not feasible at the moment while incorporating all of the highly relevant factors.[1]
Whether to adjust our estimates based on factors outside the data. For example, titotal criticizes us for making judgmental forecasts for the date of RE-Bench saturation, rather than plugging in the logistic fit. I’m strongly in favor of allowing intuitive adjustments on top of quantitative modeling when estimating parameters.
[Unsure about level of disagreement] The value of a “least bad” timelines model. While the model is certainly imperfect due to limited time and the inherent difficulties around forecasting AGI timelines, we still think overall it’s the “least bad” timelines model out there and it’s the model that features most prominently in my overall timelines views. I think titotal disagrees, though I’m not sure which one they consider least bad (perhaps METR’s simpler one in their time horizon paper?). But even if titotal agreed that ours was “least bad,” my sense is that they might still be much more negative on it than us. Some reasons I’m excited about publishing a least bad model:
Reasoning transparency. We wanted to justify the timelines in AI 2027, given limited time. We think it’s valuable to be transparent about where our estimates come from even if the modeling is flawed in significant ways. Additionally, it allows others like titotal to critique it.
Advancing the state of the art. Even if a model is flawed, it seems best to publish to inform others’ opinions and to allow others to build on top of it.
My read, as above, is that titotal indeed objects to a ‘least bad’ model if it is presented in a way that doesn’t have ‘bad’ stamped all over it with a warning not to use it for anything. I am strongly with Eli here. I am also with Thane that being ‘least bad’ is not on its own enough, reality does not grade on a curve and you have to hit a minimum quality threshold to be useful, but I do think they hit that.
As discussed earlier, I think #1 is also an entirely fair response, although there are other issues to dig into on those estimates and where they come from.
The likelihood of time horizon growth being superexponential, before accounting for AI R&D automation. See this section for our arguments in favor of superexponentiality being plausible, and titotal’s responses (I put it at 45% in our original model). This comment thread has further discussion. If you are very confident in no inherent superexponentiality, superhuman coders by end of 2027 become significantly less likely, though are still >10% if you agree with the rest of our modeling choices (see here for a side-by-side graph generated from my latest model).
How strongly superexponential the progress would be. This section argues that our choice of superexponential function is arbitrary. While we agree that the choice is fairly arbitrary and ideally we would have uncertainty over the best function, my intuition is that titotal’s proposed alternative curve feels less plausible than the one we use in the report, conditional on some level of superexponentiality.
Whether the argument for superexponentiality is stronger at higher time horizons. titotal is confused about why there would sometimes be a delayed superexponential rather than starting at the simulation starting point. The reasoning here is that the conceptual argument for superexponentiality is much stronger at higher time horizons (e.g. going from 100 to 1,000 years feels likely much easier than going from 1 to 10 days, while it’s less clear for 1 to 10 weeks vs. 1 to 10 days). It’s unclear that the delayed superexponential is the exact right way to model that, but it’s what I came up with for now.
I don’t think 3b here is a great explanation, as I initially misunderstood it, but Eli has clarified that its intent matches my earlier statements about shifting to longer tasks clearly becoming easier at some point past the ‘learn the basic components’ stage. I also worry this drops out a bunch of the true objections, especially the pointing towards multiple different sources of superexponentiality (we have both automation of AI R&D and a potential future drop in the difficulty curve of tasks), which he lists under ‘other disagreements’ and says he hasn’t looked into yet. I think that’s probably the top priority to look at here at this point. I find the ‘you have to choose a curve and this seemed like the most reasonable one’ response to be, while obviously not the ideal world state, highly reasonable in context.
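Mechanically, Eli’s ‘delayed superexponential’ is easy to state even if the threshold is debatable. A minimal sketch (the threshold and rates here are mine, purely illustrative): doubling time holds constant until the horizon passes some length, here roughly a work-month, then shrinks per doubling.

```python
def years_to_horizon(target_hours, h=8.0, d=4.5,
                     kick_in_hours=160.0, decay=0.10):
    # Constant 4.5-month doublings from a 1-day (8-hour) horizon,
    # switching to 10%-faster-each-time doublings once the horizon
    # passes roughly one work-month.
    t = 0.0
    while h < target_hours:
        t += d
        h *= 2
        if h >= kick_in_hours:
            d *= 1 - decay
    return t / 12

print(round(years_to_horizon(1e6), 1), "years to a ~century-scale horizon")
```

Move kick_in_hours around and the date shifts, which is titotal’s complaint; but the qualitative shape, later doublings coming faster once the basic components are learned, survives most placements of the threshold.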
He then notes two other disagreements and acknowledges three mistakes.
On Eli’s Recent Update
Eli released an update in response to a draft of the Titotal critiques.
The new estimates are generally a year or two later, which mostly matches the updates I’d previously seen from Daniel Kokotajlo. This seems like a mix of model tweaks and adjusting for somewhat disappointing model releases over the last few months.
Overall Titotal is withholding judgment until Eli writes up more about it, which seems great, and also offers initial thoughts. Mostly he sees a few improvements but doesn’t believe his core objections are addressed.
Titotal challenges the move from a 40% chance of superexponential curves to a 90% chance of an eventual such curve, although Eli notes that the 90% includes a lot of probability put on very large time horizon levels and thus doesn’t impact the answer that much. I see why one would generally be concerned about double counting, but I believe that I understand this better now, and they are not double counting.
Conclusion
Titotal wraps up by showing you could draw a lot of very distinct graphs that ‘fit the data’ where ‘the data’ is METR’s results. And yes, of course, we know this, but that’s not the point of the exercise. No, reality doesn’t ‘follow neat curves’ all that often, but AI progress remarkably often has so far, and also we are trying to create approximations and we are all incorporating a lot more than the METR data points.
If you want to look at Titotal’s summary of why bad thing is bad, it’s at this link. I’ve already addressed each of these bullet points in detail. Some I consider to point to real issues, some not so much.
What is my overall take on the right modeling choices?
Simplicity is highly valuable. As the saying goes, make everything as simple as possible, but no simpler. There’s a lot to be said for mostly relying on something that has the shape of the first model, with the caveat of more uncertainty in various places, and that the ‘superexponential’ effects have an uncertain magnitude and onset point. There are a few different ways you could represent this. If I was doing this kind of modeling I’d put a lot more thought into the details than I have had the chance to do.
I would probably drop the detailed considerations of future bottlenecks and steps from the ultimate calculation, using them more as an intuition pump, the same way they currently calculate re-bench times and then put the calculation in the trash (see: plans are worthless, planning is essential.)
If I was going to do a deep dive, I would worry about whether we are right to combine these different arguments for superexponential progress, as in both AI R&D feedback loops and ease of future improvements, and whether either or both of them should be incorporated into the preset trend line or whether they have other issues.
The final output is then of course only one part of your full model of Reality.
At core, I buy the important concepts as the important concepts. As in, if I was using my own words for all this:
AI progress continues, although a bit slower than we would have expected six months ago. Progress since then has made a big practical difference; it’s kind of hard to imagine going back to the models of even six months ago, but proper calibration means that can still be disappointing.
In addition to scaling compute and data, AI itself is starting to accelerate the pace at which we can make algorithmic progress in AI. Right now that effect is real but modest, but we’re crossing critical thresholds where it starts to make a big difference, and this effect probably shouldn’t be considered part of the previous exponentials.
The benefit of assigning tasks to AI starts to take off when you can reliably assign tasks for the AI without needing continuous human supervision, and now can treat those tasks as atomic actions not requiring state.
If AI can take humans out of the effective loops in this research and work for more extended periods, watch the hell out (on many levels, but certainly in terms of capabilities and algorithmic progress.)
Past a certain point where you can reliably do what one might call in-context atomic components, gaining the robustness and covering the gaps necessary to do this more reliably starts to get easier rather than harder, relative to the standard exponential curves.
This could easily ‘go all the way’ to SC (and then quickly to full ASI) although we don’t know that it does. This is another uncertainty point, also note that AI 2027 as written very much involves waiting for various physical development steps.
Thus, without making any claims about what the pace of all this is (and my guess is it is slower than they think it is, and also highly uncertain), the Baseline Scenario very much looks like AI 2027, but there’s a lot of probability mass also on other scenarios.
One then has to ask what happens after you get this ‘superhuman coder’ or otherwise get ASI-like things of various types.
Which all adds up to me saying that I agree with Eli that none of the criticisms raised here challenges, to me, the ultimate or fundamental findings, only the price. The price is of course what we are here to talk about, so that is highly valuable even within relatively narrow bands (2028 is very different from 2029 because of reasons, and 2035 is rather different from that, and so on).
I realize that none of this is the kind of precision that lets you land on the moon.
Perhaps The Most Important Disagreement
The explanation for all this is right there: This is a physicist, holding forecasting of AI timelines to the standards of physics models. Well, yeah, you’re not going to be happy. If you try to use this to land on the moon, you will almost certainly miss the moon, the same way that if you try to use current alignment techniques on a superintelligence, you will almost certainly miss and then you will die.
One of the AI 2027 authors joked to me in the comments on a recent article that “you may not like it but it’s what peak AI forecasting performance looks like”.
Well, I don’t like it, and if this truly is “peak forecasting”, then perhaps forecasting should not be taken very seriously.
Maybe this is because I am a physicist, not a Rationalist. In my world, you generally want models to have strong conceptual justifications or empirical validation with existing data before you go making decisions based off their predictions: this fails at both.
Yes, in the world of physics, things work very differently, and we have much more accurate and better models. If you want physics-level accuracy in your predictions of anything that involves interactions of humans, well, sorry, tough luck. And presumably everyone agrees that you can’t have a physics-quality model here and that no one is claiming to have one? So what’s the issue?
The issue is whether basing decisions on modeling attempts like this is better than basing them on ‘I made it up’ or not having probabilities and projections at all and vibing the damn thing.
What I’m most against is people taking shoddy toy models seriously and basing life decisions on them, as I have seen happen for AI 2027.
…
I am not going to propose an alternate model. If I tried to read the tea leaves of the AI future, it would probably also be very shaky. There are a few things I am confident of, such as a software-only singularity not working and that there will be no diamondoid bacteria anytime soon. But these beliefs are hard to turn into precise yearly forecasts, and I think doing so will only cement overconfidence and leave people blindsided when reality turns out even weirder than you imagined.
The forecast here is ‘precise’ in the sense that it has a median, and we have informed people of that median. It is not ‘precise’ in the sense of putting a lot of probability mass on that particular median, even as an entire year, or even in the sense that the estimate wouldn’t change with more work or better data. It is precise in the sense that, yes, Bayes Rule is a thing, and you have to have a probability distribution, and it’s a lot more useful to share it than not share it.
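One way to see the distinction, with a made-up distribution: a forecast can have a sharp median while putting only modest probability on the median year itself.

```python
from scipy import stats

# Hypothetical wide lognormal over "years until SC", median 5 years out.
dist = stats.lognorm(s=0.8, scale=5.0)
m = dist.median()
mass = dist.cdf(m + 0.5) - dist.cdf(m - 0.5)
print(f"median: {m:.1f} years; probability on the median year: {mass:.0%}")
```

Reporting the median is useful; treating it as a promise that the median year is when it happens is the reader’s error, not the forecaster’s.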
I do find that the AI 2027 arguments updated me modestly towards a faster distribution of potential outcomes. I find 2027 to be a totally plausible time for SC to happen, although my median would be substantially longer.
You can’t ‘not base life decisions’ on information until it crosses some (higher than this) robustness threshold. Or I mean you can, but it will not go great.
In conclusion, I once again thank Titotal for the excellent substance of this critique, and wish it had come with better overall framing.