AIs will greatly change engineering in AI companies well before AGI

by ryan_greenblatt · 9th Sep 2025 · AI Alignment Forum · 13 min read
Comments
Noosphere89:

I generally agree with this, so I'll just elaborate on disagreements; assume I agree with anything I don't mention.

On Amdahl's law:

  • These AIs wouldn't be able to automate some tasks (without a human helping them) and this bottleneck would limit the speed-up due to Amdahl's law.

While I agree in the context of the post, I generally don't like Amdahl's law arguments and tend to think they're a midwit trap: people forget that more resources don't just let old problems be solved more efficiently, they also make new problems practical at all. This is why I believe parallelization is usually better than pessimists argue, due to Gustafson-Barsis's law.

This doesn't matter here, but it does matter once you fully automate a field.

  • There is an obvious consequence that this will cause increased awareness and salience of: AI, AI automating AI R&D, and the potential for powerful capabilities in the short term.

So I agree there will be more salience, but I generally expect this to be pretty restrained. In genpop, I expect much more discontinuous salience and responses, and much weaker responses until we have full automation of AI R&D at least, and maybe even longer than that.

A key worldview difference is that I expect genpop has already believed in, or been motivated to hear, this argument for a very long time, regardless of whether it is correct:

"Now that we've seen AIs automate AI R&D and no one is even claiming that we're seeing explosive capabilities growth, the intelligence explosion has been disproven; compute bottlenecks really are decisive. (Or insert whichever bottleneck this person believes is most important.) The intelligence explosion must have been bullshit all along and look, we don't see any of these intelligence explosion proponents apologizing for being wrong, probably they're off inventing some new milestone of AI to fearmonger about." 

Gavin Runeblade:

A factor I didn't see you include is the changing percentage of work performed by top engineers vs. average and poor engineers. Here is one article aimed at the general labor pool:

https://fortune.com/2025/08/26/stanford-ai-entry-level-jobs-gen-z-erik-brynjolfsson/

But the effect is extremely pronounced in software engineering, much more so than the average across labor fields. While the increase in performance may be 1.05, it is being applied to the top performers, who are much further above baseline than that indicates. From recent research on the topic (https://80000hours.org/career-guide/personal-fit/):

"A small percentage of the workers in any given domain is responsible for the bulk of the work. Generally, the top 10% of the most prolific elite can be credited with around 50% of all contributions, whereas the bottom 50% of the least productive workers can claim only 15% of the total work, and the most productive contributor is usually about 100 times more prolific than the least." 

So what we are seeing is that the low-end workers are getting let go and not replaced, the mid tier to a lesser extent, and that top 10% are doing more and more of the work. This alone, changing the ratio of work hours to more heavily favor the most productive workers, has a big impact. And amplifying them is far more effective than amplifying the average or bottom tier of workers.

Long story short (too late): I think even 1.2 is too low, given the actual workers are between 10x and 100x the ones who are getting let go.


In response to my recent post arguing against above-trend progress from better RL environments, yet another argument for short(er) AGI timelines was raised to me:

Sure, at the current rate of progress, it would take a while (e.g. 5 years) to reach AIs that can automate research engineering within AI companies while being cheap and fast (superhuman coder). But we'll get large increases in the rate of AI progress well before superhuman coder due to AIs accelerating AI R&D. Probably we'll see big enough accelerations to really cut down on timelines to superhuman coder once AIs can somewhat reliably complete tasks that take top human research engineers 8 hours. After all, such AIs would probably be capable enough to greatly change research engineering in AI companies.

I'm skeptical of this argument: I think that it's unlikely (15%) that we see large speed-ups (>2x faster overall AI progress) due to AIs which are only able to complete 8-hour tasks somewhat reliably[1]. I do think we'll probably see massive speed-ups in overall AI progress due to AIs accelerating AI R&D, but I think this will require a higher level of capability. I also think that non-massive speed-ups on the way to very capable AIs will substantially shorten timelines[2] and will result in large changes to (research engineering) workflows in AI companies. I predict that these engineering workflow changes will be very salient to AI company employees and I discuss implications of this below.

Concretely, let's imagine AIs which can complete 8-hour reasonably self-contained AI R&D engineering tasks[3] (within an AI developer) around 50% of the time. I'll call these "8-hour AIs".

First, when would we expect to see such AIs? For this argument to go through, we need to see these 8-hour AIs relatively soon. One way of estimating this is to look at the horizon length trends on METR's task suite[4]. AIs which can do 8-hour real-world AI R&D engineering tasks would probably have a longer than 8-hour time horizon on METR's task suite because real-world tasks are probably relatively harder for AIs than benchmark tasks. To be charitable to the argument I'm responding to, I'll assume 16-hour 50% reliability time horizons on METR's task suite corresponds to 8-hour real-world AI R&D engineering tasks. (Correspondingly, note that when I say "8-hour AIs", I'm not referring to the number measured by METR! I expect that "8-hour AIs" occur at least somewhat after we see 8-hour 50% reliability time horizons on METR's task suite!) If we extrapolate out a 170-day doubling time, we'd expect 16-hour 50% reliability to happen around 1.3 years from now. (We expect to see around 3-hour 80% reliability time horizons on METR's task suite at this time.) Thus, it seems pretty likely this happens soon, meaning that if such AIs were able to greatly accelerate AI R&D this could shorten timelines considerably.
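
As a quick, concrete version of this extrapolation, here's a minimal sketch. The current-horizon value is an assumed placeholder backed out to be roughly consistent with the ~1.3-year figure above; it is not a measured METR number:

```python
import math

# Assumed current 50%-reliability time horizon on METR's task suite (a placeholder
# backed out to match the ~1.3-year figure above, not a measured value).
current_horizon_hours = 2.3
target_horizon_hours = 16.0   # proxy used above for "8-hour AIs" on real tasks
doubling_time_days = 170      # horizon-length doubling time used above

doublings = math.log2(target_horizon_hours / current_horizon_hours)
years = doublings * doubling_time_days / 365
print(f"{doublings:.1f} doublings -> ~{years:.1f} years")
# With these assumptions: ~2.8 doublings -> ~1.3 years.
```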

Could AIs able to complete 8-hour self-contained AI R&D engineering tasks accelerate AI R&D enough to make a big dent in timelines? I'll argue that this is unlikely in two ways:

  • First, I'll argue somewhat mechanistically that it seems hard for AIs which are only this capable to speed things up this much.
  • Next, I'll show that some simple curve fitting also makes a high level of acceleration look unlikely.

AI progress speed-ups don't seem large enough

We'll try to guess how much these 8-hour AIs speed up research engineers within the AI company and then guess how much this research engineering acceleration would speed up overall AI progress. In practice, the effect of these 8-hour AIs wouldn't be well described as just speeding up engineers: probably the engineers would do more work in parallel and would do different kinds of work. But, we can attempt to guess an equivalent speed-up in terms of usefulness; as in, the effect of these AIs is as good as an X times speed-up to the company's research engineers (when they are doing work which is reasonably centrally research engineering work). It seems like it would be hard for these 8-hour AIs to speed up engineers by more than 4x:

  • These AIs wouldn't be able to automate some tasks (without a human helping them) and this bottleneck would limit the speed-up due to Amdahl's law.
  • These AIs probably wouldn't be able to help much with non-coding tasks (e.g., employees communicating to share state) and in aggregate these are a substantial fraction of the job. (AIs which fully automate research engineering could bypass the need to share a bunch of state with human engineers, though some communication skills would still be needed.)
  • It seems like it would often be hard for the human working with the AI to quickly gain enough context to help the AI (when it gets stuck or messes up).
  • Lack of reliability would slow things down (especially if AIs remain relatively worse than humans at understanding whether they've succeeded).

AIs could compensate for these limitations by doing (much) more of the work they are particularly good at, but there isn't a strong reason to think this effect makes a huge difference at this level of capability. It's worth emphasizing that a key advantage of the AIs is that they could be much faster than humans. All things considered, I expect substantially less than 4x speed-ups to research engineering from such AIs; after thinking about it some, I ended up feeling like maybe 2x is about right for 8-hour AIs, once human engineers have had some time to adapt. See "Appendix: sanity check of engineering speed-up" for a sanity check which yields a similar result.

How much would a 2x acceleration to research engineering boost the rate of AI progress?

Research engineering is only a subset of the labor going into AI R&D, so this isn't as good as accelerating all labor by 2x. Further, labor is only one input going into AI R&D: another key input is compute for experiments. That said, engineering labor is a very important input and faster/more engineering labor doesn't just allow for implementing more experiments and making training runs more efficient: critically, it can also make experiments cheaper (use less compute) and better (see this breakdown from ai-2027 for ways accelerated engineering labor can speed up AI R&D).

AI R&D progress (algorithms and software) isn't the only thing driving AI progress; AI progress is also driven by scaling up the compute used for training runs (and scaling up spending on acquiring data).

Using this breakdown, I do a more involved estimate in "Appendix: How do speed-ups to engineering translate to overall AI progress speed-ups?". In short: because engineering is only one input into AI R&D, speeding up engineering a lot only speeds up AI R&D somewhat and AI R&D is around 55% of what's driving AI progress, applying a further discount. I come to an overall estimate that a 2x engineering speed-up would only (somewhat charitably) yield a 1.2x speed-up to overall AI progress while a 4x engineering speed-up would only (somewhat charitably) yield a 1.5x speed-up in overall AI progress.

I think a 1.5x speed-up in overall AI progress is substantially more than I expect for 8-hour AIs (or in about 1.3 years) as getting an estimate this high required being pretty charitable in several places.

(I'd estimate 1.15x overall AI progress acceleration as a central estimate from this methodology for 8-hour AIs (after some adaptation time), by guessing a 2x engineering speed-up and using somewhat less charitable constants in the conversion. My central estimate is lower (1.1x) for 1.3 years from now because I expect 8-hour AIs (as I defined above) to come somewhat later than this and there is a need for some adaptation time.)

Interpolating between now and superhuman coder doesn't indicate large speed-ups within 2 years

Here's another strategy we can apply:

  • Let's say we're interested in analyzing the speed-up we expect in around 1.5 years (because this is around when we expect these 8-hour AIs).
  • We have some guess at the current speed-up in AI R&D, some guess at the speed-up at the point of superhuman coder, and some guess at when we'll see superhuman coder given the current rate of AI progress.
  • We might expect that this speed-up to AI R&D increases roughly exponentially over time, so we can get a sense of what speed-ups will be like along the way by exponentially interpolating between now and superhuman coder.

Now let's apply this simple curve fitting strategy. What speed-up should we expect right now? I'd guess we're (charitably) seeing around 1.2x engineering speed-up[5] and applying the method in "Appendix: How do speed-ups to engineering translate to overall AI progress speed-ups?", we'd expect maybe around a 1.05x overall AI progress speed-up (which generally seems reasonable though a bit high to me). We'll assume 5 years until superhuman coder at the current rate of AI progress and that the AI R&D acceleration at the point of superhuman coder is 5x (the same number used in ai-2027). Technically, we want the time to superhuman coder at the unaccelerated current rate of AI progress, but current acceleration is small, so this doesn't make much of a difference either way. We have to convert this AI R&D acceleration number into an overall AI progress number which yields around 3.6x faster overall AI progress.
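
This last conversion isn't spelled out above; one simple reading that reproduces the numbers used here (taking the charitable ~65% share of AI progress driven by AI R&D from the appendix and leaving the remaining ~35% unaccelerated) is a weighted average:

$$0.65 \cdot 5 + 0.35 \cdot 1 = 3.6$$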

We want to exponentially interpolate between 1.05x and 3.6x to map from the number of years of AI progress (as in, years of AI progress from the present at the current unaccelerated rate) to the level of acceleration. This gets us: $1 + 0.05 \cdot (2.6/0.05)^{Y/5}$, where $Y$ is the number of years of AI progress at the current (unaccelerated) rate.[6] If we plug in 1.3 years, we get a 1.14x overall AI progress speed-up. At 2 years, we get a 1.24x overall AI progress speed-up.

Thus, this interpolation also doesn't predict large speed-ups in two years and the speed-ups it predicts roughly line up with the estimate from the prior section.[7] Of course, this is just a simple model to get a sense for what might happen.

One objection you might have is that by the time 2 years have passed we'll actually be further along than just extrapolating out the current rate of progress because we're already seeing non-trivial overall AI progress speed-ups at that point, so the numbers I've given are underestimates for what we'll actually see at that point. In other words, 2 years at the current rate of progress will get us to a point where we have 1.24x overall AI progress speed-up (which is non-trivial), so in 2 actual calendar years, we'll get substantially further than 2 default years of progress due to this non-trivial speed-up.

To model this, we can set up a differential equation:

$$\frac{dY}{dt} = 1 + 0.05 \cdot (2.6/0.05)^{Y/5}$$

Y is the "number of years of AI progress at the (unaccelerated) current rate" while t is the number of actual calendar years from the present.

If we run this, the overall AI progress speed-ups at 1.3 and 2 years produced by the model are only slightly higher, at 1.15x and 1.30x respectively. Superhuman coder does arrive substantially faster (around 3.5 years instead of 5 years), but it still takes 3 years before we see larger than 2x speed-ups to overall AI progress.
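
Here is a minimal numerical sketch of this model, a simple Euler integration of the differential equation above; with these constants it reproduces the quoted speed-ups to within rounding:

```python
def speedup(Y: float) -> float:
    """Overall AI progress multiplier after Y unaccelerated-years of progress."""
    return 1 + 0.05 * (2.6 / 0.05) ** (Y / 5)

def run(dt: float = 1e-3, max_years: float = 6.0) -> None:
    t, Y = 0.0, 0.0
    while t < max_years and Y < 5.0:  # Y = 5 corresponds to superhuman coder
        if abs(t - 1.3) < dt / 2 or abs(t - 2.0) < dt / 2:
            print(f"t = {t:.1f} calendar years: overall speed-up ~ {speedup(Y):.2f}x")
        Y += speedup(Y) * dt  # Euler step: dY/dt = speedup(Y)
        t += dt
    print(f"superhuman coder (Y = 5) reached after ~{t:.1f} calendar years")

run()
# Approximate output: 1.15x at 1.3 years, 1.30x at 2 years, and superhuman coder
# after roughly 3.4-3.5 calendar years (versus 5 years unaccelerated).
```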

You could disagree with this model because:

  • You think the current speed-up to overall AI progress is much higher (e.g. 1.2x rather than 1.05x)
  • You think the overall AI progress speed-up at the point of superhuman coder is much higher (e.g. 8x rather than 3.6x)
  • You expect progress multipliers to follow a very different curve than exponential.

Though note that the first two of these adjustments each only make a moderate difference to the bottom line (setting the current overall AI progress speed-up to 1.2x or setting the superhuman coder overall speed-up to 8x each only shorten the timeline to superhuman coder by half a year[8]).

What about speedups from mechanisms other than accelerating engineering?

Above, I discuss AIs accelerating R&D work that's similar to the sorts of work that human employees might otherwise do (though my discussion was mostly specific to engineering). One alternative story is that AIs will be able to accelerate AI progress via doing something very different from adding labor similar to the labor humans do. For instance, AIs might generate huge amounts of higher quality data (e.g. RL environments) to train future AIs on and this could result in a feedback loop that accelerates AI progress. I'm currently skeptical that the data generation feedback loop story will result in substantially above trend progress as I discuss here, but it's worth emphasizing that the arguments I discuss above only apply to AIs accelerating AI R&D via labor which is somewhat similar to the labor that humans do for AI R&D.

I think speedups from mechanisms other than AIs doing labor similar to what humans do are a major reason we might see large accelerations (>2x overall AI progress acceleration) from 8-hour AIs. (Concretely, I think accelerations this large are maybe like 15% likely and about 7% of this is due to mechanisms other than AIs accelerating labor humans might do.)

Other reasons to expect very short timelines

I obviously don't address all the reasons you might expect very short (<3 year) timelines to full AI R&D automation. I do discuss some of these in two prior posts.

Implications of a several-year period prior to full automation of AI R&D where research engineering is greatly changed

Above, I've focused on arguing that 8-hour AIs won't be able to accelerate AI R&D enough to shorten timelines substantially. More generally, I've argued that the first levels of capability which suffice for greatly accelerating research engineering and changing how it is done won't suffice for greatly accelerating overall AI progress (i.e., yielding overall AI progress which is 2x faster). And, insofar as you buy something like the (uncertain) extrapolations and interpolations I gave above, it seems like we'll have years where engineering in AI companies is greatly accelerated by AI before it is fully automated.

In some sense, "engineering at AI companies will be partially automated and greatly accelerated for something like 1.5-4 years before we see full automation of AI R&D (or even full automation of engineering)" is an obvious implication of longer timelines combined with reasonably continuous takeoff that extrapolates out current trends. But, nonetheless, I don't think I fully thought through the consequences of this.

There is an obvious consequence that this will cause increased awareness and salience of: AI, AI automating AI R&D, and the potential for powerful capabilities in the short term. (This seems especially true within AI companies where this automation is first taking place.) And this will make it easier to study some aspects of AIs automating AI R&D. And this might make AI safety R&D go faster (though the speed-up might differ from the capabilities R&D speed-up). But there are some other consequences that were less obvious to me.

It seems useful to think about what discourse will look like at the point when engineering is greatly accelerated within AI companies (e.g. it's as though engineers are 3x faster), but isn't yet fully automated or even accelerated by more than 8x. I expect that at this point we'll see some people say something like: "Now we've already seen the so-called 'intelligence explosion', and it was fine". Here's a more detailed (but somewhat caricatured) version of this: "Now that we've seen AIs automate AI R&D and no one is even claiming that we're seeing explosive capabilities growth, the intelligence explosion has been disproven; compute bottlenecks really are decisive. (Or insert whichever bottleneck this person believes is most important.) The intelligence explosion must have been bullshit all along and look, we don't see any of these intelligence explosion proponents apologizing for being wrong, probably they're off inventing some new milestone of AI to fearmonger about."

Of course, this is wrong: many/most of the relevant people (e.g. me) weren't predicting massive acceleration at this point. The core prediction was that when AIs are capable enough to cheaply and quickly (e.g. 30x faster) automate research engineering, things will speed up greatly (e.g. 3x faster overall), and that when AIs can fully automate AI R&D, things will speed up massively.[9]

I expect this response partially because earlier levels of automation which "merely" effectively allow research engineers to operate 4x faster will probably look extremely impressive, and so it will be pattern matched to some even more impressive level of capability that people were speculating about. (Substantially this will be motivated reasoning by people who already strongly held some view.) This is similar to how many people now round off the word AGI to some much weaker level of capability than "can automate virtually all cognitive labor"[10] and then say "AGI has already been achieved and the situation is fine, we haven't seen [predicted consequence X]", though of course the relevant prediction was made with respect to a much higher level of capability.

Of course, this isn't to say there won't be updates we can make: it will be possible to get a better understanding of how AI R&D automation will go by looking at these earlier phases of automation (though transfer might be non-trivial, as there will likely be important differences). And I should note that obviously not all critics of the intelligence explosion will say something like this; I just expect that some people will say something like this and that something like this might end up being a common talking point ("we've already seen the intelligence explosion and it was fine") among skeptics with worse epistemics.

In parallel, I think some number of people (particularly within AI companies) will react by predicting that milestones like full automation of AI R&D are imminent (e.g. going to occur in months) when a more informed view would predict this isn't that likely in less than a year. This might occur by similar mechanisms: people pattern match the automation they see to the automation that actually does have a high chance of causing much faster AI progress. Correspondingly, I expect there will be a period which probably lasts for more than 1 year where engineering is greatly automated (but even engineering isn't yet fully automated) and people are repeatedly raising the alarm that superintelligence/full automation/etc will happen extremely soon and this keeps getting disproven. I do also think that strong concern for very soon (e.g., <2 years away) risk is warranted in these circumstances and it might be extremely hard to confidently rule out very high levels of risk from the next training run or over the next 6 months.

Presumably while engineering is being accelerated within AI companies this is also happening to some extent within other companies (possibly with a substantial delay). This could cause reasonably large employment or other economic effects, but I'm not sure if I expect this.

Beyond the discourse, we can hopefully use this period to better measure properties of AI R&D automation and make better predictions about how this will go in the future. It might still be difficult because there will be other exogenous shocks, but at the very least there might be more political will to run more expensive experiments to measure AI R&D acceleration.

It's possible that the current paradigm will fizzle out or otherwise hit an actual wall somewhat after greatly accelerating engineering. In this case, we might be in the regime where engineering is greatly accelerated, but AI progress isn't sped up greatly for a long time. (This could even happen before full AI R&D automation but after automation of research engineering which would result in an even more extreme point for things to slow/halt, but this does seem unlikely.)

Appendix: sanity check of engineering speed-up

Here's one very rough sanity check for the engineering speed-up in 1.3 years. Suppose that:

  • We expect 5 years until superhuman coder at the current rate of progress (as in, without AIs accelerating AI R&D or compute scaling slowing)
  • We think that superhuman coder is equivalent to a 60x speed-up on research engineers (because superhuman coders are 30x faster than humans and also have a quantity advantage)
  • We think the current engineering speed-up from AIs is (charitably) around 20%
  • We expect something roughly like a smooth exponential for engineering acceleration over time (prior to AIs greatly accelerating AI R&D).

Then, we can interpolate between the 20% speed-up we have now and the 60x speed-up we expect in 5 years to get an exponential curve: $1 + 0.2 \cdot (60/0.2)^{Y/5}$. If we plug in 1.3 years, we get out a 1.9x speed-up which is similar to the estimate I gave.
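
Spelling out that plug-in:

$$1 + 0.2 \cdot (60/0.2)^{1.3/5} = 1 + 0.2 \cdot 300^{0.26} \approx 1 + 0.2 \cdot 4.4 \approx 1.9$$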

However, this estimation strategy is very sensitive to the level of acceleration of engineering at the point of superhuman coder, which might mostly depend on how fast AIs can run. So, it doesn't seem like a great strategy for getting an estimate: ideally we'd instead have a guess at the speed-up to engineering at a point when humans are still a bottleneck so that we'd have more of a reason to expect smooth predictable progress between these points. (Once AIs can fully automate some domain which doesn't have other bottlenecks, acceleration within that domain depends on just how fast AIs run and how many you run. Thus, we might naively expect some bending of the curve for "acceleration for some domain" around the point when AIs can fully automate that domain.)

Appendix: How do speed-ups to engineering translate to overall AI progress speed-ups?

Engineering as a component of labor and labor itself both have diminishing returns. One simple way to model this is to say that $X$ times faster engineering work is as good as $X^{\alpha}$ times faster labor overall (as in, as though all employees operated that much faster) and $Y$ times faster labor results in $Y^{\beta}$ times faster AI R&D. This is equivalent to assuming a Cobb-Douglas production function. (In practice, if you just accelerated engineering, you'd presumably bottleneck on research scientist labor at some point, so the marginal returns would eventually get very low, but when considering moderate accelerations, this model should be reasonable.) What constants should we use? I think engineering is most of the AI R&D labor (I think engineering drives lots of progress, e.g. better tools for doing research) and I expect faster labor is somewhat more important than compute, so maybe charitably $\alpha = 0.7$, $\beta = 0.6$.

Sanity check: this implies that a 60x engineering speed-up (like what we might get for superhuman coder after taking into account greater quantity) yields a 5.5x speed-up to AI R&D which is similar to the 5x estimated by ai-2027; this is somewhat but not wildly more than I expect (I expect more like 3.5x for purely the engineering acceleration of superhuman coder, but closer to 5x overall due to also accelerating other types of labor some), so this seems reasonable for a charitably large estimate for what you'd get for a smaller speed-up to engineering.
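
Written out with the exponents combined:

$$\left(60^{0.7}\right)^{0.6} = 60^{0.42} \approx 5.5$$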

Thus, a 2x speed-up to engineering would yield a $(2^{0.7})^{0.6} \approx 1.3$x speed-up to AI R&D acceleration. A 4x speed-up would yield a 1.8x speed-up to AI R&D acceleration.

AI progress is also driven by scaling up training compute (and scaling up spending on data, though I'll neglect this in my analysis), so overall AI progress is likely less than 65% due to AI R&D (I'd guess a central estimate of more like 55%). So, a 1.8x speed-up to AI R&D acceleration only (charitably) yields a 1.5x speed-up to overall AI progress.

Overall, a 2x engineering speed-up would only (somewhat charitably) yield a 1.2x speed-up to overall AI progress while a 4x engineering speed-up would only (somewhat charitably) yield a 1.5x speed-up in overall AI progress.
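
Putting the appendix calculation in one place, here's a minimal sketch. The Cobb-Douglas step is exactly the formula above; the final step, converting the AI R&D speed-up into an overall-progress speed-up, is written as a simple weighted average using the charitable ~65% share, which is one natural reading of the discount described above rather than a formula stated explicitly here:

```python
ALPHA = 0.7      # engineering speed-up -> labor speed-up exponent (charitable)
BETA = 0.6       # labor speed-up -> AI R&D speed-up exponent (charitable)
RD_SHARE = 0.65  # charitable share of overall AI progress driven by AI R&D

def rd_speedup(engineering_speedup: float) -> float:
    """X times faster engineering ~ (X**ALPHA)**BETA times faster AI R&D."""
    return (engineering_speedup ** ALPHA) ** BETA

def overall_speedup(engineering_speedup: float) -> float:
    """Discount the AI R&D speed-up by the share of progress AI R&D drives (assumed weighted average)."""
    return RD_SHARE * rd_speedup(engineering_speedup) + (1 - RD_SHARE)

for eng in (1.2, 2.0, 4.0):
    print(f"{eng}x eng -> {rd_speedup(eng):.2f}x AI R&D -> {overall_speedup(eng):.2f}x overall")
# 1.2x eng -> ~1.08x AI R&D -> ~1.05x overall
# 2.0x eng -> ~1.34x AI R&D -> ~1.22x overall (rounded above to 1.3x and 1.2x)
# 4.0x eng -> ~1.79x AI R&D -> ~1.51x overall (rounded above to 1.8x and 1.5x)
```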


  1. And which don't have some other much stronger relevant capability. ↩︎

  2. I think speed-ups from AIs prior to superhuman coder probably accelerate timelines to superhuman coder by around 30%, though a bunch of this speed-up occurs right before superhuman coder. ↩︎

  3. Here's a more precise definition of "less than 8-hour reasonably self-contained task": If we randomly sample research engineers within the AI company and get them to complete the task (without any additional context beyond what they already have working for the company), the 20th percentile completion time is less than 8 hours (as in, 20% of these randomly selected research engineers successfully complete the task in less than 8 hours). ↩︎

  4. This isn't that dependent on the task suite, though it might be somewhat dependent on the methodology. I think you get pretty similar results if you apply the same methodology to other datasets of easily verifiable agentic SWE tasks. ↩︎

  5. There is some chance that AI assistance is actually slowing down engineering in AI companies right now. ↩︎

  6. This curve computes overall AI progress. You might object that you expect AI R&D acceleration to increase exponentially, but converting from AI R&D acceleration to overall AI progress will result in a different function than exponential. It turns out that doing everything in terms of AI R&D acceleration and then converting to overall AI progress is equivalent; if you do this conversion (using the constants in "Appendix: How do speed-ups to engineering translate to overall AI progress speed-ups?"), you get the exact same numbers. ↩︎

  7. This estimate isn't fully independent because the 5x AI R&D acceleration estimate for superhuman coder uses a similar estimation strategy as I use to convert from engineering acceleration to overall speed-ups. ↩︎

  8. If the overall AI progress acceleration at superhuman coder is 8x, then the superhuman coder milestone is moved earlier by around 0.5 years. You might think that this moves earlier large acceleration (e.g. 3.5x) substantially further because we've increased the superhuman coder acceleration, but this actually doesn't make much of a difference to when we see large acceleration because the change in acceleration over time is (super-)exponential (so you reach large acceleration soon before you reach superhuman coder). ↩︎

  9. And there are more details and caveats in these predictions, but these are the core predictions. Also, I do think that various other types of predictions are being (partially) falsified, e.g., predictions about a very abrupt/fast takeoff would look increasingly bad when/if this happens. ↩︎

  10. I don't think it's unreasonable in principle to define AGI in a way such that AGI has already been achieved, but in practice this is atypical usage of the term, particularly relative to how people used the term more than several years ago. I generally think it's better to avoid using the term AGI except as a short term to refer to some vaguely defined high level of capability (e.g. I use "AGI" at the start of this post because most readers will interpret that as a high level of capability without me needing to say something more complicated). ↩︎