What Indicators Should We Watch to Disambiguate AGI Timelines?
(Cross-post from https://amistrongeryet.substack.com/p/are-we-on-the-brink-of-agi, lightly edited for LessWrong. The original has a lengthier introduction and a bit more explanation of jargon.)

No one seems to know whether transformational AGI is coming within a few short years. Or rather, everyone seems to know, but they all have conflicting opinions.

Have we entered into what will in hindsight be not even the early stages, but actually the middle stage, of the mad tumbling rush into singularity? Or are we just witnessing the exciting early period of a new technology, full of discovery and opportunity, akin to the boom years of the personal computer and the web?

AI is approaching elite skill at programming, possibly barreling into superhuman status at advanced mathematics, and only picking up speed. Or so the framing goes.

And yet, most of the reasons for skepticism are still present. We still evaluate AI only on neatly encapsulated, objective tasks, because those are the easiest to evaluate. (As Arvind Narayanan says, “The actually hard problems for AI are the things that don't tend to be measured by benchmarks”.) There’s been no obvious progress on long-term memory. o1 and o3, the primary source of the recent “we are so back” vibe, mostly don’t seem better than previous models at problems that don’t have black-and-white answers[1]. As Timothy Lee notes, “LLMs are much worse than humans at learning from experience”, “large language models struggle with long contexts”, and “[LLMs] can easily become fixated on spurious correlations in their training data”.

Perhaps most jarringly, LLMs still haven’t really done anything of major impact in the real world. There are good reasons for this – it takes time to find productive applications for a new technology, people are slow to take advantage, etc. – but still, it’s dissatisfying.

I recently attempted to enumerate the fundamental questions that lie underneath most disagreements about AI policy, and number one on the l