I agree, and I would add that today many people aren't thinking about what system a mind is embedded in, and are trying to define AGI based on elicited capabilities instead of what a mind could do in the right environment.
Human baseline comparison: When did humans first become able to design rockets to the moon?
Normal answer: the 1960s.
Contrarian answer 1: Sometime in the paleolithic, we just didn't have an environment that elicited it until the 20th century.
Contrarian answer 2: We still can't. There is no human you can put in a room with a scratchpad and the instructions "Design a rocket to the moon" and expect them to produce a working plan that, when executed independently by them, reliably achieves that goal.
I regularly see the equivalent of all three of these in discussions and definitions of AGI. I suspect there is at least one existing model that, 50 years from now, we would say unambiguously 'counts' as AGI by analogy to the second definition above, after a proof by demonstration of giving it the right environment. I would not be surprised if there are at some point ASIs (under the first or second definition) that wouldn't even count as AGI by analogy to the third definition above.
The reason I wanted to make this linkpost now rather than some other time is because discussions over AGI and whether or not LLMs are or aren't AGI are happening right now, and the point of the linkpost is that the term AGI is for our purposes useless at this point, because we are now in the fuzzy cloud now that AI can do real economic work.
Some choice paragraphs:
I think this was an underrated reason for why the term AGI lost its value, because as it turned out, certain definitions that do connect/correlate very well in humans are easier to disentangle than we thought for AIs (though humans are also jagged, but we don't realize because we are comparing against our own baselines).
On the usefulness of AGI 1-2 decades ago:
Now I should stop here to note that the gesturing was likely wrong on overestimating how good AIs would become once they got good at language, and there are a couple of reasons for this, but the big ones are that AIs could be capable without being nearly as sample-efficient as humans, either at pre-training or post-training, and this probably made them assume less jaggedness in AI than currently exists, and the other big one is assuming more neuralese/recurrence for AI than what currently exists, and as it turned out the direction AI would go in would deemphasize forward passes in favor of Chain-of-Thought, which improved capabilities and interpretability (this is demonstrated by the fact that No-CoT doubling times are a little over a year, compared to general doubling times on the order of 100 days), but critically only boosted AI capabilities in an interpretable manner.
Now onto this:
Yep, I think that once AIs could do real economic work, which I'd peg at November 2025, the term AGI lost all of it's value, and the question is what more specific terms are necessary in order to recapture the lost value.
Here's one use-case for more specific terms
Also valuable is that with more specific terms, you open yourself to a greater risk that your theory is falsified, which is good, but not how humans normally reason, unfortunately.
Another use-case that needs more specific terms:
(This is a good time to note that I have a disagreement with Helen Toner that I also don't think superintelligence is that far away in some domains because of inference scaling, and while inference scaling has it's limits because it requires verifiability that only exists for a relatively small portion of the current economy, in the domains where this does exist, it's conceptually easy to scale current systems to superintelligence in those domains).
And finally, the reason the linkpost exists at all:
This is a huge problem, but one that is kind of being worked on by Daniel Kokotajlo and others at the AI Futures Project, which used more specific terms and moved away from the term AGI in it's modelling (which I thank them for doing).
And yeah, AGI/superintelligence has at this point become far too vague, unfortunately to make it a useful term. It was useful 1-2 decades ago, but we should aim to stop relying on it now that we have more evidence.