This blog is a follow-up to a previous one where I used structural linguistics and Landauer heat to estimate how many Floating Point Operations per second (FLOPS) the human brain is capable of. Using this estimate as a benchmark for AI timelines, that is, to identify the point at which AI will be capable of general human level intellectual tasks, becomes trivial if we extrapolate from a few hardware trends, so I’ll present the resulting timelines here.
First, however, I need to amend some errors I made in the first post regarding my calculations based on Landauer heat. When converting logical operations to FLOPS, I took the ratio of logical operations to FLOPS found in a datacenter GPU as the same ratio we should expect in the human body (200,000 to 1), but this might not be the best approach. Human brains are probably optimized to use as few logical operations as possible to achieve intellectual work in a way current-gen GPUs probably aren’t, simply due to evolutionary pressure on animals to do the most work for the least energy cost. Let’s instead figure out the smallest number of logical operations required to perform a floating point operation.
There are 32 bits in a floating point number, and to add two floating point numbers would therefore require a circuit of one half adder and 31 full adders[1], which, if we take an XOR gate as equivalent to 5 AND gates[2] in terms of Landauer heat, comes to 409 logical operations per FLOP. I also previously overcomplicated things by attempting to derive the Landauer heat output from brain temperature differentials; I should have instead begun from estimates of how many watts the human brain uses. To amend this, I'll use the logical operation counts estimated in the literature from Landauer limit based calculations: 2*10^22[3] and 1*10^23[4] logical operations per second.
Therefore, after dividing the logical operations by 409, our new Landauer heat based estimate of human compute would be 5*10^19 or 2*10^20 FLOPS.
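The adder-circuit arithmetic and the resulting FLOPS conversion can be reproduced in a few lines. This is a sketch of the calculation above; the gate counts (one half adder, 31 full adders, an XOR weighted as 5 AND gates) follow the cited sources, while weighting AND and OR gates as 1 operation each is an assumption of this sketch.

```python
# Landauer-weighted gate counts for a 32-bit ripple-carry adder.
# Assumption: AND and OR each count as 1 logical operation; XOR counts as 5.
XOR, AND, OR = 5, 1, 1

half_adder = 1 * XOR + 1 * AND            # sum + carry-out gates: 6
full_adder = 2 * XOR + 2 * AND + 1 * OR   # standard full-adder circuit: 13

ops_per_flop = half_adder + 31 * full_adder
print(ops_per_flop)  # 409

# Literature estimates of the brain's logical operations per second,
# divided through to get the FLOPS estimates used in the text.
for logical_ops_per_sec in (2e22, 1e23):
    print(f"{logical_ops_per_sec / ops_per_flop:.1e} FLOPS")
```

Dividing 2\*10^22 and 1\*10^23 by 409 yields roughly 4.9\*10^19 and 2.4\*10^20, matching the rounded figures above.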
Compare this to our structural linguistics based estimate of 7 * 10^19 FLOPS. Both the Landauer and the structural linguistics based estimates should be taken as guesses at the upper bound of human compute: the Landauer estimate because heat in the brain can be produced by sources other than compute, including inefficiencies in transforming power into compute; and the structural linguistics based estimate because there are potentially compute saving techniques one could use by compartmentalizing the meaning of certain signs away from the meanings of others, say, having several semiotic fields instead of one big one.
This means that once the computational capacity of computers equals or exceeds this range, we can be somewhat confident that the hardware problem of achieving human level AI will be more or less solved. Applying these estimates of human compute to the empirical record of cutting edge supercomputers,[5] we’ll see that the trendline for supercomputer FLOPS will surpass the first Landauer based estimate and the structural linguistics estimate in 2031, with the larger Landauer based estimate being hit in 2033. See the graph below.
This gives us a very neat window for when the most advanced computers will have the technical capability to equal or surpass human intelligence. About 5-7 years from now.
But top supercomputers are a big step up from the hardware that’s used for commercial machine learning training today. In fact, such ordinary hardware is about 4 orders of magnitude behind these top supercomputers in FLOPS.
Therefore, if we extrapolate the exponential curve in the graph above outwards, it would take until roughly 2077 for commercial machine learning hardware to achieve human level AI, a timeline of about 50 years. The middle decades of the 21st century, if these forecasts are accurate, would be a very peculiar place: a sputtering and halting arrival of a truly universal machine.
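This extrapolation can be sketched as a simple exponential crossing calculation. The ~22%/year growth rate here is an assumption chosen to be consistent with the post's 2031 and 2077 figures, not a measured trend, and the function name is mine.

```python
import math

# Human-compute threshold (structural linguistics estimate), in FLOPS.
THRESHOLD = 7e19
# Assumed multiplicative hardware growth per year (an assumption of this
# sketch, picked to reproduce the post's 2031/2077 crossing dates).
ANNUAL_GROWTH = 1.22

def crossing_year(flops_now: float, year_now: int) -> int:
    """Year an exponential trend starting at flops_now meets the threshold."""
    years = math.log10(THRESHOLD / flops_now) / math.log10(ANNUAL_GROWTH)
    return year_now + round(years)

# Commercial ML hardware sits ~4 orders of magnitude behind the
# supercomputers that reach the threshold around 2031.
print(crossing_year(7e19 / 1e4, 2031))  # → 2077
```

Closing a 4 order of magnitude gap at ~22% annual growth takes about 46 years, which is where the ~2077 figure comes from.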
This would still be contingent on software that makes this possible. Simply scaling up current LLMs will not work for creating an intelligence which is fully substitutable for humans, regardless of what sort of real world data is included in training or how much. An LLM is essentially a static semiotic field being queried, meaning that it only knows the meanings of all the signs it knows based on the one snapshot of meanings given in its training data. The sort of in-context learning LLM based AIs can do now is necessarily based on extrapolating the meaning of unknown signs via known signs. When linguistic evolution happens, the meaning of all signs begins to shift, some more than others of course, but each linguistic shift will have some impact on the other signs. You might think a language is still generally recognizable even decades or centuries apart, so what’s the big deal? Well, linguistic evolution doesn’t just occur on the macro scale, but also on the micro. When you’re writing a book, whether fiction or non-fiction, unless it is a purely superficial gloss, you are deliberately shaping the meaning of the signs you employ in order to get across something which could not be expressed in a mere sentence of conversation. In a fictional story, things like themes, characters, symbolism, and foreshadowing have their meaning formed through developing new sets of correlations and anti-correlations among the set of signs which make up the story, a warping effect which twists the whole semiotic field of the text as it goes on, with the severity of the warping in direct proportion to just how original the story is. In essence, there is a semantic drift which occurs the more that humans produce signs engaging with open ended problems and situations, and this will be an issue as much for a company’s internal communications as for a prospective AI fantasy writer.
Continual learning will be necessary to fix these issues, though just how it’s applied will likely be an extensive engineering challenge. It will also bring AI much closer to something we might call “conscious,” as I’ve discussed previously. And if that wasn’t enough, it will also radically change the economics and operations of AI, removing the last remaining economies of scale, since what was once one model will be split into multitudes, implying higher costs in compute, energy and cooling. Current AI frontier models have seen exponentially rising compute costs as it is; note the log scale on this graph from Epoch AI.[7]
You should not expect that the LLM era will be a good guide to what comes when the era of the artificial universal machine begins. But you also should not expect the sort of AI takeoff which many in the industry are betting on. As another Epoch AI report shows, progress in Moore’s Law will likely not continue after the mid 2030s due to hitting the limits of atom scale manufacturing.[8] It’s my personal belief that human evolution has probably pushed us as organisms very close to what is feasible in terms of intelligence, and that when we move to the general case of the universal machine, we will not see AI begin to exponentially self-improve once it reaches our level, but rather plateau.
The logic of this plateau is somewhat intrinsic to the logic of intelligence qua a system of signs. Let’s return to the equation set out in the last blog which laid out the foundation for the structural linguistic estimate of human compute:
Computation per sign input = ((n*(n-1))/2)*((n*(n-1))/2) or ((n*(n-1))/2)^2 where n is the number of signs in the semiotic field.
This equation was based on the structural linguistics notion that the meaning of one sign, as both signifier and signified, affects the meaning of all other signs through its correlations and anti-correlations, which gives us the term (n*(n-1))/2, the formula for the number of unique pairwise connections among the items of a set.[9] This term is squared to express the dynamism of this semiotic field, which is constantly shifting with every use, equivalent to training in artificial neural networks.
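A minimal sketch of the equation, purely illustrative: the function names are mine, and the identification of one "sign input" with one pass over the squared pair count follows the post's formula.

```python
def unique_pairs(n: int) -> int:
    """Unique pairwise connections among n signs: n*(n-1)/2."""
    return n * (n - 1) // 2

def computation_per_sign_input(n: int) -> int:
    """Cost of querying and updating the field, per the post's equation."""
    return unique_pairs(n) ** 2

print(unique_pairs(10))                # 45
print(computation_per_sign_input(10))  # 2025

# Doubling the field size multiplies the cost roughly 16x for large n,
# since the pair count squared grows like n^4 / 4.
print(computation_per_sign_input(20) / computation_per_sign_input(10))
```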
This equation has a few peculiar properties. As the size of the semiotic field increases, the computational cost of processing signs increases exponentially. If we accept that intelligence is equivalent to the number of signs one can create and hold in memory, then a linear increase in intelligence would require an exponential increase in computation. I believe there is some empirical evidence for this principle. Epoch AI, for example, indicates that there has been a linear advancement in AI the past 3 years in its index of AI benchmarks[10], but there has been an exponential increase in compute and compute costs.[11] Indeed, linear progress with many benchmarks has required an exponential increase in computation (note closely which axes are using log scales in scaling law charts).[12] The FLOPs used to train cutting edge models have gone up by two orders of magnitude since 2023, and certainly there have been big improvements, but are models really 10,000% as intelligent as they were in 2023? Probably not. But I would buy that they’re about 200% as intelligent as they were in 2023, which is roughly the difference in the size of the semiotic field implied by those changes in compute. This would suggest, for example, that even if recursive self-improvement were unlocked, it would lead to linear, rather than exponential, growth in capabilities. And in nature, almost every exponential curve you encounter is really a future sigmoid.
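The implied change in field size can be estimated by inverting the cost equation. This is a rough sketch under the reading that cost scales as the fourth power of n for large fields; the function name is mine.

```python
# If computation per sign input grows like (n*(n-1)/2)^2 ~ n^4 / 4 for
# large n, a given multiplier in compute implies only a fourth-root
# multiplier in the size of the semiotic field.
def implied_field_ratio(compute_ratio: float) -> float:
    return compute_ratio ** 0.25

# Two orders of magnitude more training compute since 2023:
print(implied_field_ratio(100))  # ~3.16
```

A 100x increase in compute implies the field grows by only a factor of ~3, the same modest scale as the ~200% capability improvement suggested above, rather than anything like a 100x jump.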
If Epoch AI is correct that Moore’s Law bottoms out in the mid 2030s, then we’ll be left in a situation where human level AI may exist but is extremely difficult to commercialize. The next century of technological development will be a story of overcoming the engineering challenges of bringing 2030s supercomputer level compute to consumers, even as supercomputer development itself slows to a crawl. Traditional LLM development without continual learning, even as the semiotic field expands beyond human size, will undoubtedly create qualitatively new capabilities, but will be unable to actually produce a universal machine capable of automating all human labor. Such traditional LLMs have the luxury of leveraging FLOPs rather than FLOPS, trading space for time in their training, but as the CEO of Anthropic has acknowledged, there will be decreasing returns to scale for this, and indeed for intelligence in general.[13]
Nature appears to be conspiring to trap artificial intelligence at a level similar to human intelligence, most likely, in my opinion, due to the heroic efforts billions of years of evolution have made to get humans this far. Once you surpass human level intelligence, Landauer heat starts to become a serious issue: if AI hardware still has that 200,000 to 1 ratio of logical operations to FLOPS at that point, the Landauer related power for human level AI will be close to 30,000 watts (three times that of Watt’s 1778 steam engine!). Even if something closer to the minimum of 409 to 1 were achieved, Landauer heat would grow at the same rate as FLOPS, meaning the exponential growth required to achieve linear growth in intelligence will entail exponential production of heat, and with it cooling costs. Attempts to overcome this problem with reversible computing, I suspect, will prove to be temporary fixes, due to the fundamentals of the second law of thermodynamics and the logical necessity of preventing Maxwell’s demon from being a free energy machine.
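The ~30,000 watt figure can be checked directly against the Landauer limit. A sketch assuming operation at roughly room temperature (300 K) and the ~5*10^19 FLOPS human compute estimate from earlier; the function name is mine.

```python
import math

K_B = 1.380649e-23  # Boltzmann constant, J/K

def landauer_power(flops: float, ops_per_flop: float,
                   temp_k: float = 300.0) -> float:
    """Minimum dissipation in watts, at k*T*ln(2) joules per logical op."""
    return flops * ops_per_flop * K_B * temp_k * math.log(2)

# GPU-like ratio of 200,000 logical operations per FLOP:
print(landauer_power(5e19, 200_000))  # ~2.9e4 W, i.e. close to 30,000 watts

# The minimal 409:1 adder-circuit ratio:
print(landauer_power(5e19, 409))      # ~59 W
```

Notably, at the minimal 409:1 ratio the thermodynamic floor is on the order of tens of watts, within range of the roughly 20 watts the human brain consumes, which is consistent with the premise that evolution has pushed the brain close to these limits.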
All of which isn’t to say that it will be impossible to build machines smarter than any human. But I would offer that the existence of such machines would not necessarily entail the economic or other obsolescence of the human machine, even according to the cold calculating logic of capitalism. Any machine capability which qualitatively exceeds a human’s natural ability can be used as a tool by a human, assuming the supply chains and habitat which support human existence are still around; the costs of reproducing humans are in some ways sunk costs, and the active biological subsistence costs are exceedingly low, only a few dollars a day, presumably even cheaper in the future. Compare this to artificial universal machines, which need an active demand signal for fixed capital investment, and necessarily have exponentially higher costs for marginally higher intelligence.
These dynamics suggest what is perhaps an underrated scenario in AI debates: a near simultaneous achievement of artificial general intelligence and the popping of the AI bubble as the real costs associated with AGI are understood. Already the stock market is beginning to balk at the capital expenditures required to continue this exponential increase in compute. There is no machine god waiting on the other side of AGI, no magic fix to every human problem, certainly no exponential growth hack. But there is yet the startling possibility of real non-human intelligence and even consciousness. And there is a real precipice we are approaching, in many ways totally unprepared: the possibility of someone actually “being home” inside the computer.
If it is true that much of human innovation and general labor capability comes from micro linguistic evolution, then it would be a practical impossibility for universal machines to be useful when used the way contemporary LLMs are used, prompted out of the ether to solve random problems. The universal machine would need to be embedded in long term human contexts in real time in order to move with this linguistic evolution and maximally contribute to work based upon it. This would also mean that machines working in different contexts may not be easy substitutes for each other, much as humans are not without extensive training. Capital will naturally treat these universal machines as disposable as any other machine, or indeed employee. The real potential of these machines will be limited under capitalism, where irrational investment cycles and rent seeking will prevent the full integration of such machines into human social reproduction in a positive way. In certain periods of development they will almost certainly lead to immiseration of the working class, and they will find key uses as tools of social control by the ruling class and the state apparatus.
All of which is to say, yes we are on the precipice, although not the precipice everyone seems to think we're approaching.
An earlier version of this post appeared on my blog, Pre-History of an Encounter.
“Binary Additions Using Logic Gates.” 2018. 101 Computing. January 4, 2018. https://www.101computing.net/binary-additions-using-logic-gates/. ↩︎
GeeksforGeeks. 2024. “Implementation of XOR Gate from NAND Gate.” GeeksforGeeks. March 11, 2024. https://www.geeksforgeeks.org/digital-logic/implementation-of-xor-gate-from-nand-gate/. ↩︎
Sandberg, Anders. "Energetics of the brain and AI." arXiv preprint arXiv:1602.04019 (2016). ↩︎
de Castro, A. The Thermodynamic Cost of Fast Thought. Minds & Machines 23, 473–487 (2013). https://doi.org/10.1007/s11023-013-9302-x ↩︎
Dongarra et al. (2025) – with major processing by Our World in Data. “Computational capacity of the fastest supercomputers” [dataset]. Dongarra et al., “TOP500 Supercomputer Lists (1993-2025)” [original data]. Retrieved February 23, 2026 from https://archive.ourworldindata.org/20260202-130411/grapher/supercomputer-power-flops.html (archived on February 2, 2026). ↩︎
Epoch AI, ‘Data on Machine Learning Hardware’. Published online at epoch.ai. Retrieved from ‘https://epoch.ai/data/machine-learning-hardware’ [online resource]. Accessed 23 Feb 2026. ↩︎
Epoch AI, ‘Data on AI Models’. Published online at epoch.ai. Retrieved from ‘https://epoch.ai/data/ai-models’ [online resource]. Accessed 23 Feb 2026. ↩︎
Marius Hobbhahn and Tamay Besiroglu (2022), "Predicting GPU performance". Published online at epoch.ai. Retrieved from: 'https://epoch.ai/blog/predicting-gpu-performance' [online resource] ↩︎
One other assumption of this approach is that the individual sign function involved on the micro level in both querying and evolving the semiotic field is close in the level of compute to a floating point operation, essentially checking or updating the pointer from one sign to another. ↩︎
Epoch AI, ‘AI Benchmarking Hub’. Published online at epoch.ai. Retrieved from ‘https://epoch.ai/benchmarks’ [online resource]. Accessed 23 Feb 2026. ↩︎
Epoch AI, ‘Data on AI Models’. Published online at epoch.ai. Retrieved from ‘https://epoch.ai/data/ai-models’ [online resource]. Accessed 23 Feb 2026. ↩︎
David Owen. ‘How predictable is language model benchmark performance?’. ArXiv [cs.LG], 2024. arXiv. https://arxiv.org/abs/2401.04757. ↩︎
Dwarkesh Patel. 2026. “Dario Amodei — ‘We Are near the End of the Exponential.’” YouTube. February 13, 2026. https://www.youtube.com/watch?v=n1E9IZfvGMA. ↩︎