Your examples of tasks that are hard to approximate - irreducibly complex calculations like checksums; computational tasks that inherently require a lot of time or memory - seem little more than speed bumps on the road to surpassing humanity. An AI can do such calculations the normal way if it really needs to carry them out; it can imitate the appearance of such calculations if it doesn't need to do so; and meanwhile it can use its power to develop superhuman problem-solving heuristics, to surpass us in all other areas...

Reply

[-]samhealy2y30

Agreed, largely.

To clarify, I'm not arguing that AI can't surpass humanity, only that there are certain tasks for which DNNs are the wrong tool and a non-AI approach is and possibly always will be preferred.

An AI can do such calculations the normal way if it really needs to carry them out

This is a recapitulation of my key claim: that any future asymptotically powerful A(G)I (and even some current ChatGPT + agent services) will have non-AI subsystems for tasks where precision or scalability is more easily obtained by non-AI means, and that there will probably always be some such tasks.

Reply

[-]Gurkenglas2y20

3.4 × 10^44

Where is your reductio getting these numbers?

Reply

[-]samhealy2y1-8

Plucked from thin air, to represent the (I think?) reasonably defensible claim that a neural net intended to predict/synthesise the next state (or short time series of states) of an operating system would need to be vastly larger and require vastly more training than even the most sophisticated LLM or diffusion model.

Reply

[-]samhealy2y*10

To clarify: I didn't just pick the figures entirely at random. They were based on the below real-world data points and handwavy guesses.

ChatGPT took 3.23 x 10^23 FPOPs to train
ChatGPT has a context window of 8K tokens
Each token is roughly equivalent to four 8-bit characters = 4 bytes, so the context window is roughly equivalent to 4 x 8192 = 32KB
The corresponding 'context window' for AIOS would need to be its entire 400MB+ input, a linear scaling factor of 1.25 x 10^4 from 32KB, but the increase in complexity is likely to be much faster than linear, say quadratic
AIOS needs to output as many of the 2 ^ (200 x 8 x 10 ^ 6) output states as apply in its (intentionally suspect) definition of 'reasonable circumstances'. This is a lot lot lot bigger than an LLM's output space
(3.23 x 10 ^ 23) x (input scaling factor of 1.56 x 10 ^ 8) x (output scaling factor of a lot lot lot) = conservatively, 3.4 x 10 ^ 44
Current (September 2023) estimate of global compute capacity is 3.98 x 10 ^ 21 FLOPS. So if every microprocessor on earth were devoted to training AIOS, it would take about 10 ^ 23 seconds = about 30000000000000000 years. Too long, I suspect.

I'm fully willing to have any of this, and the original post's argument, laughed out of court given sufficient evidence. I'm not particularly attached to it, but haven't yet been convinced it's wrong.

Reply

Moderation Log

LESSWRONG
LW

LESSWRONG
LW

-3

AIOS

-3

-3

Summary

Welcome to AIOS!

Epilogue