Adam_Barker — LessWrong

LESSWRONG
LW

Replying toRetrospective on ‘GPT-4 Predictions’ After the Release of GPT-4

Retrospective on ‘GPT-4 Predictions’ After the Release of GPT-4

Data seems to be a bottleneck, so we should expect the number of model parameters to run high to compensate.
Note, that a MMLU of 100% should be achievable using a model the same size as Megatron-Turing NLG, and a data only 2.1x more data than GPT-4, which should be achievable in the near term.

Replying toGPT-4 Predictions

Adam_Barker3y

GPT-4 Predictions

Here's an equation for the MMLA vs Loss plot:

A MMLA = 100% corresponds to a loss of 1.8304. Using the scaling laws, listed here, this can be reached using:

The GPT-4 dataset (4Gtokens) and a model 11x the size of Megatron-Turing NLG (6 trillion parameters). Compute time: 111 days on Eos.
GPT-4's 175B params with 18.5 trillion training tokens (4.6x the size of GPT-4's dataset). Compute time: 16 days on Eos, but getting that many tokens may be a problem.
Megatron-Turing NLG's 530B parameters, and 8.5 trillion tokens (2.1x the size of GPT-4's dataset). Compute time: 23 days on Eos. This is a much more reachable dataset.

The nx compute speed of Eos used for GPT-4 was 18.4 ExaFLOP/s.

Replying toAn Alien God

Adam_Barker7y

An Alien God

To strongman a counter to the fox-rabbit-toaster argument: when you make a toaster, you do make a part to prevent electricity from getting to the coils. In fact, you make several of them. First, you make a timer to shut the toaster off when it's done. You also make a fuse, an some over-heating shut-off switch. These prevent a run-away situation that would cause the toaster to burn down your house. So an advocate of a god who makes both rabbits and foxes could point out that foxes and rabbits make a self-regulating population control system, and that would follow the same sort of engineering logic as putting a fuse on a toaster.

Replying toAn Alien God

Adam_Barker7y

An Alien God

Calling evolution uninteligent is foolish egocentrism. This is a process that has produces a world of nanotech-based creatures of astronomical complexity, using the entire surface of the Earth as a giant quantum computer. The simplest element of biological creation--the behavior of a single protein--requires computation at or beyond the limit of humanity's computing capability. It's staggering to me that all the structured computation and creative outpouring of evolution gets called "uninteligent" despite producing creatures of complexity vastly beyond the capability of the human mind to comprehend. It is a mind, and an (super) intelligent one by virtue of this ability to create complex structures through astronomical volumes of structured computation, but it... (read more)