x

LESSWRONG

LW

Scaling Laws — LessWrong

Scaling Laws

Edited by riley, plex last updated 18th Jun 2023

Scaling Laws refer to the observed trend that the scaling behaviors of deep neural networks (i.e. how the evaluation metric of interest varies as one varies the amount of compute used for training (or inference), number of model parameters, training dataset size, model input size, or number of training steps) follows variants of power laws.

External links

"Broken Neural Scaling Laws" paper

Scaling laws graph from Scaling Laws for Neural Language Models

Add Posts

1

1

Posts tagged Scaling Laws

8

137"Can AI Scaling Continue Through 2030?", Epoch AI (yes)

2y

5

7

425chinchilla's wild implications

4y

129

4

185What will GPT-2030 look like?

3y

43

4

21/r/MLScaling: new subreddit for NN scaling research/discussion

6y

0

3

82Thoughts on the Alignment Implications of Scaling Language Models

5y

11

3

35My ML Scaling bibliography

5y

9

3

32Google's new text-to-image model - Parti, a demonstration of scaling benefits

4y

4

2

244How to game the METR plot

7mo

32

2

175o1: A Technical Primer

2y

19

2

111Paper: On measuring situational awareness in LLMs

Owain_Evans, Daniel Kokotajlo, Mikita Balesni, Tomek Korbak, Asa Cooper Stickland, Meg, Maximilian Kaufmann

3y

17

2

88Scaling Hypothesis #2: Are Humans Just More Over-Parameterized?

1mo

20

2

86Model Size Scaling in 2023-2031

1mo

17

2

68Superhuman Coders in AI 2027 - Not So Fast

dschwarz, FutureSearch

1y

0

2

63Ethan Caballero on Private Scaling Progress

Michaël Trazzi

4y

2

2

58Inverse Scaling Prize: Second Round Winners

Ian McKenzie, Sam Bowman, Ethan Perez

3y

17

Load More (15/82)

Add Posts