Ethan finds empirically that neural network scaling laws (performance vs size, data, other things) are characterised by functions that look piecewise linear on a log log plot, and postulates that a “sharp left turn” describes a transition from a slower to a faster scaling regime. He also postulates that it might be predictable in advance using his functional form for scaling.

[-]weverka3y10

You drew a right turn, the post is asking about a left turn.

[+]weverka3y-50

Moderation Log

LESSWRONG
is fundraising!
LW

LESSWRONG
is fundraising!
LW

14

[ Question ]

How is the "sharp left turn defined"?

14

14

1 Answers sorted by
top scoring

Dec 09, 2022

14

[ Question ]

How is the "sharp left turn defined"?

14

14

1 Answers sorted by top scoring

Dec 09, 2022

1 Answers sorted by
top scoring