Thomas Dullien — LessWrong

A Mechanistic Interpretability Analysis of Grokking

Thomas Dullien, 3y

Good stuff. A few thoughts:

1. Assuming a model has memorized the training data and still has enough "spare capacity" to play the lottery-ticket-hypothesis game of searching for generalizing solutions, you'll eventually end up with a number of partial solutions, each of which generalizes to a subset of the memorized data (obviously assuming some form of regularization towards simplicity). So this may be where the "underparametrized" regime of past ML went wrong: that approach tried to force the model into generalization without memorization, b...