Niklas Todenhöfer
Comments

Against Almost Every Theory of Impact of Interpretability
Niklas Todenhöfer, 11mo

interp?

Which word is meant here: interpolation, interpretation, or something else?

Thank you 🙏 for elaborating.

A Longlist of Theories of Impact for Interpretability
Niklas Todenhöfer, 11mo

Less ambitious - The could be Goodharted, but it's expensive, and this shifts the inductive biases to favour aligned cognition


Hey 👋, is a word such as "tool" missing between "Less ambitious - The" and "[...] could be Goodharted,"?

Thank you 🙏 for clarifying what "The" refers to in that sentence.

A short introduction to machine learning
Niklas Todenhöfer, 1y

instead deep learning tends to generalise incredibly well to examples it hasn’t seen already. How and why it does so is, however, still poorly-understood.


In my opinion, generalisation is a very interesting point!

Are there any new insights into deep learning generalisation, along the lines of:

1) implicit regularisation through optimisation methods like stochastic gradient descent,
2) the double descent risk curve, where adding more parameters can reduce error again,
or
3) margin-based measures to predict generalisation gaps?

Or, asked more generally:
How can we ensure that this and similar articles are updated regularly?
