Neural networks generalize because of this one weird trick — LessWrong