The memorization-generalization spectrum and learning coefficients — LessWrong