Grammars, subgrammars, and combinatorics of generalization in transformers — LessWrong