Understanding LLMs: Some basic observations about words, syntax, and discourse [w/ a conjecture about grokking] — LessWrong