Physics of Language models (part 2.1) — LessWrong