LLMs Universally Learn a Feature Representing Token Frequency / Rarity — LessWrong