x
Do Transformers Habituate? Investigating Repetition Suppression in Language Models — LessWrong