Do you think that base LLMs implement forms of "meta" algorithms like TF-IDF to predict the next token better? 

Intuitively, that would be a smart strategy: a text tends to be coherent in both style and vocabulary, so it would be very useful for an LLM to internally implement NLP algorithms of this kind, which would help a great deal in guessing the next words.
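For reference, the statistic in question is simple to state. This is a toy Python sketch of TF-IDF scoring (the corpus and names are illustrative only, not something an LLM literally runs internally): words that recur in the current document but are rare across documents get high weight, which is exactly the kind of "this document's vocabulary" signal that would help predict upcoming tokens.

```python
import math
from collections import Counter

# Toy corpus: three tiny "documents", each a list of tokens.
corpus = [
    "the cat sat on the mat".split(),
    "the dog chased the cat".split(),
    "stock prices rose as markets rallied".split(),
]

def tf_idf(doc, corpus):
    """TF-IDF weight for each term in `doc`, relative to `corpus`."""
    n_docs = len(corpus)
    tf = Counter(doc)
    weights = {}
    for term, count in tf.items():
        df = sum(1 for d in corpus if term in d)   # document frequency
        idf = math.log(n_docs / df)                 # rarer across docs -> higher
        weights[term] = (count / len(doc)) * idf    # frequent in doc -> higher
    return weights

weights = tf_idf(corpus[1], corpus)
# "the" appears in most documents, so its IDF drags its weight down;
# a document-specific word like "dog" scores higher despite lower count.
print(weights["dog"] > weights["the"])  # True
```

A model that tracked something like these weights for the current context would know which content words are "live" in this document and upweight them as continuation candidates.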

I'm interested in this question because if there's any internally implemented "meta" algorithm whose efficiency depends on the training setup the LLM is in, that increases the chances that situational awareness would arise.


Charlie Steiner

Sep 21, 2023

Yes, absolutely, but I don't expect algorithms to be implemented in separable chunks the way a human would do it. Comparing frequencies of various words just needs an early attention head with broad attention. But such an attention head will also be recruited to do other things, not just faithfully pass on the sum of its inputs, and so you'd never literally find TF-IDF.
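The "early attention head with broad attention" point can be made concrete with a toy sketch (hypothetical numbers, not an actual trained head): if a head attends roughly uniformly over the context and its value vectors are one-hot token indicators, its output is just the empirical relative frequency of each token in the context, i.e. a bag-of-words statistic computed incidentally inside the transformer.

```python
import numpy as np

# Hypothetical vocabulary and context.
vocab = ["the", "cat", "sat", "on", "mat"]
tokens = ["the", "cat", "sat", "on", "the", "mat"]

# One-hot "value" vectors: each token contributes an indicator of itself.
V = np.eye(len(vocab))
values = np.stack([V[vocab.index(t)] for t in tokens])  # shape (seq, vocab)

# Broad attention = roughly uniform weights over the whole context.
attn = np.full(len(tokens), 1.0 / len(tokens))

# The head's output is then the relative frequency of each vocab entry.
head_output = attn @ values
print(head_output)  # "the" occurred 2/6 of the time, the others 1/6
```

In a real model, of course, the same head is simultaneously doing other work and its values are dense learned vectors, so this frequency signal exists only entangled with everything else, which is why you'd never find a cleanly separable TF-IDF module.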