veered's Shortform

veered

veered's Shortform

1 min read7th Apr 20232 comments

This is a special post for quick takes by veered. Only they can create top-level comments. Comments here also appear on the Quick Takes page and All Posts page.

New to LessWrong?

2 comments, sorted by

top scoring

Click to highlight new comments since: Today at 2:16 PM

[-]veered1y20

For GPT-style LLMs, is it possible to prove statements like the following?

Choose some tokens , $B$ and a fixed $L L M$ :

There does not exist a prefix of tokens $P$ such that $L L M (P + A) \to B$

More generally, is it possible to prove interesting universal statements? Sure, you can brute force it for LLMs with a finite context window but that's both infeasible and boring. And you can specifically construct contrived LLMs where this is possible but that's also boring.

I suspect that it's not possible/practical in general because the LLM can do arbitrary computation to predict the next token, but maybe I'm wrong.

[-]JBlack1y20

Yes, in general statements like this are theoretically possible to prove, but not remotely practical. There might be some specific (A,B,LLM) triples for which you can prove such a statement but I expect that none of these are generalizable to actually useful statements.

No GPT-style architecture is (in itself) capable of truly universal computation, but in practice functions they can implement are far beyond our ability to adequately analyze.

Moderation Log