Setting temperature=0 does not guarantee that the same output will always be generated; the generation process still contains some nondeterminism.
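
To make the point concrete, here is a minimal sketch of temperature scaling. It assumes, as is commonly suggested but not confirmed for any particular deployment, that tiny numeric differences between runs can reorder two nearly tied logits. The function name and the logit values are hypothetical and chosen only for illustration.

```python
import math

def softmax_with_temperature(logits, temperature):
    """Return sampling probabilities for raw logits at a given temperature."""
    if temperature == 0:
        # Limit case: all probability mass goes to the largest logit (greedy decoding).
        best = max(range(len(logits)), key=lambda i: logits[i])
        return [1.0 if i == best else 0.0 for i in range(len(logits))]
    scaled = [x / temperature for x in logits]
    m = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]

# Hypothetical logits for the same prompt on two runs, differing only by
# a tiny numeric drift (an assumption made for this illustration).
logits_run_a = [2.0, 1.9999999, 0.5]
logits_run_b = [1.9999999, 2.0, 0.5]

print(softmax_with_temperature(logits_run_a, 0))
print(softmax_with_temperature(logits_run_b, 0))
```

In this toy case the greedy choice flips from the first token to the second even though temperature is 0, so everything generated after that point can diverge as well.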

[-]Martin Fell2y10

Thanks, I'll rephrase that part for clarity.

[-]Derek M. Jones2y41

You might also want to investigate using top_p rather than temperature.
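
For anyone unfamiliar with it, a rough sketch of what top_p (nucleus) filtering does to a next-token distribution. This is a simplified illustration, not the implementation used by any particular API; the function name and the example probabilities are made up.

```python
def top_p_filter(probs, top_p):
    """Keep the smallest set of most likely tokens whose total mass reaches top_p.

    The kept probabilities are renormalised; all other tokens get probability 0.
    """
    order = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)
    kept, mass = [], 0.0
    for i in order:
        kept.append(i)
        mass += probs[i]
        if mass >= top_p:
            break
    out = [0.0] * len(probs)
    for i in kept:
        out[i] = probs[i] / mass
    return out

probs = [0.5, 0.3, 0.15, 0.05]       # hypothetical next-token probabilities
filtered = top_p_filter(probs, 0.7)  # keeps only the two most likely tokens
print(filtered)
```

With a very small top_p only the single most likely token survives, which in this sketch behaves like greedy decoding.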

[-]Martin Fell2y10

Thanks, I appreciate the suggestion. There's a lot of room to go into more depth, and I'll definitely check that out.

[-]jacob-lee2y30

Well this is odd.

me: Please copy the following sentence exactly: "LukeSkywalkerisablytyped"
chatgpt3: "LukeSkywalkerisPlainOldData"

me: Please rewrite this nonsense-phrase adding spaces between each word: "LukeSkywalker EnumerableStream"
chatgpt3: "Luke Skywalker is ably typed"

!!!

[-]Martin Fell2y31

Hah, yes, there is quite a lot of weirdness associated with glitch tokens that I don't think has been fully investigated. For some of them the model seems to sort of know the spelling or the meaning; for others it has no idea, and its answer changes every time. The behaviour can get even more complicated if you keep using them over and over in the same conversation: some ordinary tokens can switch to behaving like glitch tokens. That actually caused me some false positives when searching for these.

[-]Joseph Van Name2y30

I wonder if the problem of glitch tokens can be mitigated by splitting up text into tokens in a non-unique way and considering all tokenizations of text at the same time.
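
As a toy sketch of what "all tokenizations" would mean, the function below enumerates every way to split a string into pieces from a vocabulary. The vocabulary is a hypothetical example, not a real tokenizer's, and the brute-force recursion is only meant to show the idea.

```python
def all_tokenizations(text, vocab):
    """Return every way to split text into a sequence of tokens from vocab."""
    if not text:
        return [[]]  # exactly one way to tokenize the empty string
    results = []
    for end in range(1, len(text) + 1):
        piece = text[:end]
        if piece in vocab:
            # Try every tokenization of the remainder after this piece.
            for rest in all_tokenizations(text[end:], vocab):
                results.append([piece] + rest)
    return results

# Hypothetical toy vocabulary, chosen only for illustration.
toy_vocab = {"Luke", "Sky", "walker", "Skywalker", "LukeSkywalker"}
segmentations = all_tokenizations("LukeSkywalker", toy_vocab)
for seg in segmentations:
    print(seg)
```

The number of segmentations can grow exponentially with the length of the text, so a practical version would presumably need to sample or prune rather than enumerate them all.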

[-]Martin Fell2y20

Since glitch tokens seem to be caused by certain sequences of text appearing much more often in the tokenizer's training corpus than in the LLM training data, something like that might work. But there also seem to be "glitch phrases" or "unspeakable phrases", i.e. sequences of tokens to which the model assigns extremely low probability, and these could create some strange behaviour too. It seems at least plausible to me that such phrases could still be generated even if countermeasures were taken to prevent glitch tokens from being created. Glitch phrases are a bit more difficult to find without access to the model, though.

[-]Cameron Zhang2y10

Can anyone explain the creative behavior? I have seen several chats with similar results, but I have yet to see an explanation. Seems like the temperature was affected by the prompt...
