brockmanmatt

Posts

Sorted by New

Comments

Sufficiently Advanced Language Models Can Do Reinforcement Learning

Ah, sorry, I forgot to add a link to how to evolve the labels. There's a couple different methods in http://gptprompts.wikidot.com/context-stuffing if that helps.

$1000 bounty for OpenAI to show whether GPT3 was "deliberately" pretending to be stupider than it is

I don't think it's a BPE issue but not sure. I'd guess it's closer to the parity issue. It has a hard time implicitly counting in general.

edit: thanks, i know how to link now.

$1000 bounty for OpenAI to show whether GPT3 was "deliberately" pretending to be stupider than it is

It seems to just do really bad with parentheses on their own. It can fix them with like... f(f(f(x))) but not '((())' type situations (I'm just using the beta).

Code: https://gist.github.com/brockmanmatt/aea4fc4a962188f85d83db761bf0ac50