GPT-3 + GAN

stick109

GPT-3 + GAN

1 min read17th Oct 20202 answers No comments

4

I am wondering if anyone tried to combine GPT-3 with GAN, basically trying to train network that would feed GPT-3 questions and then judge responses as correct/incorrect, thus providing GPT-3 opportunity to improve.

Does my question even make sense or I am far off base?

New to LessWrong?

New Answer

New Comment

3 Answers sorted by
top scoring

Alexander Mathiasen

Sep 22, 2021

This would require you to sample from GPT during training. If you want a sentence with 500 words you need to evaluate GPT 500 times. As a result, it would slow down training 500 times. The clever thing with GPT (and other autoregressive models) is that they circumvent sampling during training!

Aiyen

Oct 19, 2020

That might work. Maybe have the adversarial network try to distinguish GPT-3 text from human text? That said, GPT-3 is already trying to predict humanlike text continuations, so there's a decent chance that having a separate GAN layer wouldn't help. It's probably worth doing the experiment though; traditional GANs work by improving the discriminator as well as the desired categorizer, so there's a chance it could work here too.

1 comment, sorted by

top scoring

Click to highlight new comments since: Today at 9:58 PM

[-]Pattern3y20

You might find this interesting:

https://www.gwern.net/GPT-2-preference-learning#bradley-terry-preference-learning

Moderation Log

LESSWRONG
LW

[ Question ]

GPT-3 + GAN

4

New to LessWrong?

3 Answers sorted by
top scoring

Sep 22, 2021

Oct 19, 2020

[ Question ]

GPT-3 + GAN

4

New to LessWrong?

3 Answers sorted by top scoring

Sep 22, 2021

Oct 19, 2020

3 Answers sorted by
top scoring