LESSWRONG
is fundraising!
LW

It does not, sad to say. I tried space-separating each digit for the BPE issue, and its general completion is to just copy the previous line. The log probs of the possible completions are generally 50:50 for 0/1, showing it's not tapping into any parity counting.

Add Comment

[-]gwern5y160

One interesting update: we've been increasingly unlocking GPT-3 solutions by rewriting them as multi-step procedures. So parity might be doable by somewhat cheating and writing out a series of steps for computing the parity for each example: https://twitter.com/bucketofkets/status/1285100951271952384 https://twitter.com/Malcolm_Ocean/status/1285099206781341696

Rendering 1/2 comments, sorted by

top scoring

(show more) Click to highlight new comments since: Today at 4:56 AM

[-]Gurkenglas5y60

If you try this, reformat to work around the BPE problem as detailed in https://www.gwern.net/GPT-3#bpes

Moderation Log

19

[ Question ]

How well can the GPT architecture solve the parity task?

19

19

1 Answers sorted by top scoring

Jul 12, 2020*

1 Answers sorted by
top scoring