LESSWRONG
LW

2464
Mohammad Bavarian
5020
Message
Dialogue
Subscribe

Posts

Sorted by New

Wikitag Contributions

Comments

Sorted by
Newest
No wikitag contributions to display.
ChatGPT (and now GPT4) is very easily distracted from its rules
Mohammad Bavarian2y62

Did you test Claude for it being less susceptible to this issue? Otherwise not sure where your comment actually comes from. Testing this, I saw similar or worse behavior by that model - albeit GPT4 also definitely has this issue

https://twitter.com/mobav0/status/1637349100772372480?s=20

Reply
We Are Conjecture, A New Alignment Research Startup
Mohammad Bavarian3y10

What do you mean by Scaling Hypothesis? Do you believe extremely large transformer models trained based on autoregressive loss will have superhuman capabilities?

Reply
No posts to display.