The only public instance of this change being pointed out was a LessWrong comment by someone unaffiliated with Anthropic.
Nitpick: an outside reporter also noticed this on the day of the release and wrote up a story on it. It didn't seem to get much traction though.
I thought the "past chats" feature was a tool to look at previous chats, which only happens if the user asks for it, basically. (I.e., there wasn't a change to the system prompt). So I'm a bit surprised that it seemed to make a difference around sycophancy for you? But maybe I'm misunderstanding something.
Labor costs are much higher in the US, which I think plays into this. So it's easier in Europe to not be reliant on the credit card model.
Response to your thoughts after the yoda timer
Why are you so certain it's dangerous to try once even at the beginning? My guess is that it won't immediately be particularly compelling, but get more so over time as they have time to do RL on views or whatever they are trying to do.
But I also have a large error bar. This might, in the near future, be less compelling than either of us expect. It's genuinely difficult to make compelling products, and maybe Sora 2 isn't good enough for this.
I'm more concerned about Youtube Shorts to be honest, in the long term.
How does ARC-AGI's replication of the HRM result and ablations update you? [Link].
Basically, they claim that the HRM wasn't important; instead it was the training process behind it that had most of the effect.
A lot of people have been talking about OpenAI re-instating 4o because users want sycophancy.
While OpenAI did re-instate 4o for paid users, it seems like they are trying to prevent users from using it as much as possible.
To access 4o from a plus account, one needs to:
This seems like intentionally dissuasive UI to me.
That being said, if I have 4o enabled and then create a new chat, the next chat will also be with 4o. (I was hoping they'd do an Anthropic style thing and make GPT-5 the default on all new chats).
One point of information against the "journalists are completely misinterpreting the thing they're reporting on" view is that the one of the co-authors is Rocket Drew, who previously worked as a Research Manager at MATS.
But I'll definitely be interested to follow this space more.
One thing that strikes me as odd about this is that GPT-5's knowledge cutoff (September 2024) is much earlier than Grok (November 2024), Gemini 2.5 pro (January 2025), and Opus 4.1 (March 2025).[1]
I mean, I guess this is a scaling thing and a persona thing. But I'm a little confused
Though oddly, Claude's system card says the knowledge cutoff is the end of January 2025. Maybe February and March's training data aren't as complete as January and before.
Thanks for posting this! I found it quite interesting to read. A couple of questions off the top of my head:
No need to respond completely to this comment, but I hope these questions are useful!
Afaict, some of this is now in the Gemini app. But if not, feel free to ping me (I have access).