Does anyone have thoughts on Muse Spark that they've written up? Do we have any speculations on what its good at/it's size/whether it has good/bad post-training?
There are a lot of comments on hacker news and maybe some are useful: https://news.ycombinator.com/item?id=47692043
I'm excited to try it on the "recommend me a book" benchmark, since it is reportedly good at that, whereas other models haven't been. Haven't gotten around to that yet, though. I'd be interested to learn if this replicates for anyone!
The only things which I remember on LW is Brendan Long's comment along with a take by Rauno Arike and my expression of distrust and irritation.
A lot of people have been talking about OpenAI re-instating 4o because users want sycophancy.
While OpenAI did re-instate 4o for paid users, it seems like they are trying to prevent users from using it as much as possible.
To access 4o from a plus account, one needs to:
This seems like intentionally dissuasive UI to me.
That being said, if I have 4o enabled and then create a new chat, the next chat will also be with 4o. (I was hoping they'd do an Anthropic style thing and make GPT-5 the default on all new chats).