Aleksey Bykhun — LessWrong

founder at yolocode.xyz, helping artists sell art online

(Recall that ChatGPT 4o was released all the way back in May 2024.)

My understanding of the timeline:

Late Oct 2024 – Anthropic releases Claude Sonnet 3.5 (new). It's REALLY good at EQ. People start talking to it and asking for advice
https://www.anthropic.com/news/3-5-models-and-computer-use

OpenAI is mad – how could they fuck this up? They have to keep up.

https://help.openai.com/en/articles/9624314-model-release-notes#h_826f21517f

They release a series of updates to 4o (Nov 20, Jan 29, Mar 27), trying to invoke similar empathy and emotional realism, which culminates in Mar 2025 when they even had to dial it back down due to twitter complaints

Uncertain: ChatGPT can't match Sonnet in EQ, cause of the differences between RLHF and RLAIF.

However, it's "good enough" that people grow emotionally attached to 4o.

OpenAI makes most of the money on their b2c chatgpt.com – Anthropic doesn't care about b2c as much, they rake in API inference $$$ and claude.ai is like a 5th priority on their list, somewhere after training, alignment, enterprise sales, coding performance

well, how do I play democracy with AI? It’s already 2025

Try asking Claude to how to login under root on your machine. This is completely valid use case, but I spent more than 15 minutes arguing that I am literally already an owner of the machine, I just need correct syntax.

I gave up and Googled it, cause Claude literally said that I’m a hacker and trying to break in and it won’t cooperate

plot twist: this post was written by Claude

re: post main claim, I think local entrepreneurship would actually thrive

skipping network effects; would you rather use taxi app created by faceless VC or the one created by your neighbour?

(actually it's not even a fake example, see https://techcrunch.com/2024/07/15/google-backs-indian-open-source-uber-rival-namma-yatri/)

it's also already happening in the indie hacker space – people would prefer to buy something that's #buildinpublic versus the same exact product made by google

interesting angle: given space travel, we'll have civilizations on other planets, that can't communicate fast enough with the mainland. presumanly, social hierarchies would be vastly different, and much more fluid there versus here on Earth

If you don't believe this, the strategy could be to take on as much debt as possible, and spend the money right now.

(Obviously not a financial advice)

I have tried to play with Claude – I would ask it to think of a number, drop the hint, and only then print the number. It should have test the ability to have "hidden memory" that's outside the text.

I expected it to be able to do that, but the hints to be too obvious. Instead, actually it failed multiple times in a row!

Sharing cause I liked the experiment but wasn't sure if I executed it properly. There might be a way to do more of this.

P.S. I have also tried "print hash, and then preimage" – but this turned out to be even harder for him

I live in Ubud, but I will try to get there!

Hi! Sorry, i’m running late

LESSWRONG
LW

LESSWRONG
LW

Posts

Wikitag Contributions

Comments