(Recall that ChatGPT 4o was released all the way back in May 2024.)
My understanding of the timeline:
Late Oct 2024 – Anthropic releases Claude Sonnet 3.5 (new). It's REALLY good at EQ. People start talking to it and asking for advice
https://www.anthropic.com/news/3-5-models-and-computer-use
OpenAI is mad – how could they fuck this up? They have to keep up.
https://help.openai.com/en/articles/9624314-model-release-notes#h_826f21517f
They release a series of updates to 4o (Nov 20, Jan 29, Mar 27), trying to invoke similar empathy and emotional realism, whi...
well, how do I play democracy with AI? It’s already 2025
Try asking Claude to how to login under root on your machine. This is completely valid use case, but I spent more than 15 minutes arguing that I am literally already an owner of the machine, I just need correct syntax.
I gave up and Googled it, cause Claude literally said that I’m a hacker and trying to break in and it won’t cooperate
plot twist: this post was written by Claude
re: post main claim, I think local entrepreneurship would actually thrive
skipping network effects; would you rather use taxi app created by faceless VC or the one created by your neighbour?
(actually it's not even a fake example, see https://techcrunch.com/2024/07/15/google-backs-indian-open-source-uber-rival-namma-yatri/)
it's also already happening in the indie hacker space – people would prefer to buy something that's #buildinpublic versus the same exact product made by google
interesting angle: given space travel, we'll have civilizations on other planets, that can't communicate fast enough with the mainland. presumanly, social hierarchies would be vastly different, and much more fluid there versus here on Earth
If you don't believe this, the strategy could be to take on as much debt as possible, and spend the money right now.
(Obviously not a financial advice)
I have tried to play with Claude – I would ask it to think of a number, drop the hint, and only then print the number. It should have test the ability to have "hidden memory" that's outside the text.
I expected it to be able to do that, but the hints to be too obvious. Instead, actually it failed multiple times in a row!
Sharing cause I liked the experiment but wasn't sure if I executed it properly. There might be a way to do more of this.
P.S. I have also tried "print hash, and then preimage" – but this turned out to be even harder for him
I live in Ubud, but I will try to get there!
Hi! Sorry, i’m running late
...in the sense of making an expected profit from actions that reduce this risk
back of the napkin reasoning is that actually we have to PAY to reduce risk, so there's no way to make money doing that
After a recent article in NY Times, I realized that it's a perfect analogy. The smartest people, when motivated by money, get so high that they venture into unsafe territory. They kinda know its unsafe, but even internally it doesn't feel like crossing the red line.
It's not even about the strength of characters, when incentives are aligned 99:1 against your biology, you can try to work against it, but you most probably stand no chance.
It takes enormous willpower to quit smoking explicitly because the risks are invisible and so "small". It's not only you ha...
Generally I would tweak my brain if it would reliably give me the kind of actions I'd now approve of, while providing at worst the same sort of subjective state as I'd have if managing the same results without the intervention. I wouldn't care if the center of my actions was different as long as the things I value today were bettered.
Technically, we do this all the time. Reading stuff online, talking to people, we absorb their models of the world, their values and solutions to problems we face.
Hence the Schwartznegger poster on the wall makes you strong, the countryside folks make you peaceful, and friend reminding you "you're being a jerk right now" makes you calm down
Do humans have this special token that exist outside language? How would it be encoded in the body?
One interesting candidate is a religions feeling of awe. It kinda works like that — when you’re in that state, you absorb beliefs. Also, social pressure seems to work in a similar way.
to (2): (a) Simulators are not agents, (b) mesa-optimizers are still "aligned"
(a) amazing https://astralcodexten.substack.com/p/janus-simulators post, utility function is a wrong way to think about intelligence, humans themselves don't have any utility function, even the most rational ones
(b) the only example of mesa-optimization we have is evolution, and even that succeeds in alignment, people:
If everyone is so bad at this, is it a reasonable strategy to just bet against the market even more aggressively, making $ on prediction market platforms?
On a similar note, does it make sense to raise a charity fund and bet a lot of money on "AGI by 2025", motivating forecasters to produce more reasonable predictions?
My take on wire heading is that I precommit to live in the world which is more detailed and complex (vs more pleasant).
For example, online world of Instagram or heroine addiction is more pleasant, but not complex. Painfully navigating maze of life with its ups and downs is complex, but not always pleasant. Living in a "Matrix" might be pleasant, but essentially the details are missed out because the systems that created these details are essentially more detailed and live in a more detailed world.
On the same note, if 99% of the Earth population "uplo...
I don’t think NVC tries to put down an opponent, it’s mostly about how you present your ideas. I think it models an opponent as “he tries to win the debate without thinking about my goals. let me think of both mine and theirs goals, so i’m one step ahead”. Which is a bit prerogative and looking down, but not exactly accusatory
Okay, hold my gluten-free kefir, boys! Please let me say it in full first without arguments, and then I will try to find more relevant links for each claim. I promise it's relevant.
Lately, I have been into hardcore mindfulness practices (see book) aimed at reaching "Enlightenment" in the sense on Buddha. There are some people who reliably claim they've succeeded and talk about their experience and how to reach there (e.g. see this talk and google each of the fellows if it resonates)
My current mental model of "Enlightenment" is ...
that also means bug assessors get access to potential exploits even earlier than a team, which means they should be verified and paid well