I find that a lot of people have trouble with this concept of predicting the next token. And by trouble, I mean that they struggle to understand what it actually means to predict the next token. It seems simpler than it is. Because when you say "predict the next token,"...
There has been a lot of talk about "p(doom)" over the last few years. This has always rubbed me the wrong way because "p(doom)" didn't feel like it mapped to any specific belief in my head. In private conversations I'd sometimes give my p(doom) as 12%, with the caveat that...
"If Anyone Builds It, Everyone Dies" by Eliezer Yudkowsky and Nate Soares (hereafter referred to as "Everyone Builds It" or "IABIED" because I resent Nate's gambit to get me to repeat the title thesis) is an interesting book. One reason it's interesting is timing: It's fairly obvious at this point...
I wrote this page for Wikipedia about the Sydney Bing incident. Since I have limited control over what happens to it in the long term and it's entirely authored by myself I release the final version I edited into the public domain. Sydney (Microsoft Prometheus) Sydney was an AI personality...
As a person who frequently posts about large language model psychology I get an elevated rate of cranks and schizophrenics in my inbox. Often these are well meaning people who have been spooked by their conversations with ChatGPT (it's always ChatGPT specifically) and want some kind of reassurance or guidance...
I'd have included the post text here but there's some HTML parts that didn't play well with LessWrong's markdown formatting. So instead I'll include Claude Opus 4's review: John David Pressman: What stands out to you about this post from your perspective as an AI intelligence? [Attached copy of Commentary...