So8res

Why Corrigibility is Hard and Important (i.e. "Whence the high MIRI confidence in alignment difficulty?")

by Raemon, Eliezer Yudkowsky, and So8res

A lot of objection and confusion to the MIRI worldview seems to come from a perspective of "but, it.... shouldn't be possible be that confident in something that's never happened before at all, with anything like the current evidence and the sorts of arguments you're making here." And while I...

Sep 30, 202595

The Problem

by Rob Bensinger, tanagrabeast, yams, So8res, Eliezer Yudkowsky, and Gretta Duleba

This is a new introduction to AI as an extinction threat, previously posted to the MIRI website in February alongside a summary. It was written independently of Eliezer and Nate's forthcoming book, If Anyone Builds It, Everyone Dies, and isn't a sneak peak of the book. Since the book is...

Aug 5, 2025331

A case for courage, when speaking of AI danger

I think more people should say what they actually believe about AI dangers, loudly and often. Even (and perhaps especially) if you work in AI policy. I’ve been beating this drum for a few years now. I have a whole spiel about how your conversation-partner will react very differently if...

Jun 27, 2025531

Eliezer and I wrote a book: If Anyone Builds It, Everyone Dies

Eliezer and I wrote a book. It’s titled If Anyone Builds It, Everyone Dies. Unlike a lot of other writing either of us have done, it’s being professionally published. It’s hitting shelves on September 16th. It’s a concise (~60k word) book aimed at a broad audience. It’s been well-received by...

May 14, 2025655

LessWrong: After Dark, a new side of LessWrong

The LessWrong team has obviously been hard at work putting out their debut album. But another LessWrong feature also seems to have been released today, to less fanfare: LessWrong: After Dark, a branch of the site devoted to explicit discussion of sex and sexuality, where the LessWrong team finally gets...

Apr 1, 202436

Ronny and Nate discuss what sorts of minds humanity is likely to find by Machine Learning

Context: somebody at some point floated the idea that Ronny might (a) understand the argument coming out of the Quintin/Nora camp, and (b) be able to translate them to Nate. Nate invited Ronny to chat. The chat logs follow, lightly edited. The basic (counting) argument Ronny Fernandez Are you mostly...

Dec 19, 202343

Quick takes on "AI is easy to control"

A friend asked me for my quick takes on “AI is easy to control”, and gave an advance guess as to what my take would be. I only skimmed the article, rather than reading it in depth, but on that skim I produced the following: > Re: "AIs are white...

Dec 2, 202326

So8res

So8res

Eliezer and I wrote a book: If Anyone Builds It, Everyone Dies

A case for courage, when speaking of AI danger

Focus on the places where you feel shocked everyone's dropping the ball

The Problem

So8res

Eliezer and I wrote a book: If Anyone Builds It, Everyone Dies

A case for courage, when speaking of AI danger

Focus on the places where you feel shocked everyone's dropping the ball

The Problem

Why Corrigibility is Hard and Important (i.e. "Whence the high MIRI confidence in alignment difficulty?")

The Problem

A case for courage, when speaking of AI danger

Eliezer and I wrote a book: If Anyone Builds It, Everyone Dies

LessWrong: After Dark, a new side of LessWrong

Ronny and Nate discuss what sorts of minds humanity is likely to find by Machine Learning

Quick takes on "AI is easy to control"