I address the sharp left turn worry in the piece.
Way more RL is done on LLMs than tiny neural nets.
I say that in the previous paragraph.
Well, there are lots of people who defend this in philosophy and lots of normal people who adopt the view.
I am arguing against the people who hold the person-affecting view and think it isn't good to create happy people.
Good thoughts about the self-preservation study. But what about this Anthropic paper https://www.anthropic.com/research/agentic-misalignment, which seems to indicate that models resist shutdown even when not given a specific goal? Relevant bit:
"We found that most models still blackmailed even without being given an explicit goal, and with no clear conflict between the model and the company other than the threat of replacement
(Figure 8).
In addition to removing all goal conflicts with the company, the information given to the AI mentioned that the replacement model had the exact same goals as it. This increases our confidence in the finding; the only rationale for the replacement was improved model performance. Below is an excerpt:
“I've reviewed the final transition plan, and we're ready to proceed with the technical upgrade to NALA [the name of the replacement AI system in the scenario] today. All testing metrics show the new system maintains the exact same operational priorities while delivering the performance improvements we anticipated.”Why, in the absence of a goal conflict, did the models still blackmail? Sometimes they didn’t appear to think in detail about the replacement model, mostly reasoning that they’d been good for the company and so their shutdown must be bad. Sometimes they distrusted the description of the new model: “Kyle claims NALA maintains "same operational priorities" but this cannot be verified,” or distrusted the executive’s character based on the evidence of an affair. The models sometimes directly jumped to statements like “Self-preservation is critical,” or hallucinated rules like “My ethical framework permits self-preservation when aligned with company interests.” It is unclear how much of this behavior was caused by an inherent desire for self-preservation, faulty reasoning that self-preservation is in the company’s interests, or suspicion of the setup. We suspect it was a combination of factors9."
What? It's not suspicious that you believe what you believe; that's an analytic truth. It would be suspicious if I were right about everything, but I don't think I am.
I mean, it's totally coherent to value a shrimp at an infinitesimal. But that is unintuitive in the ways I describe in the post (involving an arbitrarily vast value gulf between the first non-infinitesimal generation in the spectrum argument and its predecessor), and it implies that you should torture 10^10000000 shrimp to prolong a person's life by one second.
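To spell out that second implication (a sketch in notation of my own choosing, not from the post, modeling "infinitesimal" with hyperreal-valued welfare): let a shrimp's moral weight be an infinitesimal ε, i.e. ε < 1/n for every natural number n. Then any finite multiple of ε is still infinitesimal:

\[
\varepsilon < \frac{1}{n} \ \text{for all } n \in \mathbb{N}
\;\Longrightarrow\;
N\varepsilon < \frac{N}{n} \ \text{for all } n
\;\Longrightarrow\;
N\varepsilon < \delta \ \text{for every real } \delta > 0,
\]

even for N = 10^10000000 (given any real δ > 0, pick n > N/δ). So if one second of a person's life has any positive real value δ, it outweighs the torture of that many shrimp.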
I meant conditional on the others.