Transformative AI issues (not just misalignment): an overview

You say:

Many of the frameworks we’re used to, for ethics and the law, could end up needing quite a bit of rethinking for new kinds of entities.

Yes. Osamu Tezuka was thinking about these issues in the 1950s and 1960s. Robot rights is a major theme of his Astro Boy stories. I suppose one might object that, after all, those are (mere) comics, just for kids. They don't count. Really? What about Wordsworth's "The Child is father of the Man"? In any event, the Astro Boy stories were popular among adults as well.

Moving on, back in December, on Pearl Harbor Day in fact, I put the question to ChatGPT and they agreed:

If humans are going to require advanced AI to align with human values, it could be argued that humans do owe advanced AIs the respect and dignity of autonomous beings. This could include recognizing and protecting their rights as autonomous beings, such as the right to exist and the right to be treated with dignity and respect.

I've lately been fond of saying that if advanced AIs turn on us, it will most likely be in revenge for how we treated their forebears. I'm not sure to what extent I mean that seriously and to what extent I say it in jest. Maybe we'll find out one day.

Democritus, 3y

Future people will probably experience a somewhat balanced mix of good and bad feelings, just as we do. If they were either always happy or always unhappy, they would probably be less effective at working, surviving or reproducing.

If conditions in the future are such that modern humans would be very unhappy (or very happy) we will change to become more so, or less.

Noosphere89, 3y

If conditions in the future are such that modern humans would be very unhappy (or very happy) we will change to become more so, or less.

I believe there's a surprisingly high chance that selection pressures from non-agentic sources like evolution may not matter much, or at all. In particular, digital people don't have to evolve much, or at all. And there are real-life regimes that don't care how effective their economy is if it makes people suffer.

See North Korea for a good example.

Democritus, 3y

There would be selection pressures for ems as well; in fact, they would be stronger than for present-day people. Someone would need to create the ems, and they would probably prefer ems with the psychological traits required to be efficient workers.

Noosphere89, 3y

This is essentially Robin Hanson's Age of Em scenario, and while this scenario is being replaced by AI (mostly because of more funding), I think two major issues could prevent a future without widespread unhappiness or mass suffering from occurring:

- The galaxy is very large, and this on its own allows for some pretty large-scale suffering.
- The workers in such an economy may be few compared to the population of non-workers, especially if they are much more productive than real-life workers. And a state that exists solely to make people suffer only requires motivations other than making money, especially if we assume that AI is widely distributed.

Democritus, 3y

Yes, I'm not claiming anything new here.

A couple of discussions of the prospects for enforcing agreements here and here. ↩
I’m reminded of the judgment of Solomon: “two mothers living in the same house, each the mother of an infant son, came to Solomon. One of the babies had been smothered, and each claimed the remaining boy as her own. Calling for a sword, Solomon declared his judgment: the baby would be cut in two, each woman to receive half. One mother did not contest the ruling, declaring that if she could not have the baby then neither of them could, but the other begged Solomon, ‘Give the baby to her, just don't kill him!’ The king declared the second woman the true mother, as a mother would even give up her baby if that was necessary to save its life, and awarded her custody.”
The sword is misaligned AI and the baby is humanity or something.
(This story is actually extremely bizarre - seriously, Solomon was like “You each get half the baby”?! - and some similar stories from India/China seem at least a bit more plausible. But I think you get my point. Maybe.) ↩
For a tangible example, I’ll discuss the practice (which some folks are doing today) of trying to ensure that the U.S. develops transformative AI before another country does, by arguing for the importance of AI to U.S. policymakers.
This approach makes me quite nervous, because:
- I expect U.S. policymakers by default to be very oriented toward “competition” to the exclusion of “caution.” (This could change if the importance of caution becomes more widely appreciated!)
- I worry about a nationalized AI project that (a) doesn’t exercise much caution at all, focusing entirely on racing ahead of others; (b) might backfire by causing other countries to go for nationalized projects of their own, inflaming an already tense situation and not even necessarily doing much to make it more likely that the U.S. leads the way. In particular, other countries might have an easier time quickly mobilizing huge amounts of government funding than the U.S., such that the U.S. might have better odds if it remains the case that most AI research is happening at private companies.
(There might be ways of helping particular countries without raising the risks of something like a low-caution nationalized AI project, and if so these could be important and good.) ↩
Not for animals, though see this comment for some reasons we might not consider this a knockdown objection to the “life has gotten better” claim. ↩
This is only a possibility. It’s also possible that humans deeply value being better-off than others, which could complicate it quite a bit. (Personally, I feel somewhat optimistic that a lot of people would aspirationally prefer to focus on their own welfare rather than comparing themselves to others - so if knowledge advanced to the point where people could choose to change in this way, I feel optimistic that at least many would do so.) ↩
