Individual AI representatives don't solve Gradual Disempowerement

[-]Nathan Helm-Burger5mo50

I think this is correct, but for what it's worth, I did take this into account in my proposed AI representative system. My system wasn't a free-for-all, it was government-run with an equally powerful value-customizable AI assigned to each citizen. Then these representatives would debate in a controlled setting, like a giant parliament with established discussion rules. Like a legislature the size of the populace.

In the free-for-all setting, then you not only have problems with States and corporations, but also inequality between people. Wealthy people can afford smarter, faster, more numerous AIs. Less cautious people can let their AIs operate with less hindrance from human oversight. So the "AI teams" that pull ahead will be those who started out powerful and acted recklessly (and probably also immorally).

So the only way this situation is ok is if the government bans all of that free-for-all AI. Or, if a value-aligned AI Singleton or coalition bans it. Naive personally-aligned AIs are a bad equilibrium.

[-]Noosphere895mo40

While I agree that the idea of AI representatives don't immediately solve the problem absent other things, I do think you underestimate the power of AI representatives in solving the issues of Gradual Disempowerment, and there are a couple of reasons for this:

Lots of the dynamic of gradual disempowerment comes down to the fact that you can't just leave an economy or society that disempowers you, and due to stuff like fusion, nanotech, biotech and more technologies could allow humans to survive alone in space colonies without suffering the big logistical penalties for leaving society.

RussellThor talks more about this below:

https://www.lesswrong.com/posts/pZhEQieM9otKXhxmd/?commentId=CramJssYNDmTMDr6Z

Assuming that a supermajority/every AI produced by companies/states terminally value humans surviving and thriving, then people being disempowered could work out fine, similar to how pets are treated relatively well by humans, despite pets generally being totally dependent on humans to live well (with caveats).

Indeed, the human-pet relationship is a good example of what I think good futures/relationships between AIs and humans look like by default, assuming the alignment problem is solved and we don't die and get very rich.

That isn't likely to happen, but if it did happen, would defuse a lot of the issue of disempowerment leading to starvation/death.

Also, the individual AI representatives can coordinate (under assumptions of shared values) much better than human negotiators/coordinators do, so companies don't have all the coordination power.

[-]Cerine_Way5mo30

Re: Humans are Not Alone.

Firstly,

To further your argument, Capital in the 21st Century makes the point that it becomes more efficient to generate capital the more capital you have (because many strategies to make more capital benefit from having more starting capital).

In that case, even if we had both
1) Only AI advisors for individual *humans*, and not any larger social entities,
2) All AI advisors are (initially) rate-limited to the same level of intelligence,

We might imagine that the AI advisors of the richest individuals could enact capital-intensive strategies that disproportionately increase their influence over the future, effectively disempowering >99% of humans simply for being poor.

Secondly,

I claim the clearest reason that AI advisors fail is simply that before the transition to a stable AI-driven civilization finishes, it seems almost certain at least one smart-human level AI will escape into the wild and start autonomously evolving, and this is likely to lead to disempowerment. Does this seem right to you? What am I missing?

^{^}

We also recommend this in the Mitigation section of the paper: Developing AI delegates who can advocate for people's interest with high fidelity, while also being better to keep up with the competitive dynamics that are causing the human replacement. Recently Ryan Greenblatt recommends this as one of the best approaches.

^{^}

There are also multiple practical obstacles, not discussed here,

^{^}

This is one of the reasons why I believe differentially advancing capabilities in direction of coordination is important.

LESSWRONG
LW

LESSWRONG
LW

60

Individual AI representatives don't solve Gradual Disempowerement

60

Ω 31

60

Ω 31

Humans are Not Alone

The Substrate Shift

Coordination and Collective Action

The Role of the Leviathan