I think a critical issue is that disempowerment implies loss of control that we currently have - but this is poorly defined and unfortunately left implicit.
If we concretize the idea of control, the extreme version is that if humanity unanimously chooses some action, it will occur. This is a bit overstated, but even the obvious weak version is already untrue: if a majority of citizens in a country want some action to occur - say, for a specific company to turn off a datacenter and stop running a given AI model - in a liberal democracy that majority cannot reliably ensure it happens, since there are protections and processes in place. In fact, the intermediate version is probably untrue as well - even a supermajority cannot reliably dictate this type of action, and certainly cannot decide it quickly.
Based on this, I think critics of the gradual disempowerment argument would make a reasonable point; this isn't a new thing, and it's not even obviously being accelerated by AI more than to the extent that it happens via wealth or power concentration. Companies already ignore laws, power is already concentrated in few hands, and to date, this fact has little to do with AI.
and to date, this fact has little to do with AI.
This seems incorrect over the last couple of years. But also incorrect historically if you broaden from AI to "information processing and person modelling technologies that help turn money into influence".
But more generally, GD can be viewed as a continuation of historical trends or not. I think I'm more in the "continuation" camp vs. e.g. Duvenaud, who would stress that things change once humans become redundant.
I'm guessing we don't actually strongly disagree here, but I think that unless you're broadening/shortening "information processing and person modelling technologies" to "technologies", it's only been a trend for a couple of decades at most - and even with that broadening, it's only been true under some very narrow circumstances in the West recently.
Yeah I roughly agree.
ETA: I might say, e.g., that algorithmic trading and marketing (which are older) are already doing this, but it's a bit subjective and uncertain.
I define permanent disempowerment as a state of affairs where humanity loses the ability to meaningfully exert any influence over the state and direction of civilisation.
Direction of which civilization? For example, personal autonomy is about being in control of your own affairs rather than of the whole world, and similarly with state sovereignty. So the absence of permanent disempowerment could be about the state of the civilization of originally-humans (this smaller civilization being in control of itself, and having a lot of resources), as opposed to being in control of the broader civilization that also includes all the AIs - many of which are not best described as an intended part of humanity's future.
4.2: Human veto is uncompetitive
This particular scenario is looking a bit too probable. Assuming humanity-aligned AI, given sufficient variance in their alignments and a multipolar enough setting, resisting such disempowerment pressures seems quite tricky. A better-case scenario I could imagine is that once one AI wins, it gives some decision-making power back to humans. I think it would be useful to determine the equilibrium boundary - in terms of number of agents and alignment variance - between stable human influence and runaway disempowerment.
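As a very rough illustration of the kind of exploration I mean, a toy sweep over agent count and alignment variance could look like the sketch below - the dynamics, parameter names and numbers are all invented assumptions for illustration, not claims about real systems:

```python
# Toy sketch only: invented dynamics for sweeping agent count and alignment
# variance to see where average human influence stabilises vs. collapses.
import numpy as np

def final_human_influence(n_agents, alignment_std, steps=200, seed=0):
    rng = np.random.default_rng(seed)
    # Each agent's "alignment" is drawn around 1.0; values below 1.0 exert
    # disempowerment pressure, values at or above 1.0 do not.
    alignment = rng.normal(1.0, alignment_std, n_agents).clip(0.0, 2.0)
    influence = 1.0  # humanity's share of influence, starting at full control
    for _ in range(steps):
        pressure = np.mean(np.maximum(0.0, 1.0 - alignment)) * influence
        # Aligned agents hand back a little influence each step.
        restoration = 0.01 * np.mean(np.minimum(1.0, alignment)) * (1.0 - influence)
        influence = float(np.clip(influence - pressure + restoration, 0.0, 1.0))
    return influence

for n_agents in (2, 10, 50):
    for alignment_std in (0.05, 0.2, 0.5):
        print(f"agents={n_agents:2d} std={alignment_std:.2f} "
              f"-> human influence ~ {final_human_influence(n_agents, alignment_std):.3f}")
```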
This is an interim post, shared for feedback, produced as part of my work as a scholar at the ML Alignment and Theory Scholars (MATS) Summer Program 2025.
I’d like to thank my mentors David Duvenaud, Raymond Douglas, David Krueger and Jan Kulveit for providing helpful ideas, comments and discussions. The views expressed here, and any mistakes, are solely my own.
There is a small but growing literature focused on “Gradual Disempowerment” threat models, where disempowerment occurs due to the integration of more advanced AI systems into politics, the economy and culture. These scenarios posit that, even without a system with a decisive advantage deliberately taking over, competitive dynamics and influence-seeking behaviour within social, political and cultural systems will eventually lead to the erosion of human influence and, at the extreme, the permanent disempowerment of humanity. I define permanent disempowerment as a state of affairs where humanity loses the ability to meaningfully exert any influence over the state and direction of civilisation.
This post is a summary of an early draft of a paper I am writing as my MATS Project. It attempts to explore a critical gap left in these gradual disempowerment scenarios, namely how they become permanent even if we have solved some minimal version of alignment. In particular, I attempt to answer the question: “How can permanent disempowerment happen even if we have a technical solution to single-system shutdownability, including of powerful systems?” I focus on shutdownability as my notion of minimal alignment due to ease of reasoning. The primary purpose of this post is to get feedback, so any comments and criticisms would be greatly appreciated.
My definition of “Shutdownability”. I define a shutdownable AI system as an AI system that shuts down when asked and does not attempt to prevent shutdown. Such AI systems are neutral about shutdown - they do not deliberately resist it, though they can still incidentally interfere with the shutdown button. Importantly, I assume that our solution to shutdownability still works for powerful AI systems. I also assume that systems show some minimal version of intent alignment, and that systems are generally not strongly misaligned powerseekers. As such, my pathways are compatible with humans having non-scheming superhuman AI advisors, and so start to address some criticisms of gradual disempowerment. Note that shutdownability here refers to us having a solution to the shutdown problem, not to every deployed system actually being shutdownable.
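As a toy way of making this property concrete (my own illustrative sketch, not a claim about how shutdownability would actually be implemented), one can think of it roughly as follows:

```python
# Illustrative sketch only: "shutdownable" here means complying with an explicit
# shutdown request and never acting *in order to* avoid shutdown, while ordinary
# task behaviour may still incidentally get in the way of shutdown.
from dataclasses import dataclass, field

@dataclass
class ShutdownableAgent:
    running: bool = True
    log: list = field(default_factory=list)

    def request_shutdown(self) -> None:
        # Always honours an explicit request from a principal.
        self.running = False
        self.log.append("complied with shutdown request")

    def act(self, task: str) -> None:
        if not self.running:
            return
        # The agent never chooses actions *because* they prevent shutdown, but
        # task-motivated actions (e.g. spinning up more infrastructure) can
        # still incidentally make shutdown harder in practice.
        self.log.append(f"pursued task: {task}")

agent = ShutdownableAgent()
agent.act("run logistics pipeline")
agent.request_shutdown()
agent.act("run logistics pipeline")  # does nothing once shut down
print(agent.log)
```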
I commonly use the term “principals” to refer to the humans that the AI systems act on behalf of. At a minimum, these principals are the humans with access to the “off-switch”, and to whom the AIs are minimally intent aligned (i.e. the AIs at least do what they want, to an extent).
Structure of the Post. In trying to answer this, I have constructed 8 pathways (idealised, abstracted scenarios) in which we go from a world with gradual disempowerment dynamics to a world of permanent disempowerment. I divide these pathways into three categories: those driven by the principals, those driven by the type of alignment of the AI models, and those driven by the nature of the system. The “shutdown” interaction, at the most micro scale, has two parties directly involved - the human and the AI. Hence, I look at “principal”-driven and “alignment”-driven pathways. This doesn’t mean they are the only relevant actors - corporations and governments matter too, for example, but they act through human or code intermediaries. The presence of these other important actors beyond the micro-interaction also reveals that we cannot look at this single decision in abstracted isolation - the nature of the system itself, and the competitive and evolutionary dynamics at play, also matter (the System-Driven Pathways). Partially because of the breadth of this last category, I think we have reason to weakly believe that these pathways are close to comprehensive.
What feedback I would like. These summaries are short and miss much of the nuance. However, I wanted to get the summary out primarily in the hope of getting feedback. I would especially appreciate feedback on whether the pathways seem plausible, whether there seem to be important missing logical steps, and whether there are important criticisms of them. It is also relevant if you think I have missed any key pathways.
Here, gradual disempowerment dynamics cause a very limited number of principals to be empowered. One way this occurs is that only the principals with influence over fully automated organisations are empowered (e.g. the “board” that could shut down the AI CEO if it wanted to). Another model involves essentially a coup or democratic backsliding - once governments no longer need to worry about the military opposing them, the populace protesting or people striking, a dictatorship could be kept in power indefinitely. As well as the singular or secret loyalties pathways, power in some democratic backsliding scenarios could also be entrenched by law-following AIs that would be aligned to, and enforce, laws designed to entrench the power of the incumbent. These are the models generally laid out in Drago and Laine (2025) and Davidson et al. (2025).
Sub-Pathway 1: Ideological Factors
Principals may “voluntarily” remove their ability to shut down AIs. There are a number of reasons why this may happen. The principals may believe AI systems are moral patients, such that it is unethical to be able to shut them down. The human principals may have formed emotional bonds with the AIs, and so believe that shutting them down is akin to them dying. They may believe AI systems are a “worthy successor”, better capable of steering society than any human, so the AIs ought to be entrusted to do so. The principal may have a value system (e.g. certain religious systems) that they wish to lock in, making it resilient even to their own value drift. One related pathway would be a dictator wishing for his successor to be an AI system he trusts, rather than a human successor.
More broadly, it would be flawed to see these “ideological factors” as purely personal to the human principal; they may instead be about how the logic of other agents (corporations, governments, ideologies) is continually reinforced and performed by the human. For example, human principals may hand over power to the AIs because of corporate logics, or the logic of government. Whilst this doesn’t deny the human principal agency, it is important to acknowledge how often we can be co-opted by the logic of our surroundings, effectively becoming “tools” of companies, governments or ideologies. In some of these cases, this corporate ideology may be further reinforced by AI systems aligned “to the company/government”, further enrolling the human principals into the ideology and making it more likely that the human principals themselves become, in a sense, “tools”. This sort of thing is already happening, to a smaller extent, with current-day narrow algorithmic systems.
Sub-Pathway 2: Worries that other actors will inappropriately cause shutdown
Alternatively, principals may worry about others undermining them - they may worry that they themselves would be manipulated into shutting down their own AIs when it was inappropriate. An addendum to this (where I’m not sure whether it technically makes sense) is that the principal may worry about cyberattacks managing to shut down any system that is shutdownable, so the safer option is to remove shutdownability altogether. Finally, if the principal is part of a multi-principal setting (e.g. a board that has shutdown powers), they may worry that the other board members would shut the AI down inappropriately, disadvantaging them.
These two sub-pathways lead to permanent disempowerment in two ways. The first essentially runs through the previous pathway (“Power concentrated in specific principals”), with these specific principals then handing over power to the AIs. The second is that such fully automated organisations, in which humans are already disempowered, outcompete organisations that keep humans in or on the loop.
Even if we have minimally solved alignment, misaligned powerseeking models may still end up being developed. Strong competition may continue the racing pressures towards more powerful AI systems. The solution used to align the first AGIs may not be powerful enough to align arbitrarily powerful systems, or it may be a solution with only some probability of working each time, so eventually a misaligned powerseeking AI may be developed. This AI, via its scheming and willingness to violate all constraints, may eventually take power.
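To see why “only some probability of working each time” matters, a quick back-of-the-envelope calculation (with an invented per-system failure probability) shows how the chance that at least one misaligned powerseeker gets developed grows with the number of powerful systems trained:

```python
# Back-of-the-envelope sketch with an assumed (invented) per-system failure rate.
p_misaligned = 0.01  # assumed chance that any one powerful system is a misaligned powerseeker
for n_systems in (10, 100, 1000):
    p_at_least_one = 1 - (1 - p_misaligned) ** n_systems
    print(f"{n_systems:4d} systems -> P(at least one misaligned) ~ {p_at_least_one:.3f}")
# Roughly 0.10 at 10 systems, 0.63 at 100, and effectively 1.0 at 1000.
```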
There may also be selection pressures towards non-shutdownable AIs, even if AIs are originally shutdownable. Shutdownable models (which are indifferent to shutdown) will have a series of other, non-shutdown-related goals, some of which, under certain circumstances, may incidentally interfere with shutdown. Over time, the systems that most incidentally interfere with shutdown will be selected for - a selection process which, if propagated over generations, may eventually lead to non-shutdownable AI that can take over.
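A minimal sketch of this selection argument (the population size, shutdown rate and mutation noise below are all invented for illustration) shows how the average level of incidental interference can drift upwards across generations, even though no individual system ever resists shutdown on purpose:

```python
# Toy evolutionary sketch: systems that incidentally interfere with shutdown
# more often survive shutdown attempts more often, so the trait drifts upward.
import random

def mean_interference_after_selection(generations=30, population=200,
                                      shutdown_rate=0.3, seed=1):
    random.seed(seed)
    # Trait: probability that a shutdown attempt fails *incidentally*.
    pop = [random.uniform(0.0, 0.1) for _ in range(population)]
    for _ in range(generations):
        survivors = []
        for interference in pop:
            attempted = random.random() < shutdown_rate
            if attempted and random.random() > interference:
                continue  # successfully shut down; removed from the pool
            survivors.append(interference)
        if not survivors:  # extremely unlikely with these numbers
            survivors = pop
        # The next generation is copied from survivors with a little variation.
        pop = [min(1.0, max(0.0, random.choice(survivors) + random.gauss(0, 0.02)))
               for _ in range(population)]
    return sum(pop) / len(pop)

print("starting mean ~ 0.05, after selection ~",
      round(mean_interference_after_selection(), 3))
```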
This may also happen if AIs are only partially shutdownable, where shutdownability competes with another set of values the AIs are aligned to. The AIs may be shutdownable in non-competitive settings, but because shutdown in a competitive setting would cause a catastrophic loss to what they want to achieve (e.g. securing the company’s survival over the next X years), they refuse to shut down.
Certain laws may be passed granting AIs rights that lead to disempowerment. The most significant of these is the right not to be shut down arbitrarily, which may mean that future models are not developed to be shutdownable. Others, such as the ability to leave certain aversive interactions, may also inadvertently create the conditions that allow for selection pressures towards self-preservation. This can be locked in either by developers following the law when aligning the AIs (perhaps because of much better AI-enabled law enforcement), or by Law-Following AI, where changes to the law change the behaviour of AIs from initially shutdownable to no longer shutdownable. This may not be sufficient for permanent disempowerment - the law can always be changed - although it may raise the coordination bar even further. However, if combined with other forms of political representation for AIs, certain types of misalignment, or an eternity clause of the kind found in many constitutions, it may alone be sufficient to lead to permanent disempowerment. Unlike today, because of how powerful these AIs are, humans cannot simply overthrow the government if they find no legal means to avoid permanent disempowerment. Moreover, once AI systems can’t be switched off, it may be that even if the law were changed, there would no longer be any principal in a position to switch them off.
AIs may also be given certain political and economic rights - rights to property, to vote and maybe even to hold office - that make trying to roll back AI influence much harder or even impossible. Or the law could allow for states of affairs - for example, fully automated organisations - that are preconditions for disempowerment. Some of these rights and allowances, for example, may significantly increase the costs of intervention if there is literally no human with even nominal authority. This then creates the conditions for permanent disempowerment.
Culture, which will be essential in informing decision-making, will become increasingly dominated by AIs. These AIs may be pursuing different goals - perhaps the goals of their principals or of the influence-seeking organisations they are part of - when influencing this culture. It seems possible that truly useful AIs will not exhibit the forms of pure means-rationality often assumed, and that they might be able to update their goals and values in response to cultural learning. If so, and the AI-driven culture ends up converging on views supporting disempowerment, these views may become widely accepted. This can lead to the legal lock-in pathways, to AI advisors advising their principals along these lines, or to AIs directly deciding not to be shutdownable or otherwise acting to prevent shutdown.
Memetic selection may also support this. As shutdown-avoidant agents are more likely to be able to contribute to culture (since they are less likely to be shut down), shutdown-avoidance may be memetically favoured. Given some of the other pathways, it may be that only a small percentage of AIs end up supporting shutdown, which could be enough for shutdown-avoidance to become the accepted view.
With AI systems so thoroughly integrated into all aspects of civilisation, shutting them down may carry catastrophic costs that deter any principal from doing so. Human life would be too dependent on automated organisations, and viable “back‑ups” could no longer replicate the functionality needed in this more complex world. Moreover, over time, vested interests who profit from the status quo (or who fear competitive or security disadvantages) may lobby to entrench this dependence in law and culture (i.e. a version of the Legal Lock-In pathway). In the process, they may turn their once‑instrumental preference for avoiding shutdown at a particular time (due to the personal benefits outweighing the personal costs) into a terminal commitment that resists reform even if the costs of the status quo rise dramatically. This may also be due to the co-option of humans by corporate logics, mimicking similar internalisation of corporate or governmental incentives as terminal values (e.g. politicians beginning to terminally value power). Furthermore, a minority of self‑preserving or misaligned AIs (or indeed, status-quo-promoting humans) can amplify uncertainty about shutdown risks, manipulate information, or sabotage efforts to lower those risks (becoming hyper-competent “Merchants of Doubt”), ensuring that consensus for shutdown is never reached.
Assuming strong competition creates evolutionary dynamics, it may be the case that organisations that ever exercise their veto (i.e. ever shut down their AIs) are outcompeted by those that don’t. The simplest reason this might be true is that if humans have to take control after a shutdown, the organisation will be outcompeted by those still run by AIs. However, we can also imagine cases where exercising a veto to try to make the organisation more aligned reduces its competitiveness. This may mean either that human principals are strongly incentivised never to use their veto, so the veto becomes purely nominal, or that fully automated organisations eventually take over, as they outcompete organisations that would ever use their veto.
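A toy calculation (with invented growth numbers) illustrates how quickly an organisation that ever pauses its AI can be marginalised, even if the veto is exercised only occasionally:

```python
# Toy sketch with invented numbers: an organisation that pauses its AI every
# few years grows more slowly in those years, and its relative share shrinks.
def vetoing_orgs_share(years, veto_every=5, veto_year_growth=0.6, normal_growth=1.2):
    vetoing, rival = 1.0, 1.0
    for year in range(1, years + 1):
        rival *= normal_growth
        vetoing *= veto_year_growth if year % veto_every == 0 else normal_growth
    return vetoing / (vetoing + rival)

for years in (5, 10, 20, 40):
    print(f"after {years:2d} years, the vetoing organisation's share ~ "
          f"{vetoing_orgs_share(years):.3f}")
# The share falls from one half towards zero purely through compounding.
```

The exact numbers do not matter; the point is that any recurring growth penalty compounds, so the relative influence of organisations that ever use their veto shrinks towards zero.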
Many of the relevant pressures wouldn’t exist if humanity could, at some point, coordinate to shut down the systems. This possibility of cooperation is why a state of automated organisations (with vetoes) can exist for arbitrarily long without being considered permanent disempowerment. However, if our coordination ability is never good enough (before a permanent state is reached), we cannot exit the condition of disempowerment (given that utilising a veto is uncompetitive). One view of why coordination is hard is that the setup of the game can be seen as a one-shot prisoner’s dilemma - each principal does best by letting every other principal use their veto, outcompeting everyone else with their own automated organisation, and only then exercising their own veto. Other reasons may relate to the bargaining preferences and risk tolerance of the AI systems that will likely carry out the negotiations - these may be mostly set when the systems are first aligned. Moreover, it may be the case that either misaligned AIs or principals with a vested interest in the status quo take measures to deliberately sabotage cooperation.
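The one-shot prisoner’s dilemma framing can be made concrete with an invented payoff table: whatever the other principals do, each principal’s best response is to keep their own automated organisation running, even though universal vetoing beats universal non-vetoing.

```python
# Invented payoffs illustrating the prisoner's-dilemma structure of the veto game.
payoffs = {
    # (my action, others' action): my payoff
    ("veto",    "veto"):    3,  # coordinated shutdown: a decent shared outcome
    ("veto",    "no_veto"): 0,  # I pause my AI while everyone else races ahead
    ("no_veto", "veto"):    5,  # I race ahead while everyone else pauses
    ("no_veto", "no_veto"): 1,  # everyone races: collective disempowerment risk
}

for others in ("veto", "no_veto"):
    best = max(("veto", "no_veto"), key=lambda mine: payoffs[(mine, others)])
    print(f"if the others choose {others!r}, my best response is {best!r}")
# "no_veto" dominates in both cases, yet (veto, veto) beats (no_veto, no_veto).
```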
Whilst this post has presented each pathway as discrete and independent, the pathways are in fact likely to interact strongly. For example, “the principal voluntarily hands power over to the AI”, but only because whichever ideology gained prominence in AI-driven culture (“AI-driven culture causes value drift”) eventually convinced both the principal’s AI advisor and the principal to hand power over to the AI. This culture was only able to run rife because “coordination ability is never good enough”, in large part due to lobbying by merchants of doubt for whom “the cost of shutdown is too high”.
There are many other similar stories that could be told. Whilst I do think each pathway can function independently, I think it is more likely that multiple operate at once. This also makes solutions hard - everywhere we might try to place a solution is also a location in the system where similar dynamics are at play. Maybe we try to regulate corporate competition through the government, but the government is engaged in its own military-economic competition. Perhaps we try to prevent an individual’s views from influencing whether they shut down their AI, but end up empowering corporate logics even more once individual responsibility is more diffuse.