Also a very rough response.
- I think the debate would probably benefit from better specification of what is meant by "misalignment" or "solving alignment"
-- I do not think the convincing versions of gradual disempowerment either rely on misalignment or result in power concentration among humans, for a relatively common meaning of alignment, roughly at the level of "does what the developer wants and approves, resolving conflicts between their wants in a way which is not egregiously bad". If "aligned" means something at the level of "implements coherent extrapolated volition of humanity" or "solves AI safety", then yes.
- Economic
-- the counter-argument seems to be roughly in the class "everyone owns index funds" and "state taxes AIs"
--- counter-counter-arguments are:
---- difficulty of indexing an economy undergoing a radical technological transition (as explained in an excellent post by Beren which we reference)
---- problems with stability of property rights: people in the US or UK often perceive them as very stable, but they depend on the state enforcing them -> the state becomes a more load-bearing component of the system
---- taxation: same -> the state becomes a more load-bearing component of the system
---- in many cases some income can be nominally collected in the name of humans, but they may have very little say in the process or in how it is used (for some intuition, consider His Majesty's Revenue & Customs. HMRC is a direct descendant of a chain of organisations collecting customs since the ~13th century; in the beginning, His Majesty had a lot of say in what these were and could also actually use the revenue; now, not really)
- Cultural. "If humans remain economically empowered (in the sense of having much more money than AI), I think they will likely remain culturally empowered."
-- this takes a bit too much of an econ perspective on culture; cultural evolution is somewhat coupled with the economy, but it is an independent system with different feedback loops
-- in particular, it is important to understand that while in most econ thinking consumer preferences are exogenous, culture is largely what sets those preferences; to some extent, culture is what the consumers are made of -> having overwhelming cultural production power means setting consumer preferences
--- for some intuitions, consider current examples
---- right-wing US Twitter discourse is often influenced by anonymous accounts run by citizens of India and Pakistan; the people running these accounts often have close to zero econ power, and their main source of income is the money they get for posts
----- yet they are able to influence what e.g. Elon Musk thinks, despite the >10^7 wealth difference
----- "Even AI-AI culture, if it promotes bad outcomes for humans and humans can understand this, will be indirectly selected against as humans (who have money) prefer interacting with AI systems that have good consequences for their well-being." seems to prove too much. Again, consider Musk. He is the world's wealthiest person, yet his mind is often inhabited by ideas that are bad for him and his well-being, and that have overall bad consequences.
- State
-- unclear to me: why would you expect "formal power" to keep translating into real power? (For some intuition: the United Kingdom. Quite a lot in the country is done in the name of His Majesty The King.)
-- we assume institutional AIs will be aligned to institutions and institutional interests, not their nominal human representatives or principals
-- I think the model of the world where superagents like states or large corporations have "dozens of people controlling these entities" is really not how the world works. Often the person nominally in charge is more a servant aligned to the entity than its "principal".
--- “While politicians might ostensibly make the decisions, they may increasingly look to AI systems for advice on what legislation to pass, how to actually write the legislation, and what the law even is. While humans would nominally maintain sovereignty, much of the implementation of the law might come from AI systems.” / All seems good, if AI is well-aligned? Imo, it would be bad not to hand off control to aligned AIs that would be more competent and better motivated than us
---- I think you should be really clear about who the AIs are aligned to. Either, e.g., US governmental AIs are aligned to the US government and state in general, in which case the dynamic leads to a state in which no human principals have any real power, and humans just rubber-stamp.
---- Or the governmental AIs are aligned to specific humans, such as the US president. This would imply very large changes in power relative to the current state, transitioning from a republic to a personal dictatorship. Both the US state and US citizens would fight this
(may respond to some of the rough thoughts later, they explore interesting directions)
I don't think that the example of kings losing their powers really supports your thesis here. That wasn't a seamless, subtle process of power slipping away. There was a lot of bloodshed and threat of bloodshed involved.
King Charles I tried to exercise his powers as a real king and go against the Parliament, but the people rebelled and he lost his head. After that, his son managed to restore the monarchy, though he needed to agree to some more restrictions on his powers. After that, James II tried to go against the Parliament again, and got overthrown and replaced by another guy who agreed to relinquish the majority of royal powers. After that, the king still had some limited say, but when they tried to impose unpopular taxes in America, the colonies rebelled, and gained independence through a violent revolution. Then, next door in France, Louis XVI tried to go against the will of his Assembly, and lost his head. After all this, the British Parliament started to politely ask their kings to relinquish the remainder of their powers, and they wisely agreed, so their family could keep their nominal rulership, their nice castle, and most importantly, their heads.
I think the analogous situation would be AIs violently taking over some countries, and after that, the other countries bloodlessly surrendering to their AIs. I think this is much closer to the traditional picture of AI takeover than to the picture you are painting in Gradual Disempowerment.
On the other hand, there is another interesting factor in kings losing power that might be more related to what you are talking about (though I don't think this factor is as important as the threat of revolutions discussed in the previous comment).
My understanding is that part of the story for why kings lost their power is that the majority of people were commoners, so the best writers, artists and philosophers were commoners (or at least not the highest aristocrats), and the kings and the aristocrats read their work, and these writers often argued for more power to the people. The kings and aristocrats sometimes got sincerely convinced, and agreed to relinquish some powers even when it was not absolutely necessary for preempting revolutions.
I think this is somewhat analogous to the story of cultural AI dominance in Gradual Disempowerment: all the most engaging content creators are AIs, humans consume their content, the AIs argue for giving power to AIs, and the humans get convinced.
I agree this is a real danger, but I think there might be an important difference between the case of kings and the AI future.
The court of Louis XVI read Voltaire, but I think if there was someone equally witty to Voltaire who also flattered the aristocracy, they would have plausibly liked him more. But the pool of witty people was limited, and Voltaire was far wittier than any of the few pro-aristocrat humorists, so the royal court put up with Voltaire's hostile opinions.
On the other hand, in a post-AGI future, I think it's plausible that with a small fraction of the resources you can get close to saturating human engagement. Suppose pro-human groups fund 1% of the AIs generating content, and pro-AI groups fund 99%. (For the sake of argument, let's grant the dubious assumption that the majority of economy is controlled by AIs.) I think it's still plausible that the two groups can generate approximately equally engaging content, and if humans find pro-human content more appealing, then that just wins out.
Also, I'm kind of an idealist, and I think part of the reason that Voltaire was successful is that he was just right about a lot of things: parliamentary government really leads to better outcomes than absolute monarchy from the perspective of a more-or-less shared human morality. So I have some hope (though definitely not certainty) that AI content creators competing in a free marketplace of ideas will only convince humanity to voluntarily relinquish power if relinquishing power is actually the right choice.
Kings also lost their power because the name of the game had changed significantly.
In the actual Middle Ages, kings may have nominally had complete power, but in reality they were heavily constrained by the relations they had with wealthy landowners and nobles. The institution of the Royal Court persevered precisely because it served an absolutely critical social purpose, namely providing a mechanism for coordination between the lords of the realm. Everybody was subject to the crown and the crown's rulings, so disputes could be resolved and hierarchies could be established (relatively) bloodlessly. Conversely, the king was nominally above the lords, but he served at their pleasure, in the sense that if he became sufficiently unpopular with them, he would be removed.[1]
As the move towards absolutism happened and kings started amassing de facto power approaching the de jure power they'd long pretended they'd had, suddenly the old justification for the king's existence evaporated.
Chinese history contains dozens of examples of emperors losing the Mandate of Heaven in the eyes of wealthy lords or powerful generals, and getting executed for it.
> I think the debate would probably benefit from better specification of what is meant by "misalignment" or "solving alignment"
> I do not think the convincing versions of gradual disempowerment either rely on misalignment or result in power concentration among humans, for a relatively common meaning of alignment, roughly at the level of "does what the developer wants and approves, resolving conflicts between their wants in a way which is not egregiously bad". If "aligned" means something at the level of "implements coherent extrapolated volition of humanity" or "solves AI safety", then yes.
Just checking: Would you say that the AIs in "you get what you measure" and "another (outer) alignment failure story" are substantially less aligned than "does what the developer wants and approves, resolving conflicts between their wants in a way which is not egregiously bad"?
Thanks!
Appreciate the many concrete examples you're giving here.
Responding quickly.
> I do not think the convincing versions of gradual disempowerment either rely on misalignment or result in power concentration among humans, for a relatively common meaning of alignment, roughly at the level of "does what the developer wants and approves, resolving conflicts between their wants in a way which is not egregiously bad". If "aligned" means something at the level of "implements coherent extrapolated volition of humanity" or "solves AI safety", then yes.
Yep, that makes sense. And I disagree, so this is useful clarification.
I think that if AI "does what the developer wants and approves, resolving conflicts between their wants in a way which is not egregiously bad" then I am much less worried about GD than about power-seeking AI. (Though I have some uncertainty here if the AI is resolving these conflicts pretty badly but hiding the fact that it's doing this for some reason. But if it's resolving these conflicts as well as a fairly competent human would, I feel much less worried about GD than about power-seeking AI.)
> difficulty of indexing an economy undergoing a radical technological transition (as explained in an excellent post by Beren which we reference)
It's more than just index funds. It's people getting AIs to invest on their behalf, just like VCs invest on people's behalf today. It seems like we need fairly egregious misalignment for this to fail, no?
> problems with stability of property rights: people in the US or UK often perceive them as very stable, but they depend on the state enforcing them -> the state becomes a more load-bearing component of the system
Why is it more load-bearing than today? Today it's completely load-bearing, right? If income switches from wages to capital income, why does it become more load-bearing? (I agree it becomes more load-bearing when taxation is needed for people's income - but many people will own capital so won't need this.)
> having overwhelming cultural production power means setting consumer preferences
Thanks, interesting point. Though humans will own/control the AIs producing culture, so they will still control this determinant of human preferences.
> right-wing US Twitter discourse is often influenced by anonymous accounts run by citizens of India and Pakistan; the people running these accounts often have close to zero econ power, and their main source of income is the money they get for posts
Interesting. And you're thinking that the analogy is that AIs will have no money but could have a big cultural influence? Makes sense. (Though again, those AIs will be owned/controlled by humans, somewhat breaking the analogy.)
> Again, consider Musk
But the ideas that are bad for Musk and his thinking have generally decreased his power + influence, no? Overall he's an exceptionally productive and competent person. If some cultural meme caused him to be constantly addicted to his phone, that wouldn't be selected for culturally.
> we assume institutional AIs will be aligned to institutions and institutional interests, not their nominal human representatives or principals
So what causes the govt AIs to be aligned to the state over its human officeholders, to the extent where they disempower those humans? Why don't those humans see it coming and adjust the AI's goals? Or, if the AI is aligned to the state, why doesn't it pursue the formal goals of the state, like protecting its people?
> I think that if AI "does what the developer wants and approves, resolving conflicts between their wants in a way which is not egregiously bad" then I am much less worried about GD than about power-seeking AI.
If the AI is that well-aligned, then presumably power-seeking AI is also not much of a problem, and you shouldn't be that concerned about either?
Maybe you mean "if I assume that I don't need to be worried about GD outside of the cases where AI "does what the developer wants and approves, resolving conflicts between their wants in a way which is not egregiously bad", then I am overall much less worried about GD than about power-seeking AI"?
I hope it's not presumptuous to respond on Jan's behalf, but since he's on vacation:
> It's more than just index funds. It's people getting AIs to invest on their behalf, just like VCs invest on people's behalf today. It seems like we need fairly egregious misalignment for this to fail, no?
Today, in the U.S. and Canada, most people have no legal way to invest in OpenAI, Anthropic, or xAI, even if they have AI advisors. Is this due to misalignment, or just a mostly unintended outcome from consumer protection laws, and regulation disincentivizing IPOs?
> If income switches from wages to capital income, why does it become more load-bearing?
Because the downside of a one-time theft is bounded if you can still make wages. If I lose my savings but can still work, I don't starve. If I'm a pensioner and I lose my pension, maybe I do starve.
> humans will own/control the AIs producing culture, so they will still control this determinant of human preferences.
Why do humans already farm clickbait? It seems like you think many humans wouldn't direct their AIs to make them money / influence by whatever means necessary. And it won't necessarily be individual humans running these AIs, it'll be humans who own shares of companies such as "Clickbait Spam-maxxing Twitter AI bot corp", competing to produce the clickbaitiest content.
> Today, in the U.S. and Canada, most people have no legal way to invest in OpenAI, Anthropic, or xAI, even if they have AI advisors. Is this due to misalignment, or just a mostly unintended outcome from consumer protection laws, and regulation disincentivizing IPOs?
Sorry if this is missing your point — but why would AIs of the future have a comparative advantage relative to humans, here? I would think that humans would have a much easier time becoming accredited investors and being able to invest in AI companies. (Assuming, as Tom does, that the humans are getting AI assistance and therefore are at no competence disadvantage.)
I was responding to "people getting AIs to invest on their behalf, just like VCs invest on people's behalf today. It seems like we need fairly egregious misalignment for this to fail, no?"
I'm saying that one way that "humans live off index funds" fails, even today, is that it's illegal for almost every human to participate in many of the biggest wealth creation events. You're right that most AIs would probably also be barred from participating in most wealth creation events, but the ones that do participate (maybe by being hosted by, or part of, the new hot corporations) can scale / reproduce really quickly to double down on whatever advantage they have from being in the inner circle.
> You're right that most AIs would probably also be barred from participating in most wealth creation events, but the ones that do participate (maybe by being hosted by, or part of, the new hot corporations) can scale / reproduce really quickly to double down on whatever advantage they have from being in the inner circle.
I still don't understand why the AIs that have access would be able to scale their influence more quickly than the AI-assisted humans who have the same access.
(Note that Tom never talked about index funds, just about humans investing their money with the help of AIs, which should allow them to stay competitive with AIs. You brought up one way in which some humans are restricted from investing their money, but IMO that constraint applies at least as strongly to AIs as to humans, so I just don't get how it gives AIs a relative competitive advantage.)
Overall, I think this consideration favours economic power concentration among the humans who are legally allowed to invest in the most promising opportunities and have AI advisors to help them.
And, conversely, this would decrease the economic influence of other humans and AIs.
(I’ve only read the parts I’m responding to)
> My high-level view is that the convincing versions of gradual disempowerment either rely on misalignment or result [from] power concentration among humans.
It feels like this statement should be qualified more; later it is stated that GD isn’t “similarly plausible to the risks from power-seeking AI or AI-enabled coups”, but this is holding GD to a higher bar; the relevant bar would seem to be “is plausible enough to be worth considering”.
“Rely[ing] on misalignment” is also an extremely weak condition: I claim that current systems are not aligned, and gradual disempowerment dynamics are already at play (cf. the AI “arms race”).
The analysis of economic disempowerment seems to take place in a vacuum, ignoring one of the main arguments we make, which is that different forms of disempowerment can mutually reinforce each other. The most concerning version of this, I think, is not just “we don't get UBI”, but rather that the memes that say “it's good to hand over as much power as quickly as possible to AI” win the day.
The analysis of cultural disempowerment goes one step “worse”, arguing that “If humans remain economically empowered (in the sense of having much more money than AI), I think they will likely remain culturally empowered.” I think we agree that a reasonable model here is one where the cultural and the economic are tightly coupled, but I don’t see why that means they won’t both go off the rails. You seem to think that they are almost guaranteed to feed back on each other in a way that maintains human power, but I think it can easily go the opposite way.
Regarding political disempowerment, you state: “It’s hard to see how those leading the state and the top AI companies could be disempowered, absent misalignment.” Personally, I find this quite easy. Insufficient elite coordination is one mechanism (discussed below). But reality can also just be unfriendly to you and force you to make choices about how you prioritize long-term vs. short-term objectives, leading people to accept deals like: “I'll be rich and powerful for the next hundred years, and then my AI will take over my domain and do as it pleases”. Furthermore, if more people take such deals, this creates pressure for others to do so as well, since you need to get power in the short term in order to remain “solvent” in the long term, even if you aren’t myopic yourself. I think this is already happening; the AI arms race is burning the commons every day; I don’t expect it to stop.
Regarding elite coordination, I also looked at the list under the heading “Sceptic: Why don’t the elites realise what’s happening and coordinate to stop it?” Another important reason not mentioned is that cooperating usually produces a bargaining game where there is no clearly correct way to split the proceeds of the cooperation.
Thanks!
> It feels like this statement should be qualified more... this is holding GD to a higher bar
Yeah fair. I've edited to qualify it more.
> The analysis of economic disempowerment seems to take place in a vacuum, ignoring one of the main arguments we make, which is that different forms of disempowerment can mutually reinforce each other. The most concerning version of this, I think, is not just “we don't get UBI”, but rather that the memes that say “it's good to hand over as much power as quickly as possible to AI” win the day.
Yeah, it's a bit tricky to know how to structure the argument when you're saying that the 3 domains all mutually reinforce each other. Like, after reading the paper it was unclear to me why the 3 domains don't mutually reinforce each other to remain good, given that they start good. The order in which the paper's sections appear, and the arguments within, suggested that the main mechanism was:
And so a natural response (which I gave) is:
But yeah, I agree cultural shifts favouring handoff will happen even absent economic disempowerment, I think just driven by ordinary economic competition. And if we hand off to sufficiently misaligned AI, we're screwed. Assuming AI is aligned enough that it never seeks power, I'm not sure how worried we should be about handoff. But plausibly we should demand a higher alignment bar than that.
Anyway, to my mind this argument is better understood as "competitive pressure to hand off to misaligned AI" than as an interplay between economic and cultural and state disempowerment, but I do buy it.
> The analysis of cultural disempowerment goes one step “worse”, arguing that “If humans remain economically empowered (in the sense of having much more money than AI), I think they will likely remain culturally empowered.” I think we agree that a reasonable model here is one where the cultural and the economic are tightly coupled, but I don’t see why that means they won’t both go off the rails. You seem to think that they are almost guaranteed to feed back on each other in a way that maintains human power, but I think it can easily go the opposite way.
Yeah, to clarify, I don't feel it's guaranteed to maintain human power. Overall, I feel like "yeah, I guess maybe that could happen, though none of the mechanisms you mention seem that convincing, and there seem to be counter-considerations, and humans will have a strong incentive to keep power if they can, and (hopefully!) truthful AI advice to help them, and myopically aligned AIs to implement things to help... and also we can see this playing out in real time and respond, so I'm not sure it's worth focussing on in advance (though I agree it will be worth focussing on while it's happening)".
Do you think that, absent AI power-seeking, this dynamic is highly likely to lead to human disempowerment? (If so, then I disagree.)
> Regarding political disempowerment, you state: “It’s hard to see how those leading the state and the top AI companies could be disempowered, absent misalignment.” Personally, I find this quite easy. Insufficient elite coordination is one mechanism (discussed below). But reality can also just be unfriendly to you and force you to make choices about how you prioritize long-term vs. short-term objectives, leading people to accept deals like: “I'll be rich and powerful for the next hundred years, and then my AI will take over my domain and do as it pleases”. Furthermore, if more people take such deals, this creates pressure for others to do so as well, since you need to get power in the short term in order to remain “solvent” in the long term, even if you aren’t myopic yourself. I think this is already happening; the AI arms race is burning the commons every day; I don’t expect it to stop.
I said "absent misalignemnt", and I think your story involves misalignment? Otherwise the human could hand off to AI that represents their interests. Clearly there's a problem with handoff if AI seeks power. And i agree it seems bad if AI doesn't seek power that but also won't represent human interests as it governs. Though i feel a bit confused about how humans are never able to coordinate to reign it all back in if AIs aren't seeking power.
Thanks!
> Do you think that, absent AI power-seeking, this dynamic is highly likely to lead to human disempowerment? (If so, then I disagree.)
As a sort-of answer, I would just say that I am concerned that people might knowingly and deliberately build power-seeking AIs and hand over power to them, even if we have the means to build AIs that are not power-seeking.
> I said "absent misalignment", and I think your story involves misalignment?
It does not. The point of my story is: "reality can also just be unfriendly to you". There are trade-offs, and so people optimize for selfish, short-term objectives. You could argue people already do that, but cranking up the optimization power without fixing that seems likely to be bad.
My true objection is more that I think we will see extreme safety/performance trade-offs due to technical inadequacies -- i.e. (roughly) the alignment tax is large (although I don't like that framing). In that case, you have misalignment despite also having a solution to alignment: competitive pressures prevent people from adopting the solution.
I see the gradual disempowerment story as a simple outside-view-flavoured reason why things could go badly for many people. I think it’s outside-view flavoured because it’s a somewhat direct answer to “well, things seem to have been getting better for people so far”. While, as you point out, misalignment seems to make the prospects much worse, it’s worth bearing in mind also that the economic irrelevance of people strongly supports the case for bad outcomes from misalignment. If people remained economically indispensable, even fairly serious misalignment could have non-catastrophic outcomes.
Someone I was explaining it to described it as “indefinite pessimism”.
> If people remained economically indispensable, even fairly serious misalignment could have non-catastrophic outcomes.
Good point. Relatedly, even the most terribly misaligned governments mostly haven't starved or killed a large fraction of their citizens. In this sense, we already survive misaligned superintelligence on a regular basis. But only when, as you say, people remain economically indispensable.
> Someone I was explaining it to described it as “indefinite pessimism”.
I think this is a fair criticism, in the sense that it's not clear what could make us happy about the long-term future even in principle. But to me, this is just what being long-term agentic looks like! I don't understand why so many otherwise-agentic people I know seem content to YOLO it post-AGI, or seem to be reassured that "the AGI will figure it out for us".
I didn’t mean it as a criticism, more as the way I understand it. Misalignment is a “definite” reason for pessimism - and therefore one can be somewhat doubtful about whether it will actually play out. Gradual disempowerment is less definite about what actual form the problems may take, but it is also a more robust reason to think there is a risk.
Oh, makes sense. Kind of like Yudkowsky's arguments about how you don't know how a chess master will beat you, just that they will. We also can't predict exactly how a civilization will disempower its least productive and sophisticated members. But a fool and his money are soon parted, except under controlled circumstances.
> There’s still a real puzzle about why Xi/Trump/CEOs can’t coordinate here after they realise what’s happening.
> - Maybe it’s unclear even to superintelligent AIs where this will lead, but it in fact leads to disempowerment. Or maybe the AIs aren’t aligned enough to tell us it’s bad for us.
I agree that having truthful, aligned AGI advisors might be sufficient to avoid coordination failures. But then again, why do current political leaders regularly appoint or listen to bad advisors? Steve Byrnes had a great list of examples of this pattern, which he calls "conservation of wisdom".
Thanks for linking to that comment - great stuff.
I think this might be a case where 'absolute disempowerment' and 'everyone dying' come apart from 'relative disempowerment' and 'we get a much worse future than we could have done'. It seems more plausible that AI advisors won't sufficiently forewarn us about the latter.
> why not deploy aligned AI that makes as much money as possible and then uses it for your interests? maybe the successionism means people choose not to? (Seems weak!)
For the non-rich, one way or another, they'll quickly end up back in Malthusian competition with beings that are more productive, and have much more reproductive flexibility than them.
For the oligarchs / states, as long as human reproduction remained slow, they could easily use a small amount of their fortunes to keep humanity alive. But there are so many possible forms of successionism that I expect at least one of them to be more appealing to a given oligarch / government than letting humans-as-they-are continue to consume substantial physical resources. E.g.:
Thanks, these are interesting ideas, though I'm not quite seeing the connection to my point.
I was imagining some fraction of people want to use all their current wealth to bring about a good long-run future. Strategy: they invest it on the market and eventually spend the proceeds on whatever is best. They don't reproduce (as that would cost money, which they want to invest).
Maybe they upload themselves to live indefinitely if that's needed to know how to best spend the money, but it will be cheap. Tbh, even staying alive indefinitely in biological form will likely be very cheap in absolute terms for someone with decent savings today.
Malthusian dynamics don't seem to block this strategy.
I agree Malthusian dynamics are important to think about post-AGI, and I like your examples of ways that you could technically 'keep humans alive' more cheaply. Though I'm not sure if you fully appreciate how cheap it would be to keep all current humans around and limit reproduction to 8 babies per person. (Longer term you'll need stricter limits.) For some rough intuition, see the sketch below.
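As a back-of-envelope version of that cheapness claim, here is a minimal sketch; all the figures are my own illustrative assumptions, not numbers from this thread:

```python
# Rough cost of keeping all current humans alive at a subsistence floor.
# All figures are illustrative assumptions, not claims from the discussion.

SUBSISTENCE_COST_PER_PERSON = 1_000   # assumed USD per person per year
CURRENT_POPULATION = 8e9              # roughly today's world population
GROSS_WORLD_PRODUCT = 1e14            # roughly today's GWP in USD (~$100T)

annual_cost = SUBSISTENCE_COST_PER_PERSON * CURRENT_POPULATION
share_of_gwp = annual_cost / GROSS_WORLD_PRODUCT

print(f"Annual cost: ${annual_cost:,.0f} ({share_of_gwp:.0%} of today's GWP)")
# Prints: Annual cost: $8,000,000,000,000 (8% of today's GWP)
# In a post-AGI economy whose output vastly exceeds today's GWP, this share
# shrinks accordingly, which is the sense in which keeping humans around is cheap.
```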
> I buy you could get radical cultural changes. [...] But stuff as big as in this story feels unlikely. Often culture changes radically because the older generation dies off, but that won’t happen here.
Good point, but imo old people's influence mostly wanes well before they die, as they become unemployed, out-of-touch, and isolated from the levers of cultural production and power. Which is what we're saying will happen to almost all humans, too.
Another way that culture changes radically is through mass immigration, which will also effectively happen as people spend more time interacting with effectively more-numerous AIs.
> - Even if AIs do earn wages, those wages may be driven down to subsistence levels via Malthusian dynamics (you can quickly make more compute) so that human income from capital assets dominates AI income.
Why does it matter whether AIs' wages are subsistence-level? This seems to prove too much, e.g. "monkeys won't be threatened by human domination of the economy, since the humans will just reproduce until they're at subsistence level."
> - Even if AIs earn significant non-subsistence wages, humans can easily tax that income at >50% and give it to humans.
Maybe - but taxing machine income seems to me to be similarly difficult to taxing corporate income. As a machine, you have many more options to form a legal super-organization and blur the lines between consumption, trade, employment, and capex.
> Why does it matter whether AIs' wages are subsistence-level? This seems to prove too much, e.g. "monkeys won't be threatened by human domination of the economy, since the humans will just reproduce until they're at subsistence level."
I think it matters because AIs won't be able to save any money. They'll spend all their wages renting compute to run themselves on. So it blocks problems that stem from AI having more disposable income and thereby weighing heavily on economic demand signals.
But perhaps you're worried about other problems. E.g. the economy might still be geared towards creating and running AI chips, even if AI lacks disposable income. But the hope would be that, if humans control all disposable income, then the economy is only geared towards chips to the extent that's instrumentally helpful for humans.
Re monkeys, a disanalogy is that none of the human economy is geared towards providing monkeys with goods and services. Whereas that will be the case with AI.
> form a legal super-organization and blur the lines between consumption, trade, employment, and capex
Interesting idea that AI systems could blur the lines between their own consumption and "spending that's instrumentally useful for producing goods and services". That does seem like a distinctive and interesting way to tip the economy towards AI preferences, and one that would dodge tax. (Though it's a problem even if AIs aren't paid wages. To the extent that AIs are paid wages and are thereby economically empowered, taxation still seems like a good solution to me.)
> I think it matters because AIs won't be able to save any money. They'll spend all their wages renting compute to run themselves on. So it blocks problems that stem from AI having more disposable income and thereby weighing heavily on economic demand signals.
This doesn't make sense to me, and sounds like it proves too much - something like "Corporations can never grow because they'll spend all their revenue on expenses, which will be equal to revenue due to competition". Sometimes AIs (or corporations) will earn more than their running costs, and invest those in growth, and end up with durable advantages due to things such as returns to scale or network effects.
In the absolute Malthusian limit, AI won't earn more than its running costs and can't save. So if we expect AI wages to approach that limit, that seems like a strong reason not to expect that humans will keep more money than AI.
But yeah, wages probably won't be completely up against that limit in practice, just as they haven't been for humans throughout history. But I think they might be pretty close.
Hmmm, maybe we got mixed up somewhere along the way, because I was also trying to argue that humans won't keep more money than AI in the Malthusian limit!
Epistemic status: very rough! I spent a couple of days reading the Gradual Disempowerment paper and thinking about my view on it. I won’t spend longer on this, so am sharing rough notes as-is.
I think most people who have read this far shouldn’t read on! The rest of this post is rough “thinking out loud” notes.
I discuss:
Ok, stepping back, am I convinced by my own steelman?
Could all this go through if AI alignment is solved? Let me try to argue for that conclusion.
Stepping back, is this convincing?