AI #132 Part 2: Actively Making It Worse

[-]denkenberger2mo40

I can strongly confirm that few of the people worried about AI killing everyone, or EAs that are so worried, favor a pause in AI development at this time, or supported the pause letter or took other similar actions.
An especially small percentage (but not zero!) would favor any kind of unilateral pause, either by Anthropic or by the West, without the rest of the world.
>Holly Elmore (PauseAI): It's kinda sweet that PauseAI is so well-represented on twitter that a lot of people >think it *is* the EA position. Sadly, it isn't.
>The EAs want Anthropic to win the race. If they wanted Anthropic paused, Anthropic would kick those >ones out and keep going but it would be a blow.

I tried to get at this issue with polls on EA Forum and LW. For EAs, 26% want to stop or pause AI globally, 13% want to pause it even if only done unilaterally. I would not call this an especially small percentage.

My summary for EAs was: "13% want AGI never to be built, 26% said to pause AI now in some form, and another 21% would like to pause AI if there is a particular event/threshold. 31% want some other regulation, 5% are neutral and 5% want to accelerate AI in a safe US lab. So if I had to summarize the median respondent, it would be strong regulation for AI or pause if a particular event/threshold is met. There appears to be more evidence for the claim that EA wants AI to be paused/stopped than for the claim that EA wants AI to be accelerated."

My summary for LW was: "the strongest support is for pausing AI now if done globally, but there's also strong support for making AI progress slow, pausing if disaster, and pausing if greatly accelerated progress. There is only moderate support for shutting AI down for decades, and near zero support for pausing if high unemployment, pausing unilaterally, and banning AI agents. There is strong opposition to never building AGI. Of course there could be large selection bias (with only ~30 people voting), but it does appear that the extreme critics saying rationalists want to accelerate AI in order to live forever are incorrect, and also the other extreme critics saying rationalists don't want any AGI are incorrect. Overall, rationalists seem to prefer a global pause either now or soon."

[-]Radford Neal2mo40

"Meta is controlled purely by Zuckerberg and xAI follows the whims of Musk."

Isn't this actually a comparatively good situation? As far as I know, neither of these people wants to die, so if it comes to an existential crunch, they might make decisions that avoid dying. Compare that with amorphous control by corporate beaurocracy, in which no invididual human can manage to shift the decision...

[-]Erich_Grunewald2mo30

If you think that an AI developer can do more harm than good on the margin, e.g., because you can unilaterally push the frontier by deploying a model but you cannot unilaterally pause, and other similar asymmetries, then you may favour lower variance in the policies of AI developers. It seems likely to me that individual control increases policy variance, and so that is a reasons to favour distributed/diffused control over AI developers.

It also seems empirically that individually-controlled AI developers (Meta, xAI, DeepSeek) are worse on safety than more diffusely controlled ones (OpenAI, Anthropic, Google DeepMind), which may suggest there are selection processes that cause that generally. For example, maybe those individuals tend to be especially risk-taking, or optimistic on safety, etc.

[-]Radford Neal2mo30

I agree that individual control increases policy variance, which was sort of my point. Whether that's good or not seems to me to depend on what the default course of events is. If you think things are headed in a good direction, then low variance is good. But if the default course is likely to be disastrous, high variance at least provides a chance.

I don't understand your point about asymmetry. Doesn't that tend to make the default course bad?

[-]Erich_Grunewald2mo20

I don't understand your point about asymmetry. Doesn't that tend to make the default course bad?

What I meant was, imagine two worlds:

Individual Control, where AI developers vary wildly in their approach to risk, safety, and deployment
Diffused Control, where AI developers tend to take similar approaches to risk, safety, and deployment

If in scenario A risk-reducing actions reduce risk as much as risk-increasing actions increase risk (i.e., payoffs are symmetrical), then these two worlds have identical risk. But if in scenario B payoffs are symmetrical (i.e., these companies are more able to increase risk than they are to decrease risks), then the Diffused Control world has lower overall risk. A single reckless outlier can dominate the outcome, and reckless outliers are more likely in the Individual Control world.

Does that make the default course bad? I guess so. But if it is true, it implies that having AI developers controlled by individuals is worse than having them run by committee.

[-]StanislavKrym2mo30

We would also need to account for the possibility that an AI researcher at Meta or xAI prompts an actual leader to race harder (think of DeepCent's role in the AI-2027 forecast) or comes up with a breakthrough, initiates the explosion and ends up with Agent-4 who is misaligned and Agent-3 who doesn't catch Agent-4 because xAI's safety team doesn't have a single human competent enough. If this happens, then the company is never oversighted, races as hard as it can and dooms mankind.

However, if Agent-4 is caught, but P(OC member votes for slowdown) is smaller than 0.5 due to the evidence being inconclusive, then the more members the OC has, the bigger p(doom) is. On the other hand, this problem may be arguably solved by adopting the liberum veto on trusting any model...

So a big safety team is good for catching Agent-4, but may be bad for deciding whether it is guilty.

[-]StanislavKrym2mo40

‘Why AI Overregulation Could Kill the World’s Next Tech Revolution.’

At the time of writing the link is broken. Please correct it.

P.S. @habryka, this is another case when using automated tools is justified: they could scan posts and comments for broken links and report them to the authors.

[-]habryka2mo50

I agree! Would be good to do automatic link checking, and ideally even automatic link-backuping.

[-]denkenberger2mo30

The replies are full of people pointing out the ‘two grids’ claim is simply not true. Why is the Secretary of Energy coming out, over and over again, with this bold anti-energy stance backed by absurdly false claims and arguments?
Solar power and batteries are the future unless and until we get a big breakthrough. If we are sabotaging American wind and solar energy, either AGI shows up quickly enough to bail us out, our fusion energy projects bear fruit and hyperscale very quickly or we are going to lose. Period.

Intermittent renewable energy alone does require a grid to support it. It is possible that wind and solar can be cheaper than the variable cost of conventional power plants, but it's not yet in most places without subsidy. One could theoretically replace the current system with wind plus solar plus batteries, but it would be crazy expensive. Either you have to build the wind and solar far larger and waste most of the energy and still need batteries overnight, or you need something like days of battery storage, which is very expensive. Now you could use the excess electricity from the overbuilding scenario to make hydrogen, but hydrogen is also a long way from being economical. So the thing we could do economically at current prices is pumped hydropower for storage (geographically constrained) or underground compressed air energy storage (somewhat geographically constrained, but saline aquifers are very common and the US stores a lot of natural gas seasonally that way). These have low enough storage cost to be feasible for days worth of storage. Or we could do fission (yes, I know, public perception and regulations, but it's not clear that fusion would be much better).