New Answer

New Comment

3 Answers sorted by
top scoring

Aug 31, 2021

Boeing MCAS (https://en.wikipedia.org/wiki/Maneuvering_Characteristics_Augmentation_System) is blaimed by more than 100 deaths. How much "AI" would a similar system need to include for a similar tragedy to count as "an event precipitated by AI"?

[-]CharlesD4y10

Great point - I'm not sure if that contained aspects which are similar enough to AI to resolve such a question. This source doesn't think it counts as AI (though it doesn't provide much of an argument for this) and I can't find reference to machine learning or AI on the MCAS page, though clearly one could use AI tools to develop an automated control system like this and I don't feel well positioned to judge whether it should count.

3Anon User4y

To clarify - I do not think MCAS specifically is an AI based system, I was just thinking of a hypothetical future similar system that does include a weak AI component, but where, similarly to ACAS the issue is not so much with the flaw in AI itself, but in how it is being used in a larger system. In other words, I think your test needs to make a distinction between a situation where one needed a trustworthy AI, and the actual AI was unintentionally/unexpectedly untrustworthy vs a situation where perhaps the AI performed reasonably well, but the use of AI was problematic, causing a disaster anyway.

Zac Hatfield-Dodds

Sep 01, 2021

Such scenarios are at best smoke, not fire alarms.

When I observe that there’s no fire alarm for AGI, I’m not saying that there’s no possible equivalent of smoke appearing from under a door.

What I’m saying rather is that the smoke under the door is always going to be arguable; it is not going to be a clear and undeniable and absolute sign of fire; and so there is never going to be a fire alarm producing common knowledge that action is now due and socially acceptable. ...

There is never going to be a time before the end when you can look around nervously, and see that it is now clearly common knowledge that you can talk about AGI being imminent, and take action and exit the building in an orderly fashion, without fear of looking stupid or frightened.

[-]CharlesD4y30

The article convincingly makes the weaker claim that there's no guarantee of a fire alarm, and provides several cases which support this. I don't buy the claim (which the article also tries to make) that there is no possible fire alarm, and such a claim seems impossible to prove anyway.

Whether it's smoke or a fire alarm, that doesn't really address the specific question I'm asking, in any case.

niplav

Aug 31, 2021

AI systems find ways to completely manipulate some class of humans, e.g. by making them addicted. Arguably, this is already happening on a wider scale to a smaller amount – people becoming “addicted” to algorithmically generated feeds.

Maybe the question could be concretized to the amount of time people spend on their devices on average?

[-]CharlesD4y20

That seems like a different question which is partially entangled with AI but not necessarily, as more screen time doesn't necessarily need to be caused by AI, and the harms are harder to evaluate (even the sign of the value of "more screen time" is probably disputed).

1 comment, sorted by

top scoring

Click to highlight new comments since: Today at 11:06 PM

[-]Charlie Steiner4y60

Some high-profile failures I think we won't get are related to convergent goals, such as acquiring computing power, deceiving humans into not editing you, etc. We'll probably get examples of this sort of thing in small scale experiments, that specialists might hear about, but if an AI that's deceptive for instrumental reasons causes $1bn in damages I think it will be rather too late to learn our lesson.

Moderation Log

LESSWRONG
is fundraising!
LW

LESSWRONG
is fundraising!
LW

14

[ Question ]

What could small scale disasters from AI look like?

14

14

3 Answers sorted by
top scoring

Aug 31, 2021

Sep 01, 2021

Aug 31, 2021

14

[ Question ]

What could small scale disasters from AI look like?

14

14

3 Answers sorted by top scoring

Aug 31, 2021

Sep 01, 2021

Aug 31, 2021

3 Answers sorted by
top scoring