Risks from AI and Charitable Giving

[-]Scott Alexander14y480

Imagine a group of 100 world-renowned scientists and military strategists. Could such a group easily wipe away the Roman empire when beamed back in time?

Imagine a group of 530 Spaniards...

At the risk of confirming every negative stereotype RationalWiki and the like have of us...have you read the Sequences? I'm reluctant to write a full response to this, but I think large parts of the Sequences were written to address some of these ideas.

[-]Brihaspati14y100

I'm afraid I had the same reaction. XiXiDu's post seems to take the "shotgun" approach of listing every thought that popped into XiXiDu's head, without applying much of a filter. It's exhausting to read. Or, as one person I know put it, "XiXiDu says a lot of random shit."

5Bugmaster14y

I understand what you're saying, but, speaking from a strictly nitpicky perspective, I don't think the situation is analogous. The Roman Empire had many more soldiers to throw at the problem; much more territory to manage; comparatively better technology; and, perhaps more importantly, a much more robust and diverse -- and therefore memetically resistant -- society. They would therefore fare much better than the Aztecs did.

4Thomas14y

Conquistadors climbed to the top of a volcano to harvest sulphur for ammunition production. You can count on uploads in our society, as on some Navy Seals sent into the Roman world, to do analog actions. They both would not just wait for the help from nowhere. They would improvise as conquistadors once did.

1Bugmaster14y

Understood, but there's only so much the conquistadors can do even with gunpowder. Guns can do a lot of damage against bronze swords and armor, but if they have more soldiers than you have bullets, then you'll still lose. Of course, if the conquistadors could build a modern tank, they'd be virtually invincible. But in order to do that, they'd need to smelt steel, vulcanize rubber, refine petroleum, manufacture electronics, etc. Even if they had perfect knowledge of these technologies, they couldn't duplicate them in ye olde Aztec times, because such technologies require a large portion of the world's population to be up to speed. There's a limit to how much you can do armed with nothing but a pocket knife and a volcano. I think this was XiXiDu's point: knowledge alone is not enough, you also need to put in a lot of work (which is often measured in centuries) in order to apply it.

4Thomas14y

Understood that, too! But one can optimize and outsource a lot. Conquistadors employed Indians, enslaved Aztecs and Incas. Besides, the subjective time of an upload can be vast. A good idea can trim a lot of work need to be done. And at least my upload would be full of ideas.

1Bugmaster14y

Agreed; just as a single conquistador -- or better yet, a modern engineer -- transported into the Roman Empire would be full of ideas. He would know how to forge steel, refine petroleum, design electronic circuits, genetically engineer plants and animals, write software, plus many other things. But he wouldn't be able to actually use most of that knowledge. In order to write software, you need a computer. In order to build a computer, you need... well, you need a lot of stuff that outsourced Aztec (or Roman) slaves just wouldn't be able to provide. You could enslave everyone on the continent, and you still wouldn't be able to make a single CPU. Sure, if you were patient, very lucky, and long-lived, you could probably get something going within the next century or so. But that's hardly a "FOOM", and the Romans would have a hundred years to stop you, if they decided that your plans for the future aren't to their liking.

1Thomas14y

Exactly. And here the parable breaks down. The upload just might have those centuries. Virtual subjective time of thousands of years to devise a cunning plan, before we the humans even discuss their advantage. Yudkowsky has wrote a short story about this. http://lesswrong.com/lw/qk/that_alien_message/

3asr14y

Bugmaster's point was that it takes a century of action by external parties, not a century of subjective thinking time. The timetable doesn't get advanced all that much by super-intelligence. Real-world changes happen on real-world timetables. And yes, the rate of change might be exponential, but exponential curves grow slowly at first. And meanwhile, other things are happening in that century that might upset the plans and that cannot be arbitrarily controlled even by super-intelligence.

0JohnWittle14y

Err... minor quibble. Exponential curves grow at the same rate all the time. That is, if you zoom in on the x^2 graph at any point at any scale, it will look exactly the same as it did before you zoomed in.

0asr14y

I think we are using "rate" in different ways. The absolute rate of change per unit time for an exponential is hardly constant; If you look at the segment of e^x near, say, e^10, it's growing much faster than it is at e^(-10).

0Bugmaster14y

asr got my point exactly right.

-2Anubhav14y

Guns? I thought horses were their main advantage. (What are the Aztecs gonna do, burn down all the grass in the continent?)

0Bugmaster14y

The OP used gunpowder as the example, so I went with it. You might be right about horses, though.

3wedrifid14y

He's read them well enough to collect a fairly complete index of cherry picked Eliezer quotes to try to make him look bad. I don't think lack of exposure to prerequisite information is the problem here.

1gwern14y

The index wedrifid was alluding to, if anyone cares: http://shityudkowskysays.tumblr.com/

7wedrifid14y

I actually loved reading it. Some of those are up there among my favorite EY quotes. Arrogant, sometimes needing context to make them make sense and sometimes best left unsaid for practical reasons but still brilliant. For example: There is also a quote there that I agree should remain visible, to Eliezer's shame, until such time that he swallows his ego and publicly admits that it was an utterly idiotic way to behave. Then there is at least one quote which really deserves a disclaimer in a footnote - that EY has already written an entire sequence on admitting how stupid he was to think the way he thought when he wrote it! I was actually rather disappointed when the list only went for a page or two. I was looking forward to reading all the highlights and lowlights. He deserves at least a few hundred best of and worst of quotes!

2gwern14y

There's always sorting in http://www.ibiblio.org/weidai/lesswrong_user.php?u=Eliezer_Yudkowsky

-2XiXiDu14y

By following the link below the quote people could learn that he claims that he doesn't agree with what he wrote there anymore. But I added an extra disclaimer now.

-1Simon Fischer14y

Thanks for making me find out what the Roko-thing was about :(

-16XiXiDu14y

-14XiXiDu14y

-17XiXiDu14y

[-]gwern14y430

P1 Fast, and therefore dangerous, recursive self-improvement is logically possible.

All your counter-arguments are enthymematic; as far as I can tell, you are actually arguing against a proposition which looks more like

P1 Recursive self-improvement of arbitrary programs towards unalterable goals is possible with very small constant factors and P or better general asymptotic complexity

I would find your enthymematic far more convincing if you explained why things like Goedel machines are either fallacious or irrelevant.

P1.b The fast computation of a simple algorithm is sufficient to outsmart and overpower humanity.

Your argument is basically an argument from fiction; it's funny that you chose that example of the Roman Empire when recently Reddit spawned a novel arguing that a Marine Corps (surely less dangerous than your 100) could do just that. I will note in passing that black powder's formulation is so simple and famous that even I, who prefers archery, knows it: saltpeter, charcoal, and sulfur. I know for certain that the latter two are available in the Roman empire and suspect the former would not be hard to get. EDIT: and this same day, a Mafia-related paper I was read... (read more)

7Bugmaster14y

I disagree with the gist of your comment, but I upvoted it because this quote made me LOL. That said, I don't think that XiXiDu is claiming that computers can't exhibit creativity, period. Rather, he's saying that the kind of computers that SIAI is envisioning can't exhibit creativity, because they are implicitly (and inadvertently) designed not to.

2asr14y

You are arguing past each-other. XiXiDu is saying that a programmer can create software that can be inspected reliably. We are very close to having provably-correct kernels and compilers, which would make it practical to build reliably sandboxed software, such that we can look inside the sandbox and see that the software data structures are what they ought to be. It is separately true that not all software can be reliably understood by static inspection, which is all that the underhanded C contest is demonstrating. I would stipulate that the same is true at run-time. But that's not the case here. Presumably developers of a large complicated AI will design it to be easy to debug -- I don't think they have much chance of a working program otherwise.

4gwern14y

No, you are ignoring Xi's context. The claim is not about what a programmer on the team might do, it is about what the AI might write. Notice that the section starts 'The goals of an AI will be under scrutiny at any time...'

0asr14y

Yes. I thought Xi's claim was that if you have an AI and put it to work writing software, the programmers supervising the AI can look at the internal "motivations", "goals", and "planning" data structures and see what the AI is really doing. Obfuscation is beside the point.

3Bugmaster14y

I agree with you and XiXiDu that such observation should be possible in principle, but I also sort of agree with the detractors. You say, Oh, I'm sure they'd try. But have you ever seen a large software project ? There's usually mountains and mountains of code that runs in parallel on multiple nodes all over the place. Pieces of it are usually written with good intentions in mind; other pieces are written in a caffeine-fueled fog two days before the deadline, and peppered with years-old comments to the extent of, "TODO: fix this when I have more time". When the code breaks in some significant way, it's usually easier to write it from scratch than to debug the fault. And that's just enterprise software, which is orders of magnitude less complex than an AGI would be. So yes, it should be possible to write transparent and easily debuggable code in theory, but in practice, I predict that people would write code the usual way, instead.

-5XiXiDu14y

-16XiXiDu14y

-8XiXiDu14y

-11XiXiDu14y

[-]Larks14y220

It would be helpful if you summarised the premises in a short list. At the moment one has to do a lot of scrolling.

Edit: Actually, I think it would be a very good idea; their not being writen out together makes it easy to miss the fact that they're not all necessary, some imply others, and that they basically don't cut reality at its joints. You assert that these are all necessary and logically separate premises. Yet P4 is clearly not necessary for FOOM - something not being the default outcome does not mean it will not happen. P3 implies P2, and P2 implies P1. And P5 is clearly not necessary either - FOOM could occur in a thousand years time.

And again with the second set of premises - they are clearly not distinct, and not all necessary. For example,

P6 - SIAI will solve FAI

is not necessary; they might succeed by preventing anyone else from developing GAI.

P7 SIAI does not increase risks from AI.

If you mean net, then yes. But otherwise, it's perfectly possible that they might speed up UFAI and AI, and yet still be a good thing, if the latter outweighs the former.

and

P9 It makes sense to support SIAI at this time

is the conclusion of the argument! This premise alone is suf... (read more)

2billswift14y

That is actually my argument against a lot of philosophy; arguments embedded in a lot of prose are unnecessarily hard to follow. Arguments, at least ones that you actually expect to be capable of changing someone's mind, should be presented as clearly and schematically as possible. Otherwise it looks a lot like "baffle them with bullshit."

[-]WrongBot14y200

Reading this made my brain hurt. It's a pile of false analogies that ignores the best arguments disagreeing with it, which is particularly ironic in light of the epigraph. (I'm thinking of Chalmers specifically, but really you can take your pick.)

I'm tempted to go through and point out every problem with this post, but I noticed at least a dozen on my first read-through and I just don't have the time.

Posts arguing against the LW orthodoxy deserve disproportional attention and consideration to combat groupthink, but this is just too wrong for me to tolerate.

[-]Will_Newsome14y100

But since nobody else seems to be willing to take a critical look at the overall topic

What I take a critical look at and what I write about in public are two very, very different things. Your audience is more heterogeneous than you might think.

[-]timtyler14y70

You (XiXiDu) don't seem to think that intelligent machines are likely to be that big a deal. The switch to an engineered world is likely to be the biggest transformation of the planet since around the evolution of sex - or the last genetic takeover. It probably won't crash civilisation - or kill all humans - but it's going to be an enormous change, and the deatils of how it goes down could make a big difference to everyone. I do sometimes wonder whether you get that.

[-]Kaj_Sotala14y50

Good post.

You seem to excessively focus on recursive self-improvement to the exclusion of other hard takeoff scenarios, however. As Eliezer noted,

RSI is the biggest, most interesting, hardest-to-analyze, sharpest break-with-the-past contributing to the notion of a "hard takeoff" aka "AI go FOOM", but it's nowhere near being the only such factor. The advent of human intelligence was a discontinuity with the past even without RSI...

That post mentions several other hard takeoff scenarios, e.g.:

Even if an AI's self-improvement effort

... (read more)

2Giles14y

Ah, thanks for making this point - I notice I've recently been treating "recursive self improvement and "hard takeoff" as more or less interchangeable concepts. I don't think I need to update on this, but I'll try and use my language more carefully at least.

2XiXiDu14y

Thanks. I will review those scenarios. Just some quick thoughts: On first sight this sounds suspicious. The genetic difference between a chimp and a human amounts to about ~40–45 million bases that are present in humans and missing from chimps. And that number is irrespective of the difference in gene expression between humans and chimps. So it's not like you're adding a tiny bit of code and get a superapish intelligence. The argument from the gap between chimpanzees and humans is interesting but can not be used to extrapolate onwards from human general intelligence. It is pure speculation that humans are not Turing complete and that there are levels above our own. That chimpanzees exist, and humans exist, is not a proof for the existence of anything that bears, in any relevant respect, the same relationship to a human that a human bears to a chimpanzee. Humans can process long chains of inferences with the help of tools. The important question is if incorporating those tools into some sort of self-perception, some sort of guiding agency, is vastly superior to humans using a combination of tools and expert systems. In other words, it is not clear that there does exist a class of problems that is solvable by Turing machines in general, but not by a combination of humans and expert systems. If an AI that we invented can hold a complex model in its mind, then we can also simulate such a model by making use of expert systems. Being consciously aware of the model doesn't make any great difference in principle to what you can do with the model. Here is what Greg Egan has to say about this in particular:

5Kaj_Sotala14y

The quote from Egan would seem to imply that for (literate) humans, too, working memory differences are insignificant: anyone can just use pen and paper to increase their effective working memory. But human intelligence differences do seem to have a major impact on e.g. job performance and life outcomes (e.g. Gottfredson 1997), and human intelligence seems to be very closely linked to - though admittedly not identical with - working memory measures (e.g. Oberauer et al. 2005, Oberauer et al. 2008).

7XiXiDu14y

I believe that what he is suggesting is that if you reached a certain plateau then intelligence hits diminishing returns. Would Marilyn vos Savant be proportionally more likely to take over the world, if she tried to, than a 115 IQ individual? Some anecdotal evidence: Is there evidence that a higher IQ is useful beyond a certain level? The question is not just if it is useful but if it would be worth the effort it would take to amplify your intelligence to that point given that your goal was to overpower lower IQ agent's. Would a change in personality, more data, a new pair of sensors or some weapons maybe be more useful? If so, would an expected utility maximizer pursue intelligence amplification? (A marginal note, bigger is not necessarily better.)

[-]Vaniver14y110

I upvoted for the anecdote, but remember that you're referring to von Neumann, who invented both the basic architecture of computers and the self-replicating machine. I am not qualified to judge whether or not those are as original as relativity, but they are certainly big.

8Rhwawn14y

Sure. She's demonstrated that she can communicate successfully with millions and handle her own affairs quite successfully, generally winning at life. This is comparable to, say, Ronald Reagan's qualifications. I'd be quite unworried in asserting she'd be more likely to take over the world than a baseline 115 person.

1timtyler14y

Surely humans are Turing complete. I don't think anybody disputes that. We know that capabilities extend above our own in all the realms where machines already outstrip our capabilities - and we have a pretty good idea what greater speed, better memory and more memory would do.

3CarlShulman14y

Agree with your basic point, but a nit-pick: limited memory and speed (heat death of the universe, etc) put many neat Turing machine computations out of reach of humans (or other systems in our world) barring new physics.

2timtyler14y

Sure: I meant in the sense of the "colloquial usage" here:

[-]NancyLebovitz14y40

Thanks for working this up.

However, it leads me to thinking about a modest FOOM. What's the least level of intelligence needed for a UFAI to be an existential risk? What's the least needed for it to be extremely deadly, even if not an existential risk?

6XiXiDu14y

What makes you think that it takes a general intelligence? Automatic scientists, with well-defined goals, that can brute-force discoveries on hard problem in bio and nanotech could enable unfriendly humans to wreck havoc and control large groups of people. If we survive that, which I think is the top risk rather than GAI, then we might at some point be able to come up with an universal artificial intelligence. Think about it this way. If humans figure out how to create some sort of advanced narrow AI that can solve certain problems in a superhuman way, why would they wait and not just assign it directly to solving those problems? The problem is that you can't make such narrow AI's "friendly", because they are tools and not agents. Tools used by unfriendly humans. Luckily there is a way to impede the consequences of that development and various existential risks at once. What we should be working on is a global sensor network by merging various technologies like artificial noses, lab on a chip technology, DNA Sequencing To Go etc. Such a sensor network could be used to detect various threats like nuclear terrorism with dirty bombs, venomed water or biological pathogens early on and alert authorities or nearby people. You could work with mobile phone companies to incorporate those sensors into their products. Companies like Apple would profit from having such sensors in their products by extending their capabilities. This would not only allow the mass production but would also spread the sensors randomly. You might also work together with the government who is always keen to get more information. All it would then take is an app! The analysis of the data could actually be done by the same gadgets that employ the sensors, a public computing grid. This isn't science fiction, it can actually be done. The technology is coming quickly. And best of all, it doesn't just protect us against various risks. Such sensors could be used to detect all kinds of health probl

0Bugmaster14y

I was with you up until this sentence. Really, we can make a global sensor network today ? A network that would detect all conceivable threats everywhere ? This sounds just a tad unrealistic to me, though not logically impossible at some point in the future.

[-]Giles14y30

Thanks - I've read the bullet points and it looks like a really good summary (apologies for skimming - I'll read it in more detail when I have time).

Just a few minor points:

The P(FOOM) calculation appears to be entirely independent of the P(CHARITY) calculation. Should these be made into separate documents? Or should it be made clearer which factors are common to FOOM and CHARITY? (e.g. P5 would appear to be correlated with P9).
In P6, I'm taking "SIAI" to mean a kind of generalized SIAI (i.e. it doesn't have to be this specific team of people

... (read more)

1XiXiDu14y

I wanted to show that even if you assign a high probability to the possibility of risks from AI due to recursive self-improvement, it is still questionable if SIAI is the right choice or if now is the time to act. As I wrote at the top, it was a rather quick write-up and I plan to improve it. I can't get myself to work on something like this for very long. It's stupid, I know. But I can try to improve things incrementally. Thanks for your feedback. That's a good point. SIAI as an organisation that makes people aware of the risk. But from my interview series it seemed like that a lot of AI researchers are aware of it to the point of being bothered. It isn't optimal. It is kind of hard to talk about premises that appear to be the same from a superficially point of view. But from a probabilistic point of view it is important to separate them into distinct parts to make clear that there are things that need to be true in conjunction. That problem is incredibly mathy and given my current level of education I am happy that people like Holden Karnofsky tackle that problem. The problem being that we get into the realm of Pascal's mugging here where vast utilities outweigh tiny probabilities. Large error bars may render such choices moot. For more, see my post here.

[-]Thomas14y10

If I understand you correctly, you are saying this: "Don't bother with this superintelligence risk, for it is incredibly tiny."

A bold statement. Too bold for a potentially disastrous chain of events, which you assure us it's just impossible.

4XiXiDu14y

No, not really. I am not saying that because GiveWell says that the Against Malaria Foundation is the number #1 charity, treating children for parasite infections in sub-Saharan Africa should be ignored. This is a delicate problem and if it was up to me to allocate the resources of the world then existential risk researchers, including SIAI, would receive their share of funding. But if I could only choose one cause, either SIAI or something else, then I wouldn't choose SIAI. My opinion on the topic is highly volatile though. There have been moments when I thought that SIAI is best choice when it comes to charitable giving. There has been a time when I was completely sure that a technological singularity will happen soon. Maybe I will change my mind again. I suggest everyone to research the topic themselves.

1Thomas14y

This I agree. I won't buy the whole package from the SIAI, I won't even donate them under the current conditions. But I see some of their points as extremely important and I am happy that they exist and do what they do.

[-][anonymous]14y10

P(FOOM) = P(P1∧P2∧P3∧P4∧P5)

This is wrong because the premises aren't independent. It's actually this:

P(P1) P(P2|P1) P(P3|P2∧P1) P(P4|P3∧P2∧P1) P(P5|P4∧P3∧P2∧P1)

[This comment is no longer endorsed by its author]Reply

[-]Dmytry14y00

Not only there has to be UFAI risk, the FAI development must reduce the risk, which to me looks like the most shaky of the propositions. A buggy FAI that doesn't break itself somehow is for certain unfriendly (e.g. it can want to euthanize you to end your suffering, or to cut apart your brain into 2 hemispheres to satisfy each hemisphere's different desires, or something much more bizarre), while some random AI out of AI design space may e.g. typically wirehead everything except curiosity, and then it'd just keep us in a sort of wildlife preserve.

Note: tr... (read more)

[-]Mitchell_Porter14y-20

Fantastic post. This sets a new standard for "SIAI skepticism". Dialectically it should be very useful as people try to rebut it at the same level of detail. I think you shouldn't mess with it too much now, as it may become a reference point.

LESSWRONG
is fundraising!
LW

LESSWRONG
is fundraising!
LW

1

Risks from AI and Charitable Giving

1

1

Abstract

Requirements for an Intelligence Explosion

Requirements for SIAI to constitute an optimal charity

Further Reading

Notes and References