Self-improving AGI: Is a confrontational or a secretive approach favorable?

[-]Armok_GoB14y100

Upon reading this I instantly like it and find it thrilling and get an impulse to slam the button voting for secrecy with my face hidden in shadow while spouting a dramatic one-liner, because it'd look awesome in the movie. This is somewhat impeding my ability to actually consider the matter rationally.

9Jonathan_Graehl14y

The Button

1Raw_Power14y

Oh my GOD that was hilarious!

[-]Vladimir_Nesov14y70

The potential benefit is that more of the people who could be working on random-goals AGI, or on FAI, or contributing to support/funding of such projects, would consider the idea, and so AGI risk gets reduced and FAI progress gets a boost. The potential downside is what, exactly, at what marginal cost? I don't think this tradeoff works the way you suggest.

6Friendly-HI14y

I'm not sure I'm following... do you honestly think that the cost of openly working on self-improving AGI and openly making statements along the lines of "we need to get this AI exactly right, or else we'll probably kill every man, woman and child on this planet" will be marginal in -say- 30 years, once the majority of people no longer view AGI as the product of a loony imagination but as an actual possibility due to advances in robotics and narrow AI all around them? Don't you think open development of AGI would draw massive media attention once the public is surrounded by and accustomed to all kinds of robots and narrow AIs? Why this optimism about how reasonable people will react to our notion of self-improving AGI? Am I somehow missing something profound from my model of reality? I still expect people to be crazy, religious and irrational in 30 years and the easiest way of dealing with that would simply be to not arouse their attention. Now that most people perceive us as hopeless sci-fi nerds (at best) and AGI still seems at least 500 years away in their mind, of course I'm all for being open and drawing in people and funding - but do you expect such an open approach to work (without interference by the public or politics) until the very completion of a godlike AGI? I severely doubt that, and I find it surprising that this is somehow perceived as a wildly marginal concern. As if it's not even worth thinking about... why is that?

0Jonathan_Graehl14y

In case it helps other readers: the upside/downside is pro/con openly promoting your AGI+FAI work and its importance (vs. working in secret).

1Vladimir_Nesov14y

No. I was talking about discussing the topic, not a specific project. The post discusses LW, and we don't have any specific project on our hands.

2Friendly-HI14y

That explains a lot, thanks for clarifying the misunderstanding. I for one wasn't specifically referring to LW, but I was wondering whether in the coming decades people involved with AGI (AI researchers and others) should be as outspoken about the dangers and capabilities of self-improving AGI as we currently are here on LW. I think I made clear why I wouldn't count on having the support of the global public, even if we did communicate our cause openly and in detail - so if (as I would predict) public outreach won't cut it and may even have severe adverse effects, I'd personally favor keeping a low profile.

[-]JoshuaZ14y60

Secret attempts at AGI have two problems that seem to get repeatedly ignored when this issue is discussed:

First, if there is a real danger that the first AGI is going to go foom and control everything, then having fewer people look at the code makes it more likely that something will go wrong. If there's one thing we've learned about programming in the last fifty years, it is that almost all code has mistakes, but the number of such mistakes goes down when more people can look at it. This is why for example in cryptography, systems which aren't completely publ...

4timtyler14y

FWIW, I discussed that a couple of months ago here.

4Friendly-HI14y

I agree with both of your mentioned drawbacks and would add a third: If someone actually discovered that it's not just a human-level AI but a self-improving AGI with a mission (or if someone from the team leaked that information) the public backlash would be absolutely fatal. And then there would also be the added problem of how to introduce the AGI in a fashion that's not essentially an unconsenting takeover of planet earth. As for my participation in AI-research you don't need to worry. I can hardly code a website and have no intention of learning it or participating directly in AI development. I'm coming from a psychological background, which is probably why I'm unusually concerned about the social repercussions of the self-improving AGI meme. ("Give him a hammer and suddenly everything looks like a nail" as the saying goes. Conversely, some of the technically inclined people here may not even realize that there may be much more to pulling off AGI than the strictly technical aspects.) You are aware that currently about 40% of Americans believe in full-blown creationism and another 40% that God guided the process of evolution with us in mind, right? Ten years ago almost 20% believed the sun orbits our earth and I wouldn't be terribly surprised if essentially nothing about that state of affairs has changed in the meantime... Even if such beliefs somewhat recede in the upcoming 30 years, you should be painfully aware that under such conditions you will never ever get public consent for the development of self-improving AGI. At least not from primal unenhanced brains. So if you ever put our idea of the future up for a vote, we will lose for sure. And that's just America; the rest of the planet of the apes -including my beloved Europe- won't be amused about our futuristic plans either, and even less amused if some other nation or an American corporate giant like Google or IBM tried to pull off something like this in a solo attempt. So I'll just call it how I

1Strange714y

Really? That's not the impression I got from those numbers at all. To me, it sounds less like the public is adamantly resolved to stick with those entrenched ideas, and more like most people will believe all sorts of insane bullshit if you can spin a plausible-sounding explanation of how they might benefit by believing in it, and if you persist long enough. Do you really think the vote would be a one-time thing?

2Friendly-HI14y

There may be something to that perspective, but I think it is unrealistic to expect we could change enough people's minds in so short a time-frame. There's a lot of people out there. Religions have had thousands of years to adapt themselves in such a way that they reinforce and play into people's innate superstitions and psychological desires. In turn, religions also shaped people's culture and until very recently they played the major role in the "nurture" side of the "nature and nurture" make-up of people. Competing with religion on our own terms (rationality) simply won't work so well with the majority of people. Understanding our AGI "message" requires various quantum leaps in thinking and rationality. These insights implicitly and explicitly challenge most innate intuitions about reality and humanity that people currently hold on to. I'm not saying there won't be many people we could persuade without a thorough education in these matters, but because in contrast to religion our "worldview" doesn't tell people what deep down they would like to hear and believe, we're less attractive to those who just can't be arsed into rationality. Which is a lot of people. In conclusion, I'll sum up my basic point in another light yet again: I think I'm not confronting us with a false dichotomy when I say that there are essentially only two possibilities when it comes to introducing AGI into the lives of people: EITHER we're willing to adhere to public consent along current democratic principles. This would entail that we massively concern ourselves with public opinion and make a commitment to not unleash AGI unless the absolute majority of all citizens on this planet (or those whom we consider to meet the criteria of valid consent) approve of our plan. OR we take the attitude that people who do not meet a certain standard of rationality have no business in shaping humanity's future and we become comfortable with deciding over their heads/on their behalf.

-3Strange714y

What about representative democracy? Any given community sends off a few of its cleverest individuals with a mandate to either directly argue for that community's interests, or to select another tier of representatives who will do so. Nobody feels completely excluded, but only a tiny fraction of the overall population actually needs to be educated and persuaded on the issues.

2Friendly-HI14y

How is it representative, if only the cleverest individuals are chosen? That would rather be elitism. If actually only the most rational people with herculean minds would decide, they should theoretically unanimously agree to either do it or not do it anyway, based on a sound probability-evaluation and shared premises based in reality that they all agree on. If those "representative" individuals were democratically determined by vote, then these people most certainly won't be the most intelligent and rational people, but those best at rhetorically convincing others and sucking up to them by exploiting their psychological shortcomings. They would simply be politicians like the ones we have nowadays. So in a way we're where we started. If people don't decide for themselves, they'll simply vote for someone who represents their (or provides them with a new) uninformed opinion. Whoever wins such an election will not be the most rational person, that's for sure (remember when America voted twice for an insane cowboy?) While representative democracy is certainly more practical than the alternatives, I doubt the outcome would be all that better. If we want the most rational and intelligent people to make this decision, then these individuals couldn't be chosen by vote but only by another "elitist" group. I don't know how the public would react to that - I suppose they would not be flattered.

-1Strange714y

I'm not saying it would be a better system overall, just that a relatively small group of politicians would be comparatively easier for us to educate and/or bribe.

0Friendly-HI14y

Yes, that is true. I'm still puzzled though which approach would be better... involving and educating the politicians (there are many who wouldn't understand) or trying to keep them out as long as possible to avoid confrontations and constraints? I already remarked somewhere, that I would find some kind of international effort towards AGI development very preferable, something comparable to CERN would be brilliant. Such a team could first work towards human-level AI and then one-up themselves with self-improving AGI once they gained some trust for their competence. In other words, perhaps advertising and reaching the "low-hanging fruit" of human-level AI plus reaping the amazing benefits of such a breakthrough will raise public and political trust in them, as opposed to some "suspicious" corporation or national institute that suddenly builds potential "weapons" of mass destruction.

1JoshuaZ14y

Well, I'm not at all convinced that substantially self-improving AGI can exist (that is, AGI that will self-improve at such a rate as to quickly gain near complete control of its light cone or something like that). I assign only a small chance to the likelihood that the first AGI will go foom. Also, if I've learned one thing from LW it is that such AI could plausibly be really bad. So I'd rather take a risk-averse strategy if at all possible.

-4Raw_Power14y

Rule number 1 of voting: it's done after a thorough debate where every single party has said everything they wanted to say. Not to generalize from fictional evidence, but "Twelve Angry Men" has shown us a pretty good caricature of the dramatic changes that can happen if you prolong the debate just a little bit longer before voting. Which is why educating the public and letting the ideas circulate is so crucial. You haven't justified this. What does believing God guided Evolution have to do with making plans to build a self-improving artificial intelligence? More importantly, isn't it better that they know about it, and forbid it or put it under extremely intense scrutiny, rather than they not know about it, and some group develop it in obscurity and botch it?

2Friendly-HI14y

I think you're highly delusional about how malleable people's opinions really are... Are you aware of what's going on in politics and the religious sphere? As if just talking really thoroughly about AGI and appealing to rationality is going to get the majority of people from all over the world on our side. Are you serious? The point I made about creationism wasn't just that most people who believe in god probably won't want to see one being built, but that you cannot change people's opinions easily. Even completely ridiculous and unworldly ideas like creationism have hardly budged an inch in the last decade - they are rationality-proof. If you really thoroughly explained to people what this self-improving AGI was good for and how powerful it could really become... they'd totally lose it. They won't welcome "our robot overlords", regardless of how nice you make the resulting utopias sound. People fear the unknown and on a gut level they will immediately reject our idea and rationalize in the blink of an eye why we're wrong, and crazy, and have to be stopped. I'm all for thoroughly educating people about rationality (you've read my suggestion in the other topic), but seriously getting the majority of people behind us? Sorry, but my psychological model of how people and masses behave tells me that this will never happen. At least not without brain-augmentation, and even then a global 50% +1 vote seems quite unlikely to me. Would it be better if people knew in detail about self-improving AGI and could objectively discuss this matter in order to rationally make a decision and responsibly vote on whether or not it should be developed? Hell yeah I'd love that! I'd also love to ride on a flying pig but that's not gonna happen either.

0Raw_Power14y

I'd prefer it if you used "mistaken" rather than "delusional", thank you very much. Ascribing opposing opinions to madness usually signals weakness in your own stance. Quite, see my next paragraph. But as a resident of Europe and the Middle East, I see religion and partisanship shriveling into a husk, intelligence and culture extending and growing, triumphant, as they never have before, and the citizenry reclaiming power over the aloof governments, and over their futures. Humanism wins. And rationalism cannot lose, in the long term, because, as its name indicates, it is the art of being right. That's not what I heard at all. Creationism only became acknowledged as a problem recently. Which means it was secure before, and it is now being challenged, and singing its swan song. Lack of visible, spectacular budging doesn't mean that it isn't crumbling from the inside. And it's really a problem that is endemic to the USA: across the pond, virtually no one believes in Creationism. I suspect this has something to do with the education of the masses, which is very overlooked in the USA. Once US society feels the need to raise its own education level, for any economic reasons, the problems derived from ignorance will just extinguish themselves by sheer lack of combustible. That's why the improvement of public, mass education, and the spreading of our Art must be a priority, if not our number one priority. That's why I said "explaining things thoroughly": by that I mean raising the level of awareness of the general public. Here in Spain, France, the UK, the majority of people are Atheists. In the USSR virtually everyone was an atheist. Beliefs are extremely malleable. By Raising The Sanity Waterline, we'll make it so that they are only malleable through empirical evidence, and, if we do it right, people won't even notice. You believe in Transhuman-level cybernetics and brain expansions and you don't believe we can make pigs fly and carry people on their bac

4Friendly-HI14y

Funny, on my second read-through I thought about editing it, but then my mind went "whatever, a healthy ego can probably take it". No hurt feelings I hope. I'm also living in the EU, but I'm very aware and constantly following what's going on in the US, because their erratic development concerns me. As far as evolution and creationism goes, I'm drawing my statistics from the Gallup polls: in the last 30 years creationism went down to 40% from 45%, "evolution through divine intervention" remained stagnant at 40% and "plain natural selection" went up about 5% to 15%. There is a positive movement here, but I don't see how in another 30 years the collapse of religion will be imminent. And that's just the US, to a resident of the Middle East I shouldn't need to point out, that there are plenty of countries out there much more religious than the US. (Essentially all of them, apart from a few developed countries - and even those countries have usually only around 20% confirmed atheists. Many don't visit the church, but they're still holding on to a mountain of superstitious garbage -> http://en.wikipedia.org/wiki/Demographics_of_atheism#Europe ) You're of course also right that here in Europe there's hardly any creationists - unlike in the US it's just too damaging to one's reputation, so people conveniently adapt their views. I doubt however, that this has all that much to do with the quality of education, and a lot more with cultural attitudes towards religion. As far as Europe goes, I can imagine that the church in the biggest countries (France, Germany, Spain, UK...) will be essentially dead in another 30 years, but what has that to say about people's ability to make rational decisions? There's a lot more to rationality than not believing in obvious bs. There's gonna be close to 9 billion people in 30 years and you think we -a tiny speck of nerds like LW- could hope to reach out and educate a sizable portion (almost half no less) of the world's people in the art of

-2Raw_Power14y

As a resident of the Middle East, I can tell you that mentalities are changing fast. Regardless, Muslims' attitude towards Science isn't the same as Christians', since they don't feel threatened by it, believing that the Qur'an not only isn't in conflict with science, but actually anticipated some discoveries. They also reclaim the development of modern scientific research as a proud heritage they are enthusiastic to live up to again, and believe researchers should be left alone to investigate, no matter how outrageous the stuff they come up with is (if I remember well, I think there's even a command in the Qur'an specifically to that effect). As for countries that have been converted to major religions by colonialism, I have a strong feeling that they would actually convert to whatever looks coolest, most Western and most high-status-signalling. We just need to be about 20% cooler than everyone else. Seems manageable. We should be able to teach rationality to anyone capable of deliberative thought. That is, anyone with an IQ over 70. That the original developers and vanguard are more fast-learning than average is not surprising at all. Our stuff is simpler, less confusing, far clearer, and far more useful, than anything any religion can teach. I think people could definitely be attracted to our lack of bullshit, if we sell it right. LOL at the last bit!

1Karl14y

I would be interested in knowing where you got your numbers because the statistics I found definitively disagreed with this.

-1Raw_Power14y

Checks his numbers Forgive me. I should have said the majority of young people (below 30) who, for our uses and purposes, are those who count, and the target demographic. It has come to the point that self-declared Christian kids get bullied and insulted [which is definitely wrong and stupid and not a very good sign that the Sanity Waterline was raised much]. Then again, I have this rule of thumb that I don't count people who don't attend church as believers, and automatically lump them into the "atheist in the making" category, a process that is definitely not legitimate nor fair. I sincerely apologize for this, and retract the relevant bits. Now let's see. For one thing The fact that Jedi outnumber Jews in the UK should be a sign that people don't take that part of the polls very seriously. That said This last bit I found particularly troubling because I do not recall meeting a single person, in all my time in Spain, who declared themselves a Christian except in name only (as in, embarrassingly confessing they only got baptized or went to Communion to please the grandparents). Some entertained some vague fuzziness, but simply telling them a little about "belief in belief" and some reductionist notions has been enough to throw them into serious doubt. I may very well be mistaken, but my perception is that they are really ripe for the taking, and only need to hear the right words. My perception as a young Arab-European is that the trend is overwhelmingly in the direction of faithlessness, and that it is an accelerating process with no stopping force in sight.

1JGWeissman14y

That is indeed a problem, but it is nowhere near as bad as a public AGI getting forked by people who don't know what they are doing. The polished cryptographic functions that benefited from all those eyeballs were not the first in their code history to be executed. For cryptographic systems that don't ruin the universe if slightly wrong, that is ok, but for AGI that is very bad.

-2timtyler14y

It seems to me that forking is common practice in the open source arena. It rarely causes major problems - and is sometimes very healthy if a dominant project starts going in a direction that people don't like.

3nshepperd14y

A bad fork rarely destroys the world in the open source arena.

-2timtyler14y

Right - and that is very unlikely in the future too, I reckon. You typically need marketing, support infrastructure, social contacts, etc. to get ahead. Most forks don't have that. "Bad" forks are especially unlikely to succeed - and good forks we are OK with. We don't try and stop the mafia using powerful IT tools - like EMACS. We realise that is not a practical reason for keeping such power secret.

[-]timtyler14y40

Once robots become more commonplace in our lives, I think we can reasonably expect that people will begin to place their trust into simple AI's - and they will hopefully become less suspicious towards AGI and simply assume (like a lot of current AI-researchers apparently) that somehow it is trivial to make it behave friendly towards humans.

Step one is to use machine intelligence to stop the carnage on the roads. With machines regularly brutally killing and maiming people, trust in machines is not going to get very high.

2Raw_Power14y

Car accidents take more lives in developed countries than actual wars. This is depressing.

8JoshuaZ14y

It tells us where we should concentrate our work. But this isn't depressing: this is a sign that as a society we've become a lot more peaceful over the last few hundred years. Incidentally, the number of traffic fatalities in the US has shown a general downward trend for the last fifty years, even as the US population has increased. Moreover, I think the same is true in much of Europe. (I don't have a citation for this part though.)

1fubarobfusco14y

I'd expect a lot of that downward trend is due to better engineering, or to be specific, more humane engineering — designing cars in a way that takes into account the human preference that survival of the humans inside the car is a critical concern. A 1950s car is designed as a machine for going fast. A modern car is designed as a machine to protect your life at high speed. The comparison is astounding. It is arguably an example of rationality failure that automobile safety had to become a political issue before this change in engineering values was made.

6timtyler14y

Right - and we mostly know how to fix it: smart cars. We pretty-much have the technology today, it just needs to be worked on and deployed.

2Raw_Power14y

Linkies? If only we could say the same of the accident victims... "We have the technology. We can rebuild them..."

-1timtyler14y

Well, search if you are interested. There's a lot of low-hanging fruit in the area. From slamming on the brakes when you are about to hit something, to pointing a camera at the driver to see if they are awake.

4Raw_Power14y

Any reason why that sector isn't developing explosively then? Or is it actually developing explosively and we just don't notice?

2timtyler14y

Safety is improving - despite there being more vehicles on the roads. ...and yes, there are developments taking place with smart cars, e.g. Volvo Pedestrian Detection. Of course one issue is how to sell additional safety to consumers. It is often most visible in the price tag.

2Raw_Power14y

I suggest legislation. It's hard to get someone to pay additional money to protect others, especially from themselves. It's much easier to get them to feel the fuzzy moral righteousness of supporting a law that forces them to do so by making those measures compulsory.

3timtyler14y

That might take a while, though. What might help a little in the meantime is recognition and support from insurance companies.

2[anonymous]14y

This seems to suffer the same problems as robotics in surgery. People not only can't readily understand the expected utility benefit of having a robot assist a doctor with difficult incisions, they go further and demand that we don't act or talk about medical risk as if it is quantifiable. Most people tend to think that if you reduce their medical care down to a numeric risk, even if that number is very accurate and it is really quite beneficial to have it, then you somehow are cold and don't care about them. I think an insurance company would have a hard time not alienating its customers (who are mostly non-rationalists) by showing interest in any procedure that attempts to take control of human lives out of the hands of humans -- even if doing so was statistically undeniably safer. People don't care much about what actually is safer, rather what is "safer" in some flowery model that includes religion and apple pie and the American dream, etc. etc. I think getting societies at large to adopt technologies like these either has to just be enforced through unpopular legislation or a massive grassroots campaign that gets younger generations to accept methods of rationality.

1Raw_Power14y

Yeah, definitely! They'd certainly like that, if they could be convinced it'd be cost-effective for them. Then lobbying occurs.

0Friendly-HI14y

...except if safe cars become so abundant one day that no one will want to pay insurance for such an unlikely incident as dying inside a car.

0timtyler14y

Right - so support from insurance companies would be an interim measure - before we got to that point.

1Wilka14y

I think http://inhabitat.com/google-succeeds-in-making-driverless-cars-legal-in-nevada/ was a big step to helping improve that. Providing it works, once people start to notice the (hopefully) massive drop in traffic accidents for autonomous cars they should push for them to be more widespread. Still, it's a way off for them to actually be in use on the roads.

[-]Manfred14y40

If some FAI project is already right about everything and is fully funded, secrecy is helpful because it reduces outside interference.
If it's not, then secrecy is bad. Secrecy loses all sorts of cool community resources, from bug finding to funding to someone to bounce ideas off of (See JoshuaZ's longer post).

So the problem is one of balancing the cost of lost resources if they're wrong against the chance of interference if right. I guess I'm more hopeful about the low costs of openness (edit: not democracy, just non-secrecy) than you. The people most likely to object to building an AI even when they're wrong are the least likely to understand, after all :P

[-]Perplexed14y40

With apologies to Ludwig Wittgenstein, if we can't talk about the singularity, maybe we should just remain silent. :)

I happen to agree with you that the SIAI mission will never be popular. But a part of the purpose of this website is to create more people willing and capable to work (directly or indirectly) on that mission. So, not mentioning FAI would be a bit counterproductive - at least at this stage.

0[anonymous]14y

This is the "point that we can't see beyond" business? Surely that is just a crock.

[-]Will_Sawin14y40

UFAI-prevention is a much more difficult and serious problem than FAI. A thousand lesswrongs or a thousand SIAIs could be destroyed, and still the next could make an FAI and usher in a utopia. But if one organization makes an UFAI, we are all doomed forever.

I think, mostly, the anti-AI crazies are on our side.

0torekp14y

Yes, the anti-AI crazies are a net benefit. By exerting political pressure, they are likely to slow down many groups that otherwise might be too quick and sloppy about AI advances. Technologists will be forced to anticipate possible objections to their creations, and may add safety features in response. Of course, managers will also add marketing - but marketing alone will probably not be the only response.

-2timtyler14y

Surely the former is a subset of the latter. If you check with a definition, "non-human-harming" is one part of the specification.

-1Will_Sawin14y

Our claims do not contradict. If FAI succeeds, then so does UFAI prevention, so UFAI prevention is in some sense a subproblem. But UFAI-prevention remains a more important problem. There are 3 possible states of the world in 50 years: No AI, UFAI, and FAI. Utility(No AI) - Utility(UFAI) >> Utility(FAI) - Utility(No AI)

0timtyler14y

It isn't a more difficult problem, though. It is an easier problem. The idea that it is more important is Nick Bostrom's "Maxipok" principle.

1nshepperd14y

I must point out that "the FAI problem" could refer to one of two things: creating FAI before UFAI, or the pure technical problem of building FAI given essentially unlimited time. The former (which is basically what UFAI prevention amounts to) is, I expect, far harder than the latter. So, for the benefit of anyone reading, UFAI prevention is 1) at least as easy as creating FAI before UFAI (which will involve more than just software development, probably) but 2) much harder than building FAI itself.

0timtyler14y

If we are entertaining abstract problems from fantasy worlds there is also the case of unlimited resources to consider.

1nshepperd14y

I'm trying to be more pragmatic than that. The average person, when they read "how hard is it to build FAI?" probably does not think of the task of building FAI while trying to prevent UFAI. They think of solving decision theory and metaethics and implementation of CEV or whatever. There's a sensible notion of how hard it is to build FAI on its own, without involving UFAI-prevention. That's what I'm talking about. And I don't want people to confuse those things. It's one thing to say UFAI prevention is as easy as "building FAI (before UFAI)". But it's much harder than, you know, just building FAI in a world without the UFAI threat, which is what I think people will think of when you say "no, we just have to build FAI". Well, yes, but you don't just have to build it, you have to build it before anyone else creates AGI.

[-]orthonormal14y30

You bring up a good topic, but this post isn't developed enough to go on the front page. I'd rather you'd posted it to Discussion.

2Friendly-HI14y

Thank you. I wasn't even really aware that there was a distinction. On reflection you're certainly right, since the topic was indeed intended as a discussion, rather than an article that is aimed at education. Upvoted for valuable input. Maybe I can still change it in retrospect, I'll try it out. EDIT: Piece of cake, it's in the discussion section now. Thanks 4 mentioning it.

[-]Jonathan_Graehl14y30

If you're serious, then you must be pretty sure secrecy is not imperative, otherwise you'd be more hesitant to discuss this in public.

Those who oppose avowed FAI attempts must consider whether they're prepared to live in a world where only secretive AI attempts exist (specifically: are those stealthy attempts more likely to be either accidentally or intentionally unfriendly to them, than the public project they oppose?).

This topic won't be relevant for a long time, but I don't see anything to object to in your thinking about it, except to note that it provides the sort of fuel future conspiracy-believers will love.

0Friendly-HI14y

I don't see why I should be hesitant to discuss this matter nowadays here on LessWrong - there are probably a hundred other discussions about the creative ways in which self-improving AGI may end us. (Although admittedly I am not aware of any that openly ask whether self-improving AGI development should happen in secrecy.) In the stupendously unlikely scenario that this article inspires some kind of "pulling the AGI-stuff out of the public sphere" a decade from now, it would have more than made up for its presence - and if not, then it's just another drop in the bucket for all to see and a worthwhile discussion to be had. I'm serious: self-improving AGI is at least on the same threat-level as nuclear warheads, and it would be quite foolish to assume that 30-50 years from now people like Eliezer or Ben Goertzel could actually build one and somehow remain "unmolested" by governments or public outrage.

0Jonathan_Graehl14y

You don't hesitate to discuss the possibility of secrecy exactly because you don't expect secrecy to have huge benefits that will be spoiled by others' expecting it. My level of concern over this post is also nearly zero. I think this is about effects far in the future (even so: may be worth thinking about now), that depend on decisions that will be made far in the future (so: safe to postpone thinking about).

[-]loup-vaillant14y00

Quick guess (I've only read the first paragraph):

Secrecy sounds kinda impossible anyway, because now we have the internet.

[-]nazgulnarsil14y00

How about we blast them into space at high delta before flipping the switch? Of course this didn't work out so well in Destination: Void.

0Friendly-HI14y

What? Who is "them"? The AIs or the naysayers? I insist we keep the AIs; in contrast to politicians, an AI wouldn't be very useful orbiting, say... Venus. By the way, while we're geeking out on the topic of space... we could release the AI on Mars first, give it access to nanoscale 3D printers and transform that waste of a planet into something useful. On second thought though, I'd rather it started to solve the problems on Earth first; our planet seems to be in need of a deus ex machina asap.

[-]MatthewBaker14y00

At first I downvoted this, but after more review I decided that many people on LW are too quick to downvote top-level posts that don't use very sophisticated language, and changed my vote.

Also, the first rule of the secret AGI safety group is that "If everyone knows, no one will suspect it's a secret!"

[-]Raw_Power14y-10

Building a fAGI isn't our main objective. Our main objective is to stop non-fAGI from being built. I say that until we are 100% sure the AGI would be friendly, we shouldn't build AGI at all.

And the only justification you seem to give for "they're gonna kill us" is "powers not involved in developing it will be unhappy".

Why should powers be involved at all? Why not make it an international, nonprofit, Open-Source program? And why is it a bad idea to reach the consciousness of the public and impart to them a sense of clear and present danger regarding this project, so that they democratically force the necessary institutions into existence?

1benelliott14y

We can never become 100% certain of anything. Even if you just mean "really really sure", that's still quite contentious. Whoever first got in a position to launch would have to weigh up the possibility that they've made a mistake against the possibility that someone else will make a UFAI while they're still checking.

-4Raw_Power14y

This isn't a race. Why "release my FAI before anyone releases a UFAI"? ... Have we even given thought to how a clash between a FAI and a UFAI might develop?

5benelliott14y

At a guess, first mover wins. If foom is correct then even a small head start in self improvement should lead to an easy victory, suggesting that this is, in fact, a race.

0Strange714y

If things are a bit slower, like, days or weeks rather than minutes or seconds, access to human-built infrastructure might still be a factor.

2benelliott14y

I didn't want to give time lengths, since there's a great deal of uncertainty about this, but I was thinking in terms of days or weeks rather than minutes or seconds when I wrote that. I would consider it quite a strange coincidence if two AIs are finished in the same week despite no AI having been discovered prior to that.

0Strange714y

Well, if there's an open-source project, multiple teams could race to put the finishing touches on, and some microchip factory could grant access to the team with the best friendliness-checking rather than the fastest results.

1benelliott14y

It might be possible to organise an open-source project in such a way that those who take part are not racing each other, but they must still deal with the possibility of other projects which may not be as generous in sharing all their data.

-1Raw_Power14y

Wouldn't the UFAI's possible amorality give it an advantage over a morally fettered FAI? Also, friendliness or unfriendliness doesn't dictate the order of magnitude of the AI's development speed (though I suspect proper ethics could really slow a FAI down). It'd be down to the one written to develop faster, not necessarily the first if the other can quickly catch up. But yeah, race elements are undeniable.

3benelliott14y

Probably not enough to overcome much of a head start, especially since a consequentialist FAI could and would do anything necessary to win without fear of being corrupted by power in the process. True, to a limited extent. Still, if the theory about foom is correct the time-lengths involved may be very short, to the point where, barring an unlikely coincidence of development, the first one will take over the world before the second one is even fully coded. Even if that's not the case, there will always be some sort of cut-off 'launch before this time or lose' point. You always have to weigh up the chance that that cut-off is in the near future, bearing in mind that the amount of cleverness and effort needed to build an AGI will be decreasing all the time.

1falenas10814y

That's what the SIAI is for: creating a way to code friendliness now, so that when it comes down to building an AGI, FAI is just as easy to build as UFAI.

-1Friendly-HI14y

By "they're gonna kill us" I assume you mean our potential adversaries. Well, by "powers" I essentially meant other nations, the general public, religious institutions and perhaps even corporations. You are of course right when you say that I can't prove that the public reaction towards AGI development will be highly negative, but I think I did give a sensible justification: self-improving AGI has a higher threat-level than nuclear warheads, and when people realize this (and I suppose they will in ~30 years), then I confidently predict that their reaction will be highly negative. I'll also add that I didn't pose any specific scenarios like public lynchings. There are numerous other ways to repress and shut down AGI research, and nowhere did I speculate that an angry mob would kill the researchers. Why not make self-improving AGI research open-source, you ask? Essentially for the same reasons why biological weapons don't get developed in open-source projects. Someone could simply steal the code and release an unsafe AI that may kill us all. (By the way, at the current stage of AGI development an open-source project may be a terrific way to move things along, but once things get more sophisticated you can't put self-improving AGI code "out there" for the whole world to see and modify; that's just madness.) As far as my opinion of how likely worldwide democratic consensus about developing self-improving AGI goes, I think I made my point and don't need to elaborate it further.

-3Raw_Power14y

People were quite enthusiastic about nukes when they were first introduced. It's all a matter of perception and timing. I know you didn't, I was speaking figuratively. My bad. AFAIK, biological weapons don't get developed at all, mostly because of how incredibly dangerous and unreliable they are. There's a lot of international scrutiny, of each other and of oneself, over this. Perhaps the same policy can and should be imposed on AGI? Blasphemy! Why would that be so? You explained your opinion, but haven't justified it to my satisfaction. A lot of your argument is implicit, and I suspect that if we made it explicit we'd find out it's based on unwarranted heuristics, i.e. prejudice. Please don't take this personally: you're suggesting an important update of my beliefs, and I want to be thorough before adopting it.

[-]Vilja14y-10

Humans are rather adaptable and intelligent, so teaching them rationality seems like a good and possibly stable way of letting humans decide for themselves. Then again, there are plenty of different life forms on this planet - maybe they too should have their say in what's a good way to do things and what to do.

Making anything friendly towards anything at all in a strange environment seems a rather difficult question. Using what works until a better idea comes along seems like a very good idea, but it's not as if we know what works - except perhaps some preh...

[+]FreedomJury14y-100
