The Importance of Self-Doubt

by multifoliaterose · 10 min read · 19th Aug 2010 · 746 comments

24

Underconfidence
Personal Blog

[Added 02/24/14: After I got feedback on this post, I realized that it carried unnecessary negative connotations (despite conscious effort on my part to avoid them), and if I were to write it again, I would have framed things differently. See Reflections on a Personal Public Relations Failure: A Lesson in Communication for more information. SIAI (now MIRI) has evolved substantially since 2010 when I wrote this post, and the criticisms made in the post don't apply to MIRI as presently constituted.]

Follow-up to: Other Existential Risks, Existential Risk and Public Relations

Related to: Tsuyoku Naritai! (I Want To Become Stronger), Affective Death Spirals, The Proper Use of Doubt, Resist the Happy Death Spiral, The Sin of Underconfidence

In Other Existential Risks I began my critical analysis of what I understand to be SIAI's most basic claims. In particular I evaluated part of the claim

(1) At the margin, the best way for an organization with SIAI's resources to prevent global existential catastrophe is to promote research on friendly Artificial Intelligence, work against unsafe Artificial Intelligence, and encourage rational thought.

It's become clear to me that before I evaluate the claim

(2) Donating to SIAI is the most cost-effective way for charitable donors to reduce existential risk.

I should (a) articulate my reasons for believing in the importance of self-doubt and (b) give the SIAI staff an opportunity to respond to the points which I raise in the present post as well as my two posts titled Existential Risk and Public Relations and Other Existential Risks.

Yesterday SarahC described to me how she had found Eliezer's post Tsuyoku Naritai! (I Want To Become Stronger) really moving. She explained:

I thought it was good: the notion that you can and must improve yourself, and that you can get farther than you think.

I'm used to the other direction: "humility is the best virtue."

I mean, this is a big fuck-you to the book of Job, and it appeals to me.

I was happy to learn that SarahC had been positively affected by Eliezer's post. Self-actualization is a wonderful thing and it appears as though Eliezer's posting has helped her self-actualize. On the other hand, rereading the post prompted me to notice that there's something about it which I find very problematic. The last few paragraphs of the post read:

Take no pride in your confession that you too are biased; do not glory in your self-awareness of your flaws.  This is akin to the principle of not taking pride in confessing your ignorance; for if your ignorance is a source of pride to you, you may become loathe to relinquish your ignorance when evidence comes knocking.  Likewise with our flaws - we should not gloat over how self-aware we are for confessing them; the occasion for rejoicing is when we have a little less to confess.

Otherwise, when the one comes to us with a plan for correcting the bias, we will snarl, "Do you think to set yourself above us?"  We will shake our heads sadly and say, "You must not be very self-aware."

Never confess to me that you are just as flawed as I am unless you can tell me what you plan to do about it.  Afterward you will still have plenty of flaws left, but that's not the point; the important thing is to do better, to keep moving ahead, to take one more step forward.  Tsuyoku naritai!

There's something to what Eliezer is saying here: when people are too strongly committed to the idea that humans are fallible this can become a self-fulfilling prophecy where humans give up on trying to improve things and as a consequence remain fallible when they could have improved. As Eliezer has said in The Sin of Underconfidence, there are social pressures that push against having high levels of confidence even when confidence is epistemically justified:

To place yourself too high - to overreach your proper place - to think too much of yourself - to put yourself forward - to put down your fellows by implicit comparison - and the consequences of humiliation and being cast down, perhaps publicly - are these not loathesome and fearsome things?

To be too modest - seems lighter by comparison; it wouldn't be so humiliating to be called on it publicly, indeed, finding out that you're better than you imagined might come as a warm surprise; and to put yourself down, and others implicitly above, has a positive tinge of niceness about it, it's the sort of thing that Gandalf would do.

I have personal experience with underconfidence. I'm a careful thinker and when I express a position with confidence my position is typically well considered. For many years I generalized from one example and assumed when people express positions with confidence they've thought their positions out as well as I have. Even after being presented with massive evidence that few people think things through as carefully as I do, I persisted in granting the (statistically ill-considered) positions of others far more weight than they deserved for the very reason that Eliezer describes above. This seriously distorted my epistemology because it led to me systematically giving ill-considered positions substantial weight. I feel that I have improved on this point, but even now, from time to time I notice that I'm exhibiting irrationally low levels of confidence in my positions.

At the same time, I know that at times I've been overconfident as well. In high school I went through a period when I believed that I was a messianic figure whose existence had been preordained by a watchmaker God who planned for me to save the human race. It's appropriate to say that during this period of time I suffered from extreme delusions of grandeur. I viscerally understand how it's possible to fall into an affective death spiral.

In my view one of the central challenges of being human is to find an instrumentally rational balance between subjecting oneself to influences which push one in the direction of overconfidence and subjecting oneself to influences which push one in the direction of underconfidence.

In Tsuyoku Naritai! Eliezer describes how Orthodox Judaism attaches an unhealthy moral significance to humility. Having grown up in a Jewish household, and as a consequence having had peripheral acquaintance with Orthodox Judaism, I agree with Eliezer's analysis of Orthodox Judaism in this regard. In The Proper Use of Doubt, Eliezer describes how the Jesuits allegedly are told to doubt their doubts about Catholicism. I agree with Eliezer that self-doubt can be misguided and abused.

However, reversed stupidity is not intelligence. The fact that it's possible to ascribe too much moral significance to self-doubt and humility does not mean that one should not attach moral significance to self-doubt and humility. I strongly disagree with Eliezer's prescription: "Take no pride in your confession that you too are biased; do not glory in your self-awareness of your flaws."

The mechanism that determines human action is that we do what makes us feel good (at the margin) and refrain from doing what makes us feel bad (at the margin). This principle applies to all humans, from Gandhi to Hitler. Our ethical challenge is to shape what makes us feel good and what makes us feel bad in a way that incentivizes us to behave in accordance with our values. There are times when it's important to recognize that we're biased and flawed. Under such circumstances, we should feel proud that we recognize that we're biased, and we should glory in our self-awareness of our flaws. If we don't, then we will have no incentive to recognize that we're biased and be aware of our flaws.

We did not evolve to exhibit admirable and noble behavior. We evolved to exhibit behaviors which have historically been correlated with maximizing our reproductive success. Because our ancestral climate was very much a zero-sum situation, the traits that were historically correlated with maximizing our reproductive success had a lot to do with gaining high status within our communities. As Yvain has said, it appears that a fundamental mechanism of the human brain which was historically correlated with gaining high status is to make us feel good when we have high self-image and feel bad when we have low self-image.

When we obtain new data, we fit it into a narrative which makes us feel as good about ourselves as possible; a way conducive to having a high self-image. This mode of cognition can lead to very seriously distorted epistemology. This is what happened to me in high school when I believed that I was a messianic figure sent by a watchmaker God. Because we flatter ourselves by default, it's very important that those of us who aspire to epistemic rationality incorporate a significant element of "I'm the sort of person who engages in self-doubt because it's the right thing to do" into our self-image. If we do this, then when we're presented with evidence which entails a drop in our self-esteem, we don't reject it out of hand or minimize it as we've been evolutionarily conditioned to do, because the wound of properly assimilating data is counterbalanced by the salve of the feeling "At least I'm a good person as evidenced by the fact that I engage in self-doubt," and failing to exhibit self-doubt would itself entail an emotional wound.

This is the only potential immunization to the disease of self-serving narratives which afflicts all utilitarians by virtue of their being human. Until technology allows us to modify ourselves in a radical way, we cannot hope to be rational without attaching moral significance to the practice of engaging in self-doubt. As the RationalWiki's page on LessWrong says:

A common way for very smart people to be stupid is to think they can think their way out of being apes with pretensions. However, there is no hack that transcends being human...You are an ape with pretensions. Playing a "let's pretend" game otherwise doesn't mean you win all arguments, or any. Even if it's a very elaborate one, you won't transcend being an ape. Any "rationalism" that doesn't expressly take into account humans being apes with pretensions, isn't.


In Existential Risk and Public Relations I suggested that some of Eliezer's remarks convey the impression that Eliezer has an unjustifiably high opinion of himself. In the comments to the post JRMayne wrote

I think the statements that indicate that [Eliezer] is the most important person in human history - and that seems to me to be what he's saying - are so seriously mistaken, and made with such a high confidence level, as to massively reduce my estimated likelihood that SIAI is going to be productive at all.

And that's a good thing. Throwing money into a seriously suboptimal project is a bad idea. SIAI may be good at getting out the word of existential risk (and I do think existential risk is serious, under-discussed business), but the indicators are that it's not going to solve it. I won't give to SIAI if Eliezer stops saying these things, because it appears he'll still be thinking those things.

When Eliezer responded to JRMayne's comment, Eliezer did not dispute the claim that JRMayne attributed to him. I responded to Eliezer saying

If JRMayne has misunderstood you, you can effectively deal with the situation by making a public statement about what you meant to convey.

Note that you have not made a disclaimer which rules out the possibility that you claim that you're the most important person in human history. I encourage you to make such a disclaimer if JRMayne has misunderstood you.

I was disappointed, but not surprised, that Eliezer did not respond. As far as I can tell, Eliezer does have confidence in the idea that he is (at least nearly) the most important person in human history. Eliezer's silence only serves to further confirm my earlier impressions. I hope that Eliezer subsequently proves me wrong. [Edit: As Airedale points out Eliezer has in fact exhibited public self-doubt in his abilities in his posting The Level Above Mine. I find this reassuring and it significantly lowers my confidence that Eliezer claims that he's the most important person in human history. But Eliezer still hasn't made a disclaimer on this matter decisively indicating that he does not hold such a view.] The modern world is sufficiently complicated so that no human no matter how talented can have good reason to believe himself or herself to be the most important person in human history without actually doing something which very visibly and decisively alters the fate of humanity. At present, anybody who holds such a belief is suffering from extreme delusions of grandeur.

There's some sort of serious problem with the present situation. I don't know whether it's a public relations problem or if the situation is that Eliezer actually suffers from extreme delusions of grandeur, but something has gone very wrong. The majority of the people I know outside of Less Wrong who have heard of Eliezer and Less Wrong have the impression that Eliezer is suffering from extreme delusions of grandeur. To such people, this fact (quite reasonably) calls into question the value of SIAI and Less Wrong. On one hand, SIAI looks like an organization which is operating under beliefs which Eliezer has constructed to place himself in as favorable a position as possible rather than with a view toward reducing existential risk. On the other hand, Less Wrong looks suspiciously like the cult of Objectivism: a group of smart people who are obsessed with the writings of a very smart person who is severely deluded, and who describe these writings and the associated ideology as "rational" although they are nothing of the kind.

My own views are somewhat more moderate. I think that the Less Wrong community and Eliezer are considerably more rational than the Objectivist movement and Ayn Rand (respectively). I nevertheless perceive unsettling parallels.


In the comments to Existential Risk and Public Relations, timtyler said

...many people have inflated views of their own importance. Humans are built that way. For one thing, It helps them get hired, if they claim that they can do the job. It is sometimes funny - but surely not a big deal.

I disagree with timtyler. Anything that has even a slight systematic negative impact on existential risk is a big deal.

Some of my most enjoyable childhood experiences involved playing Squaresoft RPGs. Games like Chrono Trigger, Illusion of Gaia, Earthbound, Xenogears, and the Final Fantasy series are all stories about a group of characters who bond and work together to save the world. I found these games very moving and inspiring. They prompted me to fantasize about meeting allies who I could bond with and work together with to save the world. I was lucky enough to meet one such person in high school who I've been friends with since. When I first encountered Eliezer I found him eerily familiar, as though he was a long lost brother. This is the same feeling that is present between Siegmund and Sieglinde in Act 1 of Wagner's Die Walküre (modulo erotic connotations). I wish that I could be with Eliezer in a group of characters as in a Squaresoft RPG working to save the world. His writings such as One Life Against the World and Yehuda Yudkowsky, 1985-2004 reveal him to be a deeply humane and compassionate person.

This is why it's so painful for me to observe that Eliezer appears to be deviating so sharply from leading a genuinely utilitarian lifestyle. I feel a sense of mono no aware, wondering how things could have been under different circumstances.

One of my favorite authors is Kazuo Ishiguro, who writes about the themes of self-deception and people's attempts to contribute to society. In a very good interview Ishiguro said

I think that's partly what interests me in people, that we don't just wish to feed and sleep and reproduce then die like cows or sheep. Even if they're gangsters, they seem to want to tell themselves they're good gangsters and they're loyal gangsters, they've fulfilled their 'gangstership' well. We do seem to have this moral sense, however it's applied, whatever we think. We don't seem satisfied, unless we can tell ourselves by some criteria that we have done it well and we haven't wasted it and we've contributed well. So that is one of the things, I think, that distinguishes human beings, as far as I can see.

But so often I've been tracking that instinct we have and actually looking at how difficult it is to fulfill that agenda, because at the same time as being equipped with this kind of instinct, we're not actually equipped. Most of us are not equipped with any vast insight into the world around us. We have a tendency to go with the herd and not be able to see beyond our little patch, and so it is often our fate that we're at the mercy of larger forces that we can't understand. We just do our little thing and hope it works out. So I think a lot of the themes of obligation and so on come from that. This instinct seems to me a kind of a basic thing that's interesting about human beings. The sad thing is that sometimes human beings think they're like that, and they get self-righteous about it, but often, they're not actually contributing to anything they would approve of anyway.

[...]

There is something poignant in that realization: recognizing that an individual's life is very short, and if you mess it up once, that's probably it. But nevertheless, being able to at least take some comfort from the fact that the next generation will benefit from those mistakes. It's that kind of poignancy, that sort of balance between feeling defeated but nevertheless trying to find reason to feel some kind of qualified optimism. That's always the note I like to end on. There are some ways that, as the writer, I think there is something sadly pathetic but also quite noble about this human capacity to dredge up some hope when really it's all over. I mean, it's amazing how people find courage in the most defeated situations.

Ishiguro's quote describes how people often behave in accordance with a sincere desire to contribute and end up doing things that are very different from what they thought they were doing (things which are relatively unproductive or even counterproductive). Like Ishiguro I find this phenomenon very sad. As Ishiguro hints at, this phenomenon can also result in crushing disappointment later in life. I feel a deep spiritual desire to prevent this from happening to Eliezer.

Comments (500 of 746 shown)

This post suffers from lumping together orthogonal issues and conclusions from them. Let's consider individually the following claims:

  1. The world is in danger, and the feat of saving the world (if achieved) would be very important, more so than most other things we can currently do.
  2. Creating FAI is possible.
  3. Creating FAI, if possible, will be conducive to saving the world.
  4. If FAI is possible, person X's work contributes to developing FAI.
  5. Person X's work contributes to saving the world.
  6. Most people's work doesn't contribute to saving the world.
  7. Person X's activity is more important than that of most other people.
  8. Person X believes their activity is more important than that of most other people.
  9. Person X suffers from delusions of grandeur.

A priori, from (8) we can conclude (9). But assuming the a priori improbable (7), (8) is a rational thing for X to conclude, and (9) doesn't automatically follow. So, at this level of analysis, in deciding whether X is overconfident, we must necessarily evaluate (7). In most cases, (7) is obviously implausible, but the post itself suggests one pattern for recognizing when it isn't:

The modern world is sufficiently complicated so that no human n

... (read more)
5 · multifoliaterose · 10y: Your analysis is very careful and I agree with almost everything that you say. I think that one should be hesitant to claim too much for a single person on account of the issue which Morendil raises [http://lesswrong.com/lw/296/the_tragedy_of_the_social_epistemology_commons/21c2?c=1] - we are all connected. Your ability to work on FAI depends on the farmers who grow your food, the plumbers who ensure that you have access to running water, the teachers who you learned from, the people at Google who make it easier for you to access information, etc. I believe that you (and others working on the FAI problem) can credibly hold the view that your work has higher expected value to humanity than that of a very large majority (e.g. 99.99%) of the population. Maybe higher. I don't believe that Eliezer can credibly hold the view that he's the highest expected value human who has ever lived. Note that he has not offered a disclaimer denying the view that JRMayne has attributed to him despite the fact that I have suggested that he do so twice now.
7 · Vladimir_Nesov · 10y: You wrote elsewhere in the thread [http://lesswrong.com/lw/2lr/the_importance_of_selfdoubt/2h9y?c=1]: Does it mean that we need 10^9 Eliezer-level researchers to make progress? Considering that Eliezer is probably at about 1 in 10000 level of ability (if we forget about other factors that make research in FAI possible, such as getting in the frame of mind of understanding the problem and taking it seriously), we'd need about 1000 times more human beings than currently exist on the planet to produce a FAI, according to your estimate. How does this claim coexist with the one you've made in the above comment? It doesn't compute; there is an apparent inconsistency between these two claims. (I see some ways to mend it by charitable interpretation, but I'd rather you make the intended meaning explicit yourself.)
2 · Jonathan_Graehl · 10y: Agreed, and I like to imagine that he reads that and thinks to himself "only 10000? thanks a lot!" :) In case anyone takes the above too seriously, I consider it splitting hairs to talk about how much beyond 1 in 10000 smart anyone is - eventually, motivation, luck, and aesthetic sense / rationality begin to dominate in determining results IMO.
1 · multifoliaterose · 10y: No, in general p(n beings similar to A can do X) does not equal n multiplied by p(A can do X). I'll explain my thinking on these matters later.
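The inequality in the comment above can be seen in a toy model. This is only a sketch under an assumption the thread does not make: that the n attempts are independent and each succeeds with the same probability p. The function name and the numbers are hypothetical, chosen for illustration. Under that assumption the chance that at least one attempt succeeds is 1 - (1 - p)^n, which is never larger than n * p, and n * p can even exceed 1.

```python
def p_at_least_one(p: float, n: int) -> float:
    """Probability that at least one of n independent attempts succeeds.

    Assumes every attempt succeeds with the same probability p and that
    the attempts are independent (an illustrative assumption only).
    """
    return 1.0 - (1.0 - p) ** n


if __name__ == "__main__":
    # Hypothetical values, not estimates taken from the discussion.
    for p, n in [(0.5, 2), (0.5, 3), (1e-9, 10**9)]:
        naive = n * p                  # the "n multiplied by p" shortcut
        exact = p_at_least_one(p, n)   # the independent-attempts model
        print(f"p={p}, n={n}: n*p={naive:.3f}, 1-(1-p)^n={exact:.3f}")
```

Real research efforts are not independent trials (people build on one another's results, and some problems may need a threshold of ability rather than headcount), so the true relationship could differ from this model in either direction.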
4 · whpearson · 10y: I'd argue that a lot of people's work does. Everybody that contributes to keeping the technological world running (from farmers to chip designers) enables us to potentially save ourselves from the longer term non-anthropogenic existential risks.
5 · Vladimir_Nesov · 10y: Obviously, you need to interpret that statement as "Any given person's work doesn't significantly contribute to saving the world". In other words, if we "subtract" that one person, the future (in the aspect of the world not ending) changes insignificantly.
2 · whpearson · 10y: Are you also amending (4) to have the significant clause? Because there are lots of smart people that have worked on AI, whose work I doubt would be significant. And that is the nearest reference class I have for likely significance of people working on FAI.
1 · Vladimir_Nesov · 10y: I'm not amending, I'm clarifying. (4) doesn't have world-changing power in itself, only through the importance of FAI implied by other arguments, and that part doesn't apply to activity of most people in the world. I consider the work on AI as somewhat significant as well, although obviously less significant than work on FAI at the margin, since many more people are working on AI. The argument, as applied to their work, makes them an existential threat (moderate to high when talking about the whole profession, rather weak when talking about individual people). As for the character of work, I believe that at the current stage, productive work on FAI is close to pure mathematics (but specifically with problem statements not given), and very much unlike most of AI or even the more rigorous kinds from machine learning (statistics).
1 · MartinB · 10y: That makes me wonder who will replace Norman Borlaug, or let's say any particular influential writer or thinker.
1 · CarlShulman · 10y: Agreed. More broadly, everyone affects anthropogenic existential risks too, which limits the number of orders of magnitude one can improve in impact from a positive start.
1 · Wei_Dai · 8y: (4 here being "If FAI is possible, person X's work contributes to developing FAI.") This seems to be a weak part of your argument. A successful FAI attempt will obviously have to use lots of philosophical and technical results that were not developed specifically with FAI in mind. Many people may be contributing to FAI, without consciously intending to do so. For example when I first started thinking about anthropic reasoning I was mainly thinking about human minds being copyable in the future and trying to solve philosophical puzzles related to that. Another possibility is that the most likely routes to FAI go through intelligence enhancement or uploading, so people working in those fields are actually making more contributions to FAI than people like you and Eliezer.

Unknown reminds me that Multifoliaterose said this:

The modern world is sufficiently complicated so that no human no matter how talented can have good reason to believe himself or herself to be the most important person in human history without actually doing something which very visibly and decisively alters the fate of humanity. At present, anybody who holds such a belief is suffering from extreme delusions of grandeur.

This makes explicit something I thought I was going to have to tease out of multi, so my response would roughly go as follows:

  • If no one can occupy this epistemic state, that implies something about the state of the world - i.e., that it should not lead people into this sort of epistemic state.
  • Therefore you are deducing information about the state of the world by arguing about which sorts of thoughts remind you of your youthful delusions of messianity.
  • Reversed stupidity is not intelligence. In general, if you want to know something about how to develop Friendly AI, you have to reason about Friendly AI, rather than reasoning about something else.
  • Which is why I have a policy of keeping my thoughts on Friendly AI to the object level, and not worrying about ho
... (read more)
7 · Jonathan_Graehl · 10y: Upvoted for being clever. You've (probably) refuted the original statement as an absolute. You're deciding not to engage the issue of hubris directly. Does the following paraphrase your position?
  1. Here's what I (and also part of SIAI) intend to work on.
  2. I think it's very important (and you should think so for reasons outlined in my writings).
  3. If you agree with me, you should support us.
If so, I think it's fine for you to not say the obvious (that you're being quite ambitious, and that success is not assured). It seems like some people are really dying to hear you say the obvious.

Upvoted for being clever.

That's interesting. I downvoted it for being clever. It was a convoluted elaboration of a trivial technicality that only applies if you make the most convenient (for Eliezer) interpretation of multi's words. This kind of response may win someone a debating contest in high school but it certainly isn't what I would expect from someone well versed in the rationalism sequences, much less their author.

I don't pay all that much attention to what multi says (no offence intended to multi) but I pay close attention to what Eliezer does. I am overwhelmingly convinced of Eliezer's cleverness and brilliance as a rationalism theorist. Everything else, well, that's a lot more blurry.

3 · Furcas · 10y: I don't think Eliezer was trying to be clever. He replied to the only real justification multi offered for why we should believe that Eliezer is suffering from delusions of grandeur. What else is he supposed to do?
8 · wedrifid · 10y: I got your reply and respect your position. I don't want to engage too much here since it would overlap with discussion surrounding Eliezer's initial reply [http://lesswrong.com/lw/2lr/the_importance_of_selfdoubt/2h2u?c=1] and potentially be quite frustrating. What I would like to see is multifoliaterose giving a considered response to the "If not, why not?" question in that link. That would give Eliezer the chance to respond to the meat of the topic at hand. Eliezer has been given a rare opportunity. He can always write posts about himself, giving justifications for whatever degree of personal awesomeness he claims. That's nothing new. But in this situation it wouldn't be perceived as Eliezer grabbing the megaphone for his own self-gratification. He is responding to a challenge, answering a request. Why would you waste the chance to, say, explain the difference between "SIAI" and "Eliezer Yudkowsky"? Or at least give some treatment of p(someone other than Eliezer Yudkowsky is doing the most to save the world). Better yet, take that chance to emphasise the difference between p(FAI is the most important priority for humanity) and p(Eliezer is the most important human in the world).
5 · khafra · 10y: As Graehl and wedrifid observed, Eliezer responded as if the original statement were an absolute. He applied deductive reasoning and found a reductio ad absurdum. But if, instead of an absolute, you see multifoliaterose's characterization as a reference class: "People who believe themselves to be one of the few most important in the world without having already done something visible and obvious to dramatically change it," it can lower the probability that Eliezer is, in fact, that important by a large likelihood ratio. Whether this likelihood ratio is large enough to overcome the evidence on AI-related existential risk and the paucity of serious effort dedicated to combating it is an open question.
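The reference-class argument in the comment above can be read as a Bayesian update in odds form: posterior odds equal prior odds times the likelihood ratio of each piece of evidence. The sketch below is a minimal illustration of that arithmetic only. The function name and every number in it are hypothetical placeholders, not estimates offered by anyone in the thread.

```python
def update_probability(prior: float, likelihood_ratio: float) -> float:
    """Apply one Bayesian update in odds form and return the new probability.

    likelihood_ratio = P(evidence | hypothesis) / P(evidence | not hypothesis).
    A ratio below 1 counts against the hypothesis, above 1 counts for it.
    """
    prior_odds = prior / (1.0 - prior)
    posterior_odds = prior_odds * likelihood_ratio
    return posterior_odds / (1.0 + posterior_odds)


if __name__ == "__main__":
    # All values below are made up for illustration.
    p = 0.5                                # hypothetical starting probability
    p = update_probability(p, 1 / 100)     # reference-class evidence against
    print(f"after reference-class evidence: {p:.4f}")
    p = update_probability(p, 20)          # hypothetical object-level evidence for
    print(f"after object-level evidence:    {p:.4f}")
```

Whether the second ratio outweighs the first is exactly the open question the comment raises; the code only shows how the two kinds of evidence would combine once numbers are chosen.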

Success is not assured. I'm not sure what's meant by confessing to being "ambitious". Is it like being "optimistic"? I suppose there are people who can say "I'm being optimistic" without being aware that they are instantiating Moore's Paradox but I am not one of them.

I also disclaim that I do not believe myself to be the protagonist, because the world is not a story, and does not have a plot.

4 · Perplexed · 10y: I hope that the double negative in the last sentence was an error. I introduced the term "protagonist", because at that point we were discussing a hypothetical person who was being judged regarding his belief in a set of three propositions. Everyone recognized, of course, who that hypothetical person represented, but the actual person had not yet stipulated his belief in that set of propositions.
5 · wedrifid · 10y: Interesting. I don't claim great grammatical expertise but my reading puts the last question at reasonable. Am I correct in inferring that you do not believe Eliezer's usage of "I also disclaim" to mean "I include the following disclaimer: " is valid? Regarding 'protagonist' there is some context for the kind of point Eliezer likes to make about protagonist/story thinking in his Harry Potter fanfic. I don't believe he has expressed the concept coherently as a post yet. (I don't see where you introduced the 'protagonist' word so don't know whether Eliezer read you right. I'm just throwing some background in.)
6 · Perplexed · 10y: Regarding "disclaim". I read "disclaim" as a synonym for "deny". I didn't even consider your interpretation, but upon consideration, I think I prefer it. My mistake (again!). :(
1 · Unknowns · 10y: Even if almost everything you say here is right, it wouldn't mean that there is a high probability that if you are killed in a car accident tomorrow, no one else will think about these things (reflective decision theory and so on) in the future, even people who know nothing about you personally. As Carl Shulman points out [http://lesswrong.com/lw/2lr/the_importance_of_selfdoubt/2h4f?c=1], if it is necessary to think about these things it is likely that people will, when it becomes more urgent. So it still wouldn't mean that you are the most important person in human history.

give the SIAI staff an opportunity to respond to the points which I raise in the present post as well as my two posts titled Existential Risk and Public Relations and Other Existential Risks.

Indeed, given how busy everyone at SIAI has been with the Summit and the academic workshop following it, it is not surprising that there has not been much response from SIAI. I was only involved as an attendee of the Summit, and even I am only now able to find time to sit down and write something in response. At any rate, as a donor and former visiting fellow, I am only loosely affiliated with SIAI, and my comments here are solely my own, although my thoughts are certainly influenced by observations of the organization and conversation with those at SIAI. I don’t have the time/knowledge to address everything in your posts, but I wanted to say a couple of things.

I don’t disagree with you that SIAI has certain public relations problems. (Frankly, I doubt anyone at SIAI would disagree with that.) There is a lot of attention and discussion at SIAI about how to best spread knowledge about existential risks and to avoid sounding like a fringe/doomsday organization in doing so. It’s true that...

3Morendil10ySpeaking from personal experience, the SIAI's somewhat haphazard response [http://lesswrong.com/lw/29c/be_a_visiting_fellow_at_the_singularity_institute/24q5?c=1] to people answering its outreach calls [http://lesswrong.com/lw/29c/be_a_visiting_fellow_at_the_singularity_institute/] strikes me as a bigger PR problem than Eliezer's personality. The SIAI strikes me as in general not very good at effective collective action (possibly because that's an area where Eliezer's strengths are, as he admits himself, underdeveloped). One thing I'd suggest to correct that is to massively encourage collaborative posts on LW.
2Airedale10yAgreed. I think that communication and coordination with many allies and supporters has historically been a weak point for SIAI, due to various reasons including overcommitment of some of those tasked with communications, failure to task anyone with developing or maintaining certain new and ongoing relationships, interpersonal skills being among the less developed skill sets among those at SIAI, and the general growing pains of the organization. My impression is that there has been some improvement in this area recently, but there's still room for a lot more. More collaborative posts on LW would be great to see. There have also been various discussions about workshops or review procedures for top-level posts that seem to have generated at least some interest. Maybe those discussions should just continue in the open thread or maybe it would be appropriate to have a top-level post where people could be invited to volunteer or could find others interested in collaboration, workshops, or the like.
1multifoliaterose10yThanks for pointing out "The Level Above Mine." I had not seen it before.

I'd like to vote this up as I agree with lots of the points raised, but I am not comfortable with the personal nature of this article. I'd much rather the bits personal to Eliezer be sent via email.

Probably some strange drama avoidance thing on my part. On the other hand I'm not sure Eliezer would have a problem writing a piece like this about someone else.

I've thought to myself that I have read one too many fantasy books as a kid, so the partying metaphor hits home.

8multifoliaterose10yI was conflicted about posting in the way that I did precisely for the reason that you describe, but after careful consideration decided that the benefits outweighed the costs, in part because Eliezer does not appear to be reading the private messages that I send him.
3JamesAndrix10yI would say that given an audience that is mostly not Eliezer, the best way to send a personal message to Eliezer is to address how the community ought to relate to Eliezer.

Well, in the category of "criticisms of SIAI and/or Eliezer", this text is certainly among the better ones. I could see this included on a "required reading list" of new SIAI employees or something.

But since we're talking about a Very Important Issue, i.e. existential risks, the text might have benefited from some closing warnings, that whatever people's perceptions of SIAI, it's Very Important that they don't neglect being very seriously interested in existential risks because of issues that they might perceive a particular organization working on the topic to have (and that it might also actually have, but that's not my focus in this comment).

I.e. if people think SIAI sucks and shouldn't be supported, they should anyway be very interested in supporting the Future of Humanity Institute at Oxford, for example. Otherwise they're demonstrating very high levels of irrationality, and with regard to SIAI, are probably just looking for plausible-sounding excuses to latch onto for why they shouldn't pitch in.

Not to say that the criticism you presented mightn't be very valid (or not; I'm not really commenting on that here), but it would be very important for people to f...

[anonymous]10y 15

I don't think there's any point doing armchair diagnoses and accusing people of delusions of grandeur. I wouldn't go so far as to claim that Eliezer needs more self-doubt, in a psychological sense. That's an awfully personal statement to make publicly. It's not self-confidence I'm worried about, it's insularity.

Here's the thing. The whole SIAI project is not publicly affiliated with (as far as I've heard) other, more mainstream institutions with relevant expertise. Universities, government agencies, corporations. We don't have guest posts from Dr. X or Think Tank Fellow Y. The ideas related to friendly AI and existential risk have not been shopped to academia or evaluated by scientists in the usual way. So they're not being tested stringently enough.

It's speculative. It feels fuzzy to me -- I'm not an expert in AI, but I have some education in math, and things feel fuzzy around here.

If you want to claim you're working on a project that may save the world, fine. But there's got to be more to show for it, sooner or later, than speculative essays. At the very least, people worried about unfriendly AI will have to gather data and come up with some kind of statistical stu...

The whole SIAI project is not publicly affiliated with (as far as I've heard) other, more mainstream institutions with relevant expertise. Universities, government agencies, corporations. We don't have guest posts from Dr. X or Think Tank Fellow Y.

According to the about page, LW is brought to you by the Future of Humanity Institute at Oxford University. Does this count? Many Dr. Xes have spoken at the Singularity Summits.

At the very least, people worried about unfriendly AI will have to gather data and come up with some kind of statistical study that gives evidence of a threat!

It's not clear how one would use past data to give evidence for or against a UFAI threat in any straightforward way. There are various kinds of indirect evidence that could be presented, and SIAI has indeed been trying more in the last year or two to publish articles and give conference talks presenting such evidence.

Points that SIAI would do better if it had better PR, had more transparency, published more in the scientific literature, etc., are all well-taken, but these things use limited resources, which to me makes it sound strange to use them as arguments to direct funding elsewhere.

5[anonymous]10yMy post was by way of explaining why some people (including myself) doubt the claims of SIAI. People doubt claims when, compared to other claims, they're not justified as rigorously, or haven't met certain public standards. Why do I agree with the main post that Eliezer isn't justified in his opinion of his own importance (and SIAI's importance)? Because there isn't (yet) a lot beyond speculation here. I understand about limited resources. If I were trying to run a foundation like SIAI, I might do exactly what it's doing, at first, and then try to get the academic credentials. But as an outside person, trying to determine: is this worth my time? Is this worth further study? Is this a field I could work in? Is this worth my giving away part of my (currently puny) income in donations? I'm likely to hold off until I see something stronger. And I'm likely to be turned off by statements with a tone that assumes anyone sufficiently rational should already be on board. Well, no! It's not an obvious, open-and-shut deal. What if there were an organization composed of idealistic, speculative types, who, unknowingly, got themselves to believe something completely false based on sketchy philosophical arguments? They might look a lot like SIAI. Could an outside observer distinguish fruitful non-mainstream speculation from pointless non-mainstream speculation?
1torekp10yThanks for that last link. The paper on Changing the frame of AI futurism [http://intelligence.org/theuncertainfuture.html] is extremely relevant to this series of posts.
7WrongBot10yLessWrong is itself a joint project of the SIAI and the Future of Humanity Institute at Oxford. Researchers at the SIAI have published these [http://intelligence.org/research/publications] academic papers. The Singularity Summit's website [http://www.singularitysummit.com/] includes a lengthy list of partners, including Google and Scientific American. The SIAI and Eliezer may not have done the best possible job of engaging with the academic mainstream, but they haven't done a terrible one either, and accusations that they aren't trying are, so far as I am able to determine, factually inaccurate.
6Perplexed10yBut those don't really qualify as "published academic papers" in the sense that those terms are usually understood in academia. They are instead "research reports" or "technical reports". The one additional hoop that these high-quality articles should pass through before they earn the status of true academic publications is to actually be published - i.e. accepted by a reputable (paper or online) journal. This hoop exists for a variety of reasons, including the claim that the research has been subjected to at least a modicum of unbiased review, a locus for post-publication critique (at least a journal letters-to-editor column), and a promise of stable curatorship. Plus inclusion in citation indexes and the like. Perhaps the FHI should sponsor a journal, to serve as a venue and repository for research articles like these.
1CarlShulman10yThere are already relevant niche philosophy journals (Ethics and Information Technology, Minds and Machines, and Philosophy and Technology). Robin Hanson's "Economic Growth Given Machine Intelligence" has been accepted in an AI journal, and there are forecasting journals like Technological Forecasting and Social Change. For more unusual topics, there's the Journal of Evolution and Technology. SIAI folk are working to submit the current crop of papers for publication.
1Perplexed10yCool!
4[anonymous]10yOkay, I take that back. I did know about the connection between SIAI and FHI and Oxford. What are these academic papers published in? A lot of them don't provide that information; one is in Global Catastrophic Risks. At any rate, I exaggerated in saying there isn't any engagement with the academic mainstream. But it looks like it's not very much. And I recall a post of Eliezer's that said, roughly, "It's not that academia has rejected my ideas, it's that I haven't done the work of trying to get academia's attention." Well, why not?
4WrongBot10yLimited time and more important objectives, I would assume. Most academic work is not substantially better than trial-and-error in terms of usefulness and accuracy; it gets by on volume. Volume is a detriment in Friendliness research, because errors can have large detrimental effects relative to the size of the error. (Like the accidental creation of a paperclipper.)
5Morendil10yPossibly because this blog is Less Wrong, positioned as "a community blog devoted to refining the art of human rationality", and not as the SIAI [http://intelligence.org/achievements] blog, or an existential risk blog, or an FAI blog.
4multifoliaterose10yI respectfully disagree with this statement, at least as an absolute. I believe that: (A) In situations in which people are making significant life choices based on person X's claims and person X exhibits behavior which is highly correlated with delusions of grandeur, it's appropriate to raise the possibility that person X's claims arise from delusions of grandeur and ask that person X publicly address this possibility. (B) When one raises the possibility that somebody is suffering from delusions of grandeur, this should be done in as polite and nonconfrontational a way as possible given the nature of the topic. I believe that if more people adopted these practices, this would raise the sanity waterline [http://lesswrong.com/lw/1e/raising_the_sanity_waterline/]. I believe that the situation with respect to Eliezer and portions of the LW community is as in (A) and that I made a good faith effort at (B).
2wedrifid10yI agree with your conclusion but not this part: I categorically do not want statistical studies of the type you mention done. I do want solid academic research done but not experiments. Some statistics on, for example, human predictions vs actual time till successful completion on tasks of various difficulties would be useful. But these do not appear to be the type of studies you are asking for, and nor do they target the most significant parts of the conclusion. You are not entitled to that particular proof [http://lesswrong.com/lw/1ph/youre_entitled_to_arguments_but_not_that/]. EDIT: The 'entitlement' link was broken.
2timtyler10yThere are these fellows: http://singinst.org/aboutus/advisors. Some of them have contributed here: http://singinst.org/media/interviews
1Perplexed10yI only wish it were possible to upvote this comment more than once.

I assign a probability of less than 10^(-9) to you succeeding in playing a critical role on the Friendly AI project that you're working on.

I wish the laws of argument permitted me to declare that you had blown yourself up at this point, and that I could take my toys and go home. Alas, arguments are not won on a points system.

My impression is that you've greatly underestimated the difficulty of building a Friendly AI.

Out of weary curiosity, what is it that you think you know about Friendly AI that I don't?

And has it occurred to you that if I have different non-crazy beliefs about Friendly AI then my final conclusions might not be so crazy either, no matter what patterns they match in your craziness recognition systems?

I wish the laws of argument permitted me to declare that you had blown yourself up at this point, and that I could take my toys and go home. Alas, arguments are not won on a points system.

On the other hand, assuming he knows what it means to assign something a 10^-9 probability, it sounds like he's offering you a bet at 1000000000:1 odds in your favour. It's a good deal, you should take it.

4rabidchicken10yIndeed. I do not know how many people are actively involved in FAI research, but I would guess that it is only in the dozens to hundreds. Given the small pool of competition, it seems likely that at some point Eliezer will make, or already has made, a unique contribution to the field. Get Multi to put some money on it: offer him 1 cent if you do not make a useful contribution in the next 50 years, and if you do, he can pay you 10 million dollars.

I agree it's kind of ironic that multi has such an overconfident probability assignment right after criticizing you for being overconfident. I was quite disappointed with his response here.

2multifoliaterose10yWhy does my probability estimate look overconfident?

One could offer many crude back-of-envelope probability calculations. Here's one: let's say there's

  • a 10% chance AGI is easy enough for the world to do in the next few decades
  • a 1% chance that if the world can do it, a team of supergeniuses can do the Friendly kind first
  • an independent 10% chance Eliezer succeeds at putting together such a team of supergeniuses

That seems conservative to me and implies at least a 1 in 10^4 chance. Obviously there's lots of room for quibbling here, but it's hard for me to see how such quibbling could account for five orders of magnitude. And even if post-quibbling you think you have a better model that does imply 1 in 10^9, you only need to put little probability mass on my model or models like it for them to dominate the calculation. (E.g., a 9 in 10 chance of a 1 in 10^9 chance plus a 1 in 10 chance of a 1 in 10^4 chance is close to a 1 in 10^5 chance.)
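
The arithmetic in this comment can be checked with a short sketch. The three input probabilities and the 9-in-10 / 1-in-10 weights are the commenter's own illustrative guesses; the variable names and the `mixture` helper are hypothetical and introduced here only for illustration.

```python
# Back-of-envelope check of the figures quoted above.
# All inputs are the commenter's illustrative guesses, not measured quantities.

p_agi_feasible = 0.10    # AGI is easy enough to build in the next few decades
p_friendly_first = 0.01  # a team of supergeniuses gets the Friendly kind first
p_team_assembled = 0.10  # such a team is actually put together (independent)

# Treating the three events as independent, the joint probability is the product.
p_model = p_agi_feasible * p_friendly_first * p_team_assembled


def mixture(weighted_estimates):
    """Average several probability estimates, weighted by credence in each model."""
    return sum(weight * estimate for weight, estimate in weighted_estimates)


# 9-in-10 credence in a model that says 1 in 10^9,
# 1-in-10 credence in the rough model above.
p_mixed = mixture([(0.9, 1e-9), (0.1, p_model)])

print(f"rough model estimate: {p_model:.1e}")
print(f"mixture estimate:     {p_mixed:.1e}")
```

Under these assumptions the mixture is dominated by the less extreme model: the 1-in-10^9 term contributes almost nothing to the sum, which is the point the comment makes about placing even a little probability mass on a rival model.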

1multifoliaterose10yI don't find these remarks compelling. I feel similar remarks could be used to justify nearly anything. Of course, I owe you an explanation. One will follow later on.
2Unknowns10yUnless you've actually calculated the probability mathematically, a probability of one in a billion for a natural language claim that a significant number of people accept as likely true is always overconfident. Even Eliezer said that he couldn't assign a probability as low as one in a billion for the claim "God exists" (although Michael Vassar criticized him for this, showing himself to be even more overconfident than Eliezer.)
5komponisto10yI'm afraid I have to take severe exception [http://lesswrong.com/lw/1mw/advancing_certainty/] to this statement. You give the human species far too much credit [http://lesswrong.com/lw/jl/what_is_evidence/] if you think that our mere ability to dream up a hypothesis automatically raises its probability above some uniform lower bound.
1[anonymous]10yThe product of two probabilities above your threshold-for-overconfidence can be below your threshold-for-overconfidence. Have you at least thought this through before? For instance, the claim "there is a God" is not that much less spectacular than the claim "there is a God, and he's going to make the next 1000 times you flip a coin turn up heads." If one-in-a-billion is a lower bound for the probability that God exists, then one-in-a-billion-squared is a generous lower bound for the probability that the next 1000 times you flip a coin will turn up heads. (One-in-a-billion-squared is about one in 2-to-the-sixty.) You're OK with that?
1Unknowns10yYes. As long as you think of some not-too-complicated scenario where the one would lead to the other, that's perfectly reasonable. For example, God might exist and decide to prove it to you by effecting that prediction. I certainly agree this has a probability of at least one in a billion squared. In fact, suppose you actually get heads the next 60 times you flip a coin, even though you are choosing different coins, it is on different days, and so on. By that point you will be quite convinced that the heads are not independent, and that there is quite a good chance that you will get 1000 heads in a row. It would be different of course if you picked a random series of heads and tails: in that case you still might say that there is at least that probability that someone else will do it (because God might make that happen), but you surely cannot say that it had that probability before you picked the random series. This is related to what I said in the torture discussion, namely that explicitly describing a scenario automatically makes it far more probable to actually happen than it was before you described it. So it isn't a problem if the probability of 1000 heads in a row is higher than 1 in 2-to-1000. Any series you can mention would be more likely than that, once you have mentioned it. Also, note that there isn't a problem if the probability of 1000 heads in a row is lower than one in a billion, because when I made the general claim, I said "a claim that a significant number of people accept as likely true," and no one expects to get the 1000 heads.
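
The exponents traded in the exchange above can be checked with a few lines. This is only a sketch of the arithmetic (one-in-a-billion squared versus "2-to-the-sixty", and the chance of 1000 heads from independent fair flips); the variable names are hypothetical, and it takes no side on which probability assignment is reasonable.

```python
import math

# One in a billion, squared: the proposed lower bound from the thread.
one_in_a_billion = 1e-9
lower_bound = one_in_a_billion ** 2  # 1e-18

# Express the bound as "one in 2-to-the-k" by taking the base-2 logarithm.
bits = -math.log2(lower_bound)

# Probability of 1000 heads in a row if the flips are independent and fair,
# kept in log10 form because 2**-1000 is far smaller than the bound above.
flips = 1000
log10_fair_run = -flips * math.log10(2)

print(f"one-in-a-billion squared is about one in 2^{bits:.1f}")
print(f"1000 fair heads in a row is about 10^{log10_fair_run:.0f}")
```

The gap between the two figures is what the disagreement turns on: the independence assumption gives a number hundreds of orders of magnitude below the proposed bound, so the bound can only hold if the flips are not treated as independent.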

I'm inclined to think that Eliezer's clear confidence in his own very high intelligence and his apparent high estimation of his expected importance (not the dictionary-definition "expected", but rather, measured as an expected quantity the usual way) are not actually unwarranted, and only violate the social taboo against admitting to thinking highly of one's own intelligence and potential impact on the world, but I hope he does take away from this a greater sense of the importance of a "the customer is always right" attitude in managing his image as a public-ish figure. Obviously the customer is not always right, but sometimes you have to act like they are if you want to get/keep them as your customer... justified or not, there seems to be something about this whole endeavour (including but not limited to Eliezer's writings) that makes people think !!!CRAZY!!! and !!!DOOMSDAY CULT!!!, and even if it is really they who are the crazy ones, they are nevertheless the people who populate this crazy world we're trying to fix, and the solution can't always just be "read the sequences until you're rational enough to see why this makes sense".

I realize it's a bala...

there seems to be something about this whole endeavour (including but not limited to Eliezer's writings) that makes people think !!!CRAZY!!! and !!!DOOMSDAY CULT!!!,

Yes, and it's called "pattern completion", the same effect that makes people think "Singularitarians believe that only people who believe in the Singularity will be saved".

6timtyler10yThe outside view of the pitch: * DOOM! - and SOON! * GIVE US ALL YOUR MONEY; * We'll SAVE THE WORLD; you'll LIVE FOREVER in HEAVEN; * Do otherwise and YOU and YOUR LOVED ONES will suffer ETERNAL OBLIVION! Maybe there are some bits missing - but they don't appear to be critical components of the pattern [http://en.wikipedia.org/wiki/Doomsday_cult]. Indeed, this time there are some extra features not invented by those who went before - e.g.: * We can even send you to HEAVEN if you DIE a sinner - IF you PAY MORE MONEY to our partner organisation [http://wiki.lesswrong.com/wiki/Cryonics].
8CarlShulman10yThis one isn't right, and is a big difference between religion and threats like extinction-level asteroids or AI disasters: one can free-ride if that's one's practice in collective action problems. Also: Rapture of the Nerds, Not [http://www.acceleratingfuture.com/steven/?p=21]
2cousin_it10yI don't understand why this was downvoted. It does sound like an accurate representation of the outside view.

This whole "outside view" methodology, where you insist on arguing from ignorance even where you have additional knowledge, is insane (outside of avoiding the specific biases such as planning fallacy induced by making additional detail available to your mind, where you indirectly benefit from basing your decision on ignorance).

In many cases outside view, and in particular reference class tennis, is a form of filtering the evidence, and thus "not technically" lying, a tool of anti-epistemology and dark arts, fit for deceiving yourself and others.

7Nick_Tarleton10yWe all already know about this pattern match. Its reiteration is boring and detracts from the conversation.
1timtyler10yIf this particular critique has been made more clearly elsewhere, perhaps let me know, and I will happily link to there in the future. Update 2011-05-30: There's now this recent article: The “Rapture” and the “Singularity” Have Much in Common [http://www.firstthings.com/blogs/secondhandsmoke/2011/05/20/the-rapture-and-the-singularity-have-much-in-common/] - which makes a rather similar point.
4Unknowns10yIt may have been downvoted for the caps.
3[anonymous]10yGiven that a certain fraction of comments are foolish, you can expect that an even larger fraction of votes are foolish, because there are fewer controls on votes (e.g. a voter doesn't risk his reputation while a commenter does).
2rhollerith_dot_com10yWhich is why Slashdot (which was a lot more worthwhile in the past than it is now) introduced voting on how other people vote (which Slashdot called metamoderation). Worked pretty well: the decline of Slashdot was mild and gradual compared to the decline of almost every other social site that ever reached Slashdot's level of quality.
0timtyler9yYes: votes should probably not be anonymous - and on "various other" social networking sites, they are not.
0rhollerith_dot_com9yMetafilter, for one. It is hard for an online community to avoid becoming worthless, but Metafilter has avoided that for 10 years.
3Perplexed10yPerhaps downvoted for suggesting that the salvation-for-cash [http://en.wikipedia.org/wiki/Johann_Tetzel] meme is a modern one. I upvoted, though.
2Emile10yThis is discussed in Imaginary Positions [http://lesswrong.com/lw/x1/imaginary_positions/].
4Strange710yWhat about less-smart people? I mean, self-motivated idealistic genius nerds are certainly necessary for the core functions of programming an FAI, but any sufficiently large organization also needs a certain number of people who mostly just file paperwork, follow orders, answer the phone, etc. and things tend to work out more efficiently when those people are primarily motivated by the organization's actual goals rather than its willingness to pay.
1HughRistik10yGood point. It's the people in the <130 range that SIAI needs to figure out how to attract. That's where you find people like journalists and politicians.
6wedrifid10yYou also find a lot of journalists and politicians in the 130 to 160 range but the important thing with those groups is that they optimise their beliefs and expressions thereof for appeal to a < 130 range audience.
3multifoliaterose10yLeaving aside the question of whether such apparently strong estimation is warranted in the case at hand, I would suggest that there's a serious possibility that the social taboo that you allude to is adaptive; that having a very high opinion of oneself (even if justified) is (on account of the affect heuristic) conducive to seeing a halo around oneself, developing overconfidence bias, rejecting criticisms prematurely, etc. leading to undesirable epistemological skewing. Same here. It's easy to blunt this signal. Suppose that any of: 1. A billionaire decided to devote most of his or her wealth to funding Friendly AI research. 2. A dozen brilliant academics became interested in and started doing Friendly AI research. 3. The probability of Friendly AI research leading to a Friendly AI is sufficiently low so that another existential risk reduction effort (e.g. pursuit of stable whole brain emulation) is many orders of magnitude more cost-effective at reducing existential risk than Friendly AI research. Then Eliezer would not (by most estimations) be the highest utilitarian expected value human in the world. If he were to mention such possibilities explicitly this would greatly mute the undesired connotations.
5Eliezer Yudkowsky10yIf I thought whole-brain emulation were far more effective I would be pushing whole-brain emulation, FOR THE LOVE OF SQUIRRELS!
2multifoliaterose10yGood to hear from you :-) 1. My understanding is that at present there's a great deal of uncertainty concerning how future advanced technologies are going to develop (I've gotten an impression that e.g. Nick Bostrom and Josh Tenenbaum hold this view). In view of such uncertainty, it's easy to imagine new data emerging over the next decades that makes it clear that pursuit of whole-brain emulation (or some currently unimagined strategy) is a far more effective strategy for existential risk reduction than Friendly AI research. 2. At present it looks to me like a positive singularity is substantially more likely to occur starting with whole-brain emulation than with Friendly AI. 3. Various people have suggested to me that initially pursuing Friendly AI might have higher expected value on the chance that it turns out to be easy. So I could imagine that it's rational for you personally to focus your efforts on Friendly AI research (EDIT: even if I'm correct in my estimation in the above point). My remarks in the grandparent above were not intended as a criticism of your strategy. 4. I would be interested in hearing more about your own thinking about the relative feasibility of Friendly AI vs. stable whole-brain emulation and current arbitrage opportunities for existential risk reduction, whether on or off the record.
2ata10yThat's an interesting claim, and you should post your analysis of it (e.g. the evidence and reasoning that you use to form the estimate that a positive singularity is "substantially more likely" given WBE).
1multifoliaterose10yThere's a thread with some relevant points (both for and against) titled Hedging our Bets: The Case for Pursuing Whole Brain Emulation to Safeguard Humanity's Future [http://lesswrong.com/lw/1s3/hedging_our_bets_the_case_for_pursuing_whole/]. I hadn't looked at the comments until just now and still have to read them all; but see in particular a comment by Carl Shulman [http://lesswrong.com/lw/1s3/hedging_our_bets_the_case_for_pursuing_whole/1oyg?c=1]. After reading all of the comments I'll think about whether I have something to add beyond them and get back to you.
3CarlShulman10yYou may want to read this paper [http://intelligence.org/upload/WBE-superorganisms.pdf] I presented at FHI. Note that there's a big difference between the probability of risk conditional on WBE coming first or AI coming first and marginal impact of effort. In particular some of our uncertainty is about logical facts about the space of algorithms and technology landscape, and some of it is about the extent and effectiveness of activism/intervention.
2multifoliaterose10yThanks for the very interesting reference! Is it linked on the SIAI research papers page? I didn't see it there. I appreciate this point which you've made to me previously (and which appears in your comment that I linked above!).
1Vladimir_Nesov10yDo you mean that the role of ems is in developing FAI faster (as opposed to biological-human-built FAI), or are you thinking of something else? If ems merely speed time up, they don't change the shape of the FAI challenge much, unless (and to the extent that) we leverage them in a way we can't leverage human society to reduce existential risk before FAI is complete (but this can turn out worse as well: ems could well launch the first arbitrary-goal AGI).
4ata10yThat's the main thing that's worried me about the possibility of ems coming first. But it depends on who is able to upload and who wants to, I suppose. If an average FAI researcher is more likely to upload, increase their speed, and possibly make copies of themselves than an average non-FAI AGI researcher, then it seems like that would be a reduction in risk. I'm not sure whether that would be the case — a person working on FAI is likely to consider their work to be a matter of life and death, and would want all the speed increases they could get, but an AGI researcher may feel the same way about the threat to their career and status posed by the possibility of someone else getting to AGI first. And if uploading is very expensive at first, it'll only be the most well-funded AGI researchers (i.e. not SIAI and friends) who will have access to it early on and will be likely to attempt it (if it provides enough of a speed increase that they'd consider it to be worth it). (I originally thought that uploading would be of little to no help in increasing one's own intelligence (in ways aside from thinking the same way but faster), since an emulation of a brain isn't automatically any more comprehensible than an actual brain, but now I can see a few ways it could help — the equivalent of any kind of brain surgery could be attempted quickly, freely, and reversibly, and the same could be said for experimenting with nootropic-type effects within the emulation. So it's possible that uploaded people would get somewhat smarter and not just faster. Of course, that's only soft self-improvement, nowhere near the ability to systematically change one's cognition at the algorithmic level, so I'm not worried about an upload bootstrapping itself to superintelligence (as some people apparently are). Which is good, since humans are not Friendly.)
3multifoliaterose10yThere's a lot to respond to here. Some quick points: 1. It should be borne in mind that greatly increased speed and memory may by themselves strongly affect a thinking entity. I imagine that if I could think a million times as fast I would think a lot more carefully about my interactions with the outside world than I do now. 2. I don't see any reason to think that SIAI will continue to be the only group thinking about safety considerations. If nothing else, SIAI or FHI can raise awareness of the dangers of AI within the community of AI researchers. 3. Assuming that brain uploads precede superhuman artificial intelligence, it would obviously be very desirable to have the right sort of human uploaded first. 4. I presently have a very dim view as to the prospects for modern day humans developing Friendly AI. This skepticism is the main reason why I think that pursuing whole-brain emulations first is more promising. See the comment by Carl [http://lesswrong.com/lw/1s3/hedging_our_bets_the_case_for_pursuing_whole/1p0v?c=1&context=1#comments] that I mentioned in response to Vladimir Nesov's question. Of course, my attitude on this point is subject to change with incoming evidence.
2CarlShulman10ySped-up ems have slower computers relative to their thinking speed. If Moore's Law of Mad Science means that increasing computing power allows researchers to build AI with less understanding (and thus more risk of UFAI), then a speedup of researchers relative to computing speed makes it more likely that the first non-WBE AIs will be the result of a theory-intensive approach with high understanding. Anders Sandberg of FHI and I are working on a paper exploring some of these issues.
2Vladimir_Nesov10yThis argument lowers the estimate of danger, but AIs developed on relatively slow computers are not necessarily theory-intense, could also be coding-intense, which leads to UFAI. And theory-intense doesn't necessarily imply adequate concern about AI's preference.
1multifoliaterose10yMy idea here is the same as the one that Carl Shulman mentioned [http://lesswrong.com/lw/1s3/hedging_our_bets_the_case_for_pursuing_whole/1p0v?c=1&context=1#comments] in a response to one of your comments from nine months ago.

A number of people have mentioned the seemingly-unimpeachable reputation of the Future of Humanity Institute without mentioning that its director, Nick Bostrom, fairly obviously has a high opinion of Eliezer (e.g., he invited him to contribute not one but two chapters to the volume on Global Catastrophic Risks). Heuristically, if I have a high opinion of Bostrom and the FHI project, that raises my opinion of Eliezer and decreases the probability of Eliezer-as-crackpot.

I feel that perhaps you haven't considered the best way to maximise your chance of developing Friendly AI if you were Eliezer Yudkowsky; your perspective is very much focussed on how you see it looking in from the outside. Consider for a moment that you are in a situation where you think you can make a huge positive impact upon the world, and have founded an organisation to help you act upon that.

Your first, and biggest problem is getting paid. You could take time off to work on attaining a fortune through some other means but this is not a certain bet, and will waste years that you could be spending working on the problem instead. Your best bet is to find already wealthy people who can be convinced that you can change the world, that it's for the best, and that they should donate significant sums of money to you, unless you believe this is even less certain than making a fortune yourself. There's already a lot of people in the world with the requisite amount of money to spare. I think seeking donations is the more rational path.

Now, given that you need to persuade people of the importance of your brilliant new idea which no one has really been considering before, and that to most ...

0TheAncientGeek5yA display of confidence is a good way of getting people on your side if you are right. It is also a good way of overestimating whether you are right or not.

I upvoted this, but I'm torn about this.

In your recent posts you've been slowly, carefully, thoroughly deconstructing one person. Part of me wants to break into applause at the techniques used, and learn from them, because in my whole life of manipulation I've never mounted an attack of such scale. (The paragraph saying "something has gone very wrong" was absolutely epic, to the point of evoking musical cues somewhere at the edge of my hearing. Just like the "greatly misguided" bit in your previous post. Bravo!) But another part of me feels horror and disgust because after traumatic events in my own life I'd resolved to never do this thing again.

It comes down to this: I enjoy LW for now. If Eliezer insists on creating a sealed reality around himself, what's that to me? You don't have to slay every dragon you see. Saving one person from megalomania (real or imagined) is way less important than your own research. Imagine the worst possible world: Eliezer turns into a kook. What would that change, in the grand scheme of things or in your personal life? Are there not enough kooks in AI already?

And lastly, a note about saving people. I think many of us here have had ...

I saved someone from suicide once. While the experience was certainly quite unpleasant at the time, if I had hit "ignore," as you suggest, she would have died. I don't think that I would be better off today if I had let her die, to say nothing of her. The fact that saving people is hard doesn't mean that you shouldn't do it!

It comes down to this: I enjoy LW for now. If Eliezer insists on creating a sealed reality around himself, what's that to me? You don't have to slay every dragon you see. Saving one person from megalomania (real or imagined) is way less important than your own research. Imagine the worst possible world: Eliezer turns into a kook. What would that change, in the grand scheme of things or in your personal life?

The very fate of the universe, potentially. Purely hypothetically and for the sake of the discussion:

  • If Eliezer did have the potential to provide a strong positive influence on grand scale future outcomes but was crippled by the still hypothetical lack of self-doubt then that is a loss of real value.
  • A bad 'Frodo' can be worse than no Frodo at all. If we were to give the ring to a Frodo who thought he could take on Nazgul in hand to hand combat then we would lose the ring and so lose the chance to give said ring to someone who could pull it off. Multi (and those for whom he asks such questions) have limited resources (and attention) so it may be worth deliberate investigation of potential recipients of trust.
  • Worse yet than a counterproductive Frodo would be a Frodo who
...

Er... I can't help but notice a certain humor in the idea that it's terrible if I'm self-deluded about my own importance because that means I might destroy the world.

5John_Baez10yIt's some sort of mutant version of "just because you're paranoid doesn't mean they're not out to get you".
5wedrifid10yYes, there is a certain humor. But I hope you did read the dot points and followed the reasoning. It, among other things, suggests a potential benefit of criticism such as multi's aside from hypothetical benefits of discrediting you should it have been the case that you were not, in fact, competent.
8Perplexed10yI suppose I could draw from that the inference that you have a rather inflated notion of the importance of what multi is doing here, ... but, in the immortal words of Richard Milhous Nixon, "That would be wrong." More seriously, I think everyone here realizes that EY has some rough edges, as well as some intellectual strengths. For his own self-improvement, he ought to be working on those rough edges. I suspect he is. However, in the meantime, it would be best if his responsibilities were in areas where his strengths are exploited and his rough edges don't really matter. So, just what are his current responsibilities? 1. Convincing people that UFAI constitutes a serious existential risk while not giving the whole field of futurism and existential risk reduction a bad rep. 2. Setting direction for and managing FAI and UFAI-avoidance research at SIAI. 3. Conducting FAI and UFAI-avoidance research. 4. Reviewing and doing conceptual QC on the research work product. To be honest, I don't see EY's "rough edges" as producing any problems at all with his performance on tasks #3 and #4. Only SIAI insiders know whether there has been a problem on task #2. Based on multi's arguments, I suspect he may not be doing so well on #1. So, to me, the indicated response ought to be one of the following: A. Hire someone articulate (and if possible, even charismatic) to take over task #1 and make whatever minor adjustments are needed regarding task #2. B. Do nothing. There is no problem! C. Get some academic papers published so that FAI/anti-UFAI research becomes interesting to the same funding sources that currently support CS, AI, and decision theory research. Then reconstitute SIAI as just one additional research institution which is fighting for that research funding. I would be interested in what EY thinks of these three possibilities. Perhaps for different reasons, I suspect, so would multi. [Edited to correct my hallucination
1wedrifid10yWas the first (unedited) 'you' intended? If so I'll note that I was merely answering a question within a counterfactual framework suggested by the context. I haven't even evaluated what potential importance multi's post may have - but the prior probability I have for 'a given post on LW mattering significantly' is not particularly high. I like your general analysis by the way and am always interested to know what the SIAI guys are doing along the lines of either your 1,2,3 or your A, B, C. I would seriously like to see C happen. Being able and willing to make that sort of move would be a huge step forward (and something that makes any hints of 'arrogance' seem trivial.)
3dclayh10yVeering wildly off-topic: Come on now. Humans are immortal in Tolkien, they just sit in a different waiting room. (And technically can't come back until the End of Days™, but who cares about that.)
1Strange710yAlright, then, call it her permanent resident status. If real death is off the table for everyone sapient, she's still taking as big a risk as any member of the Fellowship proper.
1dclayh10yTo be sure. I was only pointing out that her "giving up immortality" was not nearly as crazy as the words "giving up immortality" might suggest in other contexts.
1cousin_it10yWhat Eliezer said. I was arguing from the assumption that he is wrong about FAI and stuff. If he's right about the object level, then he's not deluded in considering himself important.
3Vladimir_Nesov10yBut if he is wrong about FAI and stuff, then he is still deluded not specifically about considering himself important, that implication is correct, he is deluded about FAI and stuff.
2wedrifid10yWhich, of course, would still leave the second two dot points as answers to your question.
1cousin_it10yHow so? Eliezer's thesis is "AGI is dangerous and FAI is possible". If he's wrong - if AGI poses no danger or FAI is impossible - then what do you need a Frodo for?
3Vladimir_Nesov10yThe previous post [http://lesswrong.com/lw/2lh/other_existential_risks/] was fine, but this one is sloppy [http://lesswrong.com/lw/2lr/the_importance_of_selfdoubt/2h72?c=1], and I don't think it's some kind of Machiavellian plot.
2xamdam10yBecause you were on the giving or on the receiving end of it? Agreed; personally I de-converted myself from Orthodox Judaism, but I still find it crazy when people write big scholarly books debunking the Bible; it's just a useless waste of energy (part of it is academic incentives). I haven't been involved in these situations, but taking a cue from drug addicts (who incidentally have a high suicide rate), most of them do not recover, but maybe 10% do. So most of the time you'll find frustration, but one time in 10 you'd save a life; I am not sure that's worthless.

FWIW, as an entrepreneur type I consider one of my top 3 key advantages the fact that I would actually appreciate it greatly if someone explained in detail why I was wasting my time with my current project. Thinking about this motivates me significantly because I haven't met any other entrepreneur types who I'd guess this is also true for.

5Jordan10ySemi related: I keep a big list of ideas I'd like to implement. (Start up ideas, video games ideas, math research topics.. the three things that consume me =) Quite often I'll find out someone is working on one of these ideas, and my immediate reaction is... relief. Relief, because I found out early enough not to waste my time. But, more than that, I look at my list of ideas like an orphanage: I'm always happy when one of them finds a loving parent =p Out of curiosity, what do you consider your other two key advantages?
1John_Maxwell10yI didn't actually think of 3 key advantages, just figured that would be one of the top three. Probably if I was to list others, they would be willingness to trawl through a lot of ideas before finding one and implementing it, never giving up unless it really is the rational thing to do (the flip side of the original advantage), and coding ability. (Although this guy still freaks me out: http://weblog.markbao.com/2008/how-i-built-a-webapp-in-18-hours-for-699/ [http://weblog.markbao.com/2008/how-i-built-a-webapp-in-18-hours-for-699/]) I think people often suck at following through.
1Jordan10yA kid genius entrepreneur.. awesome. You see kid genius mathematicians, chess players, musicians, etc... but an entrepreneur, that's really different. The subject matter forces him to diversify, rather than focus in on a single skill. I'm a little inspired. Agreed. Sometimes I see someone working on an idea I had and become even more motivated to work on it.
1wedrifid10yI recall Tim Ferriss relaying a tale of a young (~14) Olympian (Skier) who founded a remarkably successful business in order to support his international sport habit.

How would you address this?

http://scienceblogs.com/pharyngula/2010/08/kurzweil_still_doesnt_understa.php

It seems to me like PZ Myers really doesn't understand information theory. He's attacking Kurzweil and calling him a kook, initially over a relatively straightforward complexity estimate.

And I'm pretty confident that Myers is wrong on this, unless there is another information-rich source of inheritance besides DNA, which Myers knows about but Kurzweil and I do not.

This looks to me like a popular science blogger doing huge PR damage to everything sin...

9WrongBot10yThis analogy made me cringe. Myers is disagreeing with the claim that human DNA completely encodes the structure and functioning of the human brain: the hardware and software, roughly. Looking at the complexity of the hardware and making claims about the complexity of the software, as he does here, is completely irrelevant to his disagreement. It serves only to obscure the actual point under debate, and demonstrates that he has no idea what he's talking about.
7Risto_Saarelma10yThere seems to be a culture clash between computer scientists and biologists on this matter. DNA bit length as a back-of-the-envelope complexity estimate for a heavily compressed AGI source seems obvious to me, and, it seems, to Larry Page [http://pimm.wordpress.com/2007/02/20/googles-larry-page-at-the-aaas-meeting-entrepreneurship-in-science/] . Biologists are quick to jump to the particulars of protein synthesis and ignore the question of extra information, because biologists don't really deal with information theoretical existence proofs. It really doesn't help the matter that Kurzweil threw out his estimate when talking about getting at AGI by specifically emulating the human brain, instead of just trying to develop a general human-equivalent AI using code suitable for the computation platform used. This seems to steer most people into thinking that Kurzweil was thinking of using the DNA as literal source code instead of just a complexity yardstick. Myers seems to have pretty much gone into his creationist-bashing attack mode on this, so I don't have very high hopes for any meaningful dialogue from him.
3whpearson10yI'm still not sure what people are trying to say with this. Because the Kolmogorov complexity of the human brain given the language of the genetic code and physics is low, therefore X? What is that X precisely? Because of Kolmogorov complexity's additive constant, which could be anything from 0 to 3^^^3 or higher, I think it only gives us weak evidence for the amount of code we should expect it to take to code an AI on a computer. It is even weaker evidence for the amount of code it would take to code for it with limited resources. E.g. the laws of physics are simple and little information is taken from the womb, but to create an intelligence from them might require a quantum computer the size of the human head to decompress the compressed code. There might be short cuts to do it, but they might be of vastly greater complexity. We tend to ignore additive constants when talking about complexity classes, because human-designed algorithms tend not to have huge additive constants. Although I have come across some in my time such as this [http://books.google.co.uk/books?id=8P8KTnNLQegC&pg=PA226&lpg=PA226&dq=huge+additive+constants&source=bl&ots=GDLi5zuOz6&sig=qiQeHfxUtsF8R8IglcdNBxL0umw&hl=en&ei=tWVyTIeSKsKY4AaL1I3fCA&sa=X&oi=book_result&ct=result&resnum=1&ved=0CBUQ6AEwAA#v=onepage&q=huge%20additive%20constants&f=false] ...
3Emile10yWe have something like this going on: discrete DNA code -> lots of messy chemistry and biology -> human intelligence, and we're comparing it to: discrete computer code -> computer -> human intelligence. Kurzweil is arguing that the size of the DNA code can tell us about the max size of the computer code needed to run an intelligent brain simulation (or a human-level AI), and PZ Myers is basically saying "no, 'cause that chemistry and biology is really really messy". Now, I agree that the computer code and the DNA code are very very different ("a huge amount of enzymes interacting with each other in 3D real time" isn't the kind of thing you easily simulate on a computer), and the additive constant for converting one into the other is likely to be pretty darn big. But I also don't see a reason for intelligence to be easier to express with messy biology and chemistry than with computer code. The things about intelligence that are the closest to biology (interfacing with the real world, how one neuron functions) are also the kind of things that we can already do quite well with computer programs. There are some things that are "natural" to code in Prolog, but not natural in Fortran. So a short program in Prolog might require a long program in Fortran to do the same thing, and for different programs it might be the other way around. I don't see any reason to think that it's easier to encode intelligence in DNA than it is in computer code. (Now, Kurzweil may be overstating his case when he talks about "compressed" DNA, because to be fair you should compare that to compressed (or compiled) computer code, which translates to much more actual code. I still think the size of the DNA is a very reasonable upper limit, especially when you consider that the DNA was coded by a bloody idiot whose main design pattern is "copy-and-paste", resulting in the bloated code we know)
1whpearson10yDo you have any reason to expect it to be the same? Do we have any reason at all? I'm not arguing that it will take more than 50 MB of code, I'm arguing that the DNA value is not informative. We are far less good at doing the equivalent of changing neural structure or adding new neurons (we don't know why or how neurogenesis works for one) in computer programs.
2Emile10yIf I know a certain concept X requires 12 seconds of speech to express in English, and I don't know anything about Swahili beyond the fact that it's a human language, my first guess will be that concept X requires 12 seconds of speech to express in Swahili. I would also expect compressed versions of translations in various languages of the same book to be roughly the same size. So, even with very little information, a first estimate (with a big error margin) would be that it takes as many bits to "encode" intelligence in DNA as it does in computer code. In addition, the fact that some intelligence-related abilities such as multiplying large numbers are easy to express in computer code, but rare in nature would make me revise that estimate towards "code as more expressive than DNA for some intelligence-related stuff". In addition, knowledge about the history of evolution would make me suspect that large chunks of the human genome are not required for intelligence, either because they aren't expressed, or because they only concern traits that have no impact on our intelligence beyond the fact of keeping us alive. That would also make me revise my estimate downwards for the code size needed for intelligence. None of those are very strong reasons, but they are reasons nonetheless!
4Kingreaper10yThe environment is information-rich, especially the social environment. Myers makes it quite clear that interactions with the environment are an expected input of information in his understanding. Do you disagree with information input from the environment?
4JamesAndrix10yYes, I disagree. If he's not talking about some stable information that is present in all environments that yield intelligent humans, then what's important is a kind of information that can be mass generated at low complexity cost. Even language exposure is relatively low complexity, and the key parts might be inferable from brain processes. And we already know how to offer a socially rich environment, so I don't think it should add to the complexity costs of this problem. And I think a reverse engineering of a newborn baby brain would be quite sufficient for Kurzweil's goal. In short: we know intelligent brains get reliably generated. We know it's very complex. The source of that complexity must be something information rich, stable, and universal. I know of exactly one such source. Right now I'm reading Myers's argument as "a big part of human heredity is memetic rather than just genetic, and there is complex interplay between genes and memes, so you've got to count the memes as part of the total complexity." I say that Kurzweil is trying to create something compatible with human memes in the first place, so we can load them the same way we load children (at worst). And even if some classes of memes (age appropriate language exposure) do interact tightly with genes, their information content is not all that high.
4Emile10yI see it that way too. The DNA can give us an upper bound on the information needed to create a human brain, but PZ Myers reads that as "Kurzweil is saying we will be able to take a strand of DNA and build a brain from that in the next 10 years!", and then proceeds to attack that straw man. This, however: ... I am quite inclined to trust. I would trust it more if it wasn't followed by wrong statements about information theory (that seem wrong to me, at least). Looking at the comments is depressing. I wish there was some "sane" way for two communities (readers of PZ Myers and "singularitarians") to engage without it degenerating into name-calling. Though there are software solutions for that (takeonit and other stuff that's been discussed here), it wouldn't hurt either if the "leaders" (PZ Myers, Kurzweil, etc.) were a bit more responsible and made a genuine effort to acknowledge the other's points when they are strong. So they could converge or at least agree to disagree on something narrow. But nooo, it's much more fun to get angry, and it gets you more traffic too!
2knb10yMyers has always had a tendency to attack other people's arguments like enemy soldiers. A good example is his take on evolutionary psychology, which he hates so much it is actually funny. He also claims to have desecrated a consecrated host (the sacramental wafers Catholics consider to be the body of Jesus). That will show those evil theists how a good, rational person behaves!
2Mitchell_Porter10yMyers' thesis is that you are not going to figure out by brute-force physical simulation how the genome gives rise to the organism, knowing just the genomic sequence. On every scale - molecule, cell, tissue, organism - there are very complicated boundary conditions at work. You have to do experimental biology, observe those boundary conditions, and figure out what role they play. I predict he would be a lot more sympathetic if Kurzweil was talking about AIs figuring out the brain by doing experimental biology, rather than just saying genomic sequence + laws of physics will get us there.
5Perplexed10yAnd he is quite possibly correct. However, that has nothing at all to do with what Kurzweil said. I predict he would be more sympathetic if he just made the effort to figure out what Kurzweil said. But, of course, we all know there is no chance of that, so "conjecture" might be a better word than "predict".
2Mitchell_Porter10yMyers doesn't have an argument against Kurzweil's estimate of the brain's complexity. But his skepticism about Kurzweil's timescale can be expressed in terms of the difficulty of searching large spaces. Let's say it does take a million lines of code to simulate the brain. Where is the argument that we can produce the right million lines of code within twenty years? The space of million-line programs is very large.
1Perplexed10yI agree, both regarding timescale, and regarding reason for timescale difficulties. As I understand Kurzweil, he is saying that we will build the AI, not by finding the program for development and simulating it, but rather by scanning the result of the development and duplicating it in a different medium. The only relevance of that hypothetical million-line program is that it effectively puts a bound on the scanning and manufacturing tolerances that we need to achieve. Well, while it is probably true in general that we don't need to get the wiring exactly right on all of the trillions of neurons, there may well be some where the exact right embryonic wiring is crucial to success. And, since we don't yet have or understand that million-line program that somehow gets the wiring right reliably, we probably won't get them right ourselves. At least not at first. It feels a little funny to find myself making here an argument right out of Bill Dembski's playbook. No free lunch! Needle in a haystack. Only way to search that space is by exhaustion. Well, we shall see what we shall see.
3SilasBarta10yI agree, but at the same time, I wish biologists would learn more information theory, since their focus should be identifying the information flows going on, as this is what will lead us to a comprehensible model of human development and functionality. (I freely admit I don't have years in the trenches, so this may be a naive view, but if my experience with any other scientific turf war is any guide, this is important advice.)
2Paul Crowley10yThis was cited to me in a blog discussion as "schoolboy biology EY gets wrong" (he said something similar, apparently).

The real bone of contention here seems to be the long chain of inference leading from common scientific/philosophical knowledge to the conclusion that uFAI is a serious existential risk. Any particular personal characteristics of EY would seem irrelevant till we have an opinion on that set of claims.

If EY were working on preventing asteroid impacts with Earth, and he were the main driving force behind that effort, he could say "I'm trying to save the world" and nobody would look at him askance. That's because asteroid impacts have definitely caus...

3jimrandomh10yYou shouldn't deny knowledge of how strong claims are, and refer to those claims as "a house of cards" in the same sentence. Those two claims are mutually exclusive, and putting them close together like this set off my propagandometer.
2Simulation_Brain10yUpvoted; the issue of FAI itself is more interesting than whether Eliezer is making an ass of himself and thereby the SIAI message (probably a bit; claiming you're smart isn't really smart, but then he's also doing a pretty good job as publicist). One form of productive self-doubt is to have the LW community critically examine Eliezer's central claims. Two of my attempted simplifications of those claims are posted here [http://lesswrong.com/lw/2lh/other_existential_risks/2h4s?c=1] and here [http://lesswrong.com/lw/2l0/should_i_believe_what_the_siai_claims/2fbc?c=1] on related threads. Those posts don't really address whether strong AI is feasible; I think most AI researchers agree that it will become so, but disagree on the timeline. I believe it's crucial but rarely recognized that the timeline really depends on how many resources are devoted to it. Those appear to be steadily increasing, so it might not be that long.

It seems like an implication of your post that no one is ever allowed to believe they're saving the world. Do you agree that this is an implication? If not, why not?

Not speaking for multi, but, in any x-risk item (blowing up asteroids, stabilizing nuclear powers, global warming, catastrophic viral outbreak, climate change of whatever sort, FAI, whatever) for those working on the problem, there are degrees of realism:

"I am working on a project that may have massive effect on future society. While the chance that I specifically am a key person on the project are remote, given the fine minds at (Google/CDC/CIA/whatever), I still might be, and that's worth doing." - Probably sane, even if misguided.

"I am working on a project that may have massive effect on future society. I am the greatest mind in the field. Still, many other smart people are involved. The specific risk I am worried about may or not occur, but efforts to prevent its occurrence are valuable. There is some real possibility that I will the critical person on the project." - Possibly sane, even if misguided.

"I am working on a project that will save a near-infinite number of universes. In all likelihood, only I can achieve it. All of the people - even people perceived as having better credentials, intelligence, and ability - cannot do what I am doing. All crit... (read more)

[anonymous]10y 10

I don't think Eliezer believes he's irreplaceable, exactly. He thinks, or I think he thinks, that any sufficiently intelligent AI which has not been built to the standard of Friendliness (as he defines it) is an existential risk. And the only practical means for preventing the development of UnFriendly AI is to develop superintelligent FAI first. The team to develop FAI needn't be SIAI, and Eliezer wouldn't necessarily be the most important contributor to the project, and SIAI might not ultimately be equal to the task. But if he's right about the risk and the solution, and his untimely demise were to doom the world, it would be because no-one else tried to do this, not because he was the only one who could.

Not that this rules out your interpretation. I'm sure he has a high opinion of his abilities as well. Any accusation of hubris should probably mention that he once told Aubrey de Grey "I bet I can solve ALL of Earth's emergency problems before you cure aging."

2JamesAndrix10yThere may be multiple different projects, each necessary to save the world, and each having a key person who knows more about the project, and/or is more driven and/or is more capable than anyone else. Each such person has weirdly high expected utility, and could accurately make a statement like EY's and still not be the person with the highest expected utility. Their actual expected utility would depend on the complexity of the project and the surrounding community, and how much the success of the project alters the value of human survival. This is similar to the idea that responsibility is not a division of 100%. http://www.ranprieur.com/essays/mathres.html [http://www.ranprieur.com/essays/mathres.html]
2Jonathan_Graehl10yWhat you say sounds reasonable, but I feel it's unwise for me to worry about such things. If I were to sound such a vague alarm, I wouldn't expect anyone to listen to me unless I'd made significant contributions in the field myself (I haven't).
4Unknowns10yMultifoliaterose said this: Note that there are qualifications on this. If you're standing by the button that ends the world, and refuse to press it when urged, or you prevent others from pressing it (e.g. Stanislav Petrov), then you may reasonably believe that you're saving the world. But no, you may not reasonably believe that you are saving the world based on long chains of reasoning based on your intuition, not on anything as certain as mathematics and logic, especially decades in advance of anything happening.
2Eliezer Yudkowsky10yIt seems like an implication of this and other assumptions made by multi, and apparently shared by you, is that no one can believe themselves to be critical to a Friendly AI project that has a significant chance of success. Do you agree that this is an implication? If not, why not?
5Unknowns10yNo, I don't agree this is an implication. I would say that no one can reasonably believe all of the following at the same time with a high degree of confidence: 1) I am critical to this Friendly AI project that has a significant chance of success. 2) There is no significant chance of Friendly AI without this project. 3) Without Friendly AI, the world is doomed. But then, as you know, I don't consider it reasonable to put a high degree of confidence in number 3. Nor do many other intelligent people (such as Robin Hanson.) So it isn't surprising that I would consider it unreasonable to be sure of all three of them. I also agree with Tetronian's points.
4Eliezer Yudkowsky10yI see. So it's not that any one of these statements is a forbidden premise, but that their combination leads to a forbidden conclusion. Would you agree with the previous sentence? BTW, nobody please vote down the parent below -2, that will make it invisible. Also it doesn't particularly deserve downvoting IMO.
5Perplexed10yI would suggest that, in order for this set of beliefs to become (psychiatrically?) forbidden, we need to add a fourth item. 4) Dozens of other smart people agree with me on #3. If someone believes that very, very few people yet recognize the importance of FAI, then the conjunction of beliefs #1 thru #3 might be reasonable. But after #4 becomes true (and known to our protagonist), then continuing to hold #1 and #2 may be indicative of a problem.
5Eliezer Yudkowsky10yDozens isn't sufficient. I asked Marcello if he'd run into anyone who seemed to have more raw intellectual horsepower than me, and he said that John Conway gave him that impression. So there are smarter people than me upon the Earth, which doesn't surprise me at all, but it might take a wider net than "dozens of other smart people" before someone comes in with more brilliance and a better starting math education and renders me obsolete.
9[anonymous]10yJohn Conway is smarter than me, too. [http://www.google.com/search?q=look+who+thinks+he%27s+nothing]
8Spurlock10ySimply out of curiosity: Plenty of criticism (some of it reasonable) has been lobbed at IQ tests and at things like the SAT. Is there a method known to you (or anyone reading) that actually measures "raw intellectual horsepower" in a reliable and accurate way? Aside from asking Marcello.

Aside from asking Marcello.

I was beginning to wonder if he's available for consultation.

6rabidchicken10yRead the source code, and then visualize a few levels from Crysis or Metro 2033 in your head. While you render it, count the average frames per second. Alternatively, see how quickly you can find the prime factors of every integer from 1 to 1000. Which is to say... Humans in general have extremely limited intellectual power. Instead of calculating things efficiently, we work by using various tricks with caches and memory to find answers. Therefore, almost all tasks are more dependent on practice and interest than they are on intelligence. So, rather than testing the statement "Eliezer is smart" it has more bearing on this debate to confirm "Eliezer has spent a large amount of time optimizing his cache for tasks relating to rationality, evolution, and artificial intelligence". Intelligence is overrated.
3XiXiDu10ySheer curiosity, but have you or anyone ever contacted John Conway about the topic of u/FAI and asked him what he thinks about the topic, the risks associated with it and maybe the SIAI itself?
1xamdam10y"raw intellectual power" != "relevant knowledge". Looks like he worked on some game theory, but otherwise not much relevancy. Should we ask Stephen Hawking? Or take a poll of Nobel Laureates? I am not saying that he can't be brought up to date in this kind of discussion, and has a lot to consider, but not asking him as things are indicates little.
1Perplexed10yCandid, and fair enough.
0whowhowho8yRaw intellectual horsepower is not the right kind of smart.
2Perplexed10yWith the hint from EY on another branch, I see a problem in my argument. Our protagonist might circumvent my straitjacket by also believing 5) The key to FAI is TDT, but I have been so far unsuccessful in getting many of those dozens of smart people to listen to me on that subject. I now withdraw from this conversation with my tail between my legs.
1katydee10yAll this talk of "our protagonist," as well as the weird references to SquareSoft games, is very off-putting for me.
2Unknowns10yI wouldn't put it in terms of forbidden premises or forbidden conclusions. But if each of these statements has a 90% chance of being true, and if they are assumed to be independent (which admittedly won't be exactly true), then the probability that all three are true would be only about 70%, which is not an extremely high degree of confidence; more like saying, "This is my opinion but I could easily be wrong." Personally I don't think 1) or 3), taken in a strict way, could reasonably be said to have more than a 20% chance of being true. I do think a probability of 90% is a fairly reasonable assignment for 2), because most people are not going to bother about Friendliness. Accounting for the fact that these are not totally independent, I don't consider a probability assignment of more than 5% for the conjunction to be reasonable. However, since there are other points of view, I could accept that someone might assign the conjunction a 70% chance in accordance with the previous paragraph, without being crazy. But if you assign a probability much more than that I would have to withdraw this. If the statements are weakened as Carl Shulman suggests, then even the conjunction could reasonably be given a much higher probability. Also, as long as it is admitted that the probability is not high, you could still say that the possibility needs to be taken seriously because you are talking about the possible (if yet improbable) destruction of the world.
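
The arithmetic in the comment above can be checked with a minimal sketch. It assumes the three statements are independent, which the comment itself says is only approximately true; the helper name `conjunction` and the variable names are illustrative, not anything from the thread.

```python
from math import prod

def conjunction(probabilities):
    """Joint probability of several claims, ASSUMING they are independent."""
    return prod(probabilities)

# Each of the three statements at 90%: the "about 70%" figure.
all_at_90 = conjunction([0.9, 0.9, 0.9])

# The commenter's own ceilings: 20% for 1), 90% for 2), 20% for 3).
commenter_ceiling = conjunction([0.2, 0.9, 0.2])

print(f"{all_at_90:.3f}")          # 0.729
print(f"{commenter_ceiling:.3f}")  # 0.036
```

Under strict independence the commenter's ceilings multiply to about 3.6%, which sits below the 5% bound stated once dependence between the statements is allowed for.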

I certainly do not assign a probability as high as 70% to the conjunction of all three of those statements.

And in case it wasn't clear, the problem I was trying to point out was simply with having forbidden conclusions - not forbidden by observation per se, but forbidden by forbidden psychology - and using that to make deductions about empirical premises that ought simply to be evaluated by themselves.

I s'pose I might be crazy, but you all are putting your craziness right up front. You can't extract milk from a stone!

3PaulAlmond10yJust curious (and not being 100% serious here): Would you have any concerns about the following argument (and I am not saying I accept it)? 1. Assume that famous people will get recreated as AIs in simulations a lot in the future. School projects, entertainment, historical research, interactive museum exhibits, idols to be worshipped by cults built up around them, etc. 2. If you save the world, you will be about the most famous person ever in the future. 3. Therefore there will be a lot of Eliezer Yudkowsky AIs created in the future. 4. Therefore the chances of anyone who thinks he is Eliezer Yudkowsky actually being the original, 21st century one are very small. 5. Therefore you are almost certainly an AI, and none of the rest of us are here - except maybe as stage props with varying degrees of cognition (and you probably never even heard of me before, so someone like me would probably not get represented in any detail in an Eliezer Yudkowsky simulation). That would mean that I am not even conscious and am just some simple subroutine. Actually, now I have raised the issue to be scary, it looks a lot more alarming for me than it does for you as I may have just argued myself out of existence...
2wedrifid10yThat doesn't seem scary to me at all. I still know that there is at least one of me that I can consider 'real'. I will continue to act as if I am one of the instances that I consider me/important. I've lost no existence whatsoever.
2Unknowns10yThat's good to know. I hope multifoliaterose reads this comment, as he seemed to think that you would assign a very high probability to the conjunction (and it's true that you've sometimes given that impression by your way of talking.) Also, I didn't think he was necessarily setting up forbidden conclusions, since he did add some qualifications allowing that in some circumstances it could be justified to hold such opinions.
2CarlShulman10y1) can be finessed easily on its own with the idea that since we're talking about existential risk even quite small probabilities are significant. 3) could be finessed by using a very broad definition of "Friendly AI" that amounted to "taking some safety measures in AI development and deployment." But if one uses the same senses in 2), then one gets the claim that most of the probability of non-disastrous AI development is concentrated in one's specific project, which is a different claim than "project X has a better expected value, given what I know now about capacities and motivations, than any of the alternatives (including future ones which will likely become more common as a result of AI advance and meme-spreading independent of me) individually, but less than all of them collectively."
5WrongBot10yWho else is seriously working on FAI right now? If other FAI projects begin, then obviously updating will be called for. But until such time, the claim that "there is no significant chance of Friendly AI without this project" is quite reasonable, especially if one considers the development of uFAI to be a potential time limit.
5CarlShulman10yPeople who will be running DARPA, or Google Research, or some hedge fund's AI research group in the future (and who will know about the potential risks or be able to easily learn if they find themselves making big progress) will get the chance to take safety measures. We have substantial uncertainty about how extensive those safety measures would need to be to work, how difficult they would be to create, and the relevant timelines. Think about resource depletion or climate change: even if the issues are neglected today relative to an ideal level, as a problem becomes more imminent, with more powerful tools and information to deal with it, you can expect to see new mitigation efforts spring up (including efforts by existing organizations such as governments and corporations). However, acting early can sometimes have benefits that outweigh the lack of info and resources available further in the future. For example, geoengineering technology can provide insurance against very surprisingly rapid global warming, and cheap plans that pay off big in the event of surprisingly easy AI design may likewise have high expected value. Or, if AI timescales are long, there may be slowly compounding investments, like lines of research or building background knowledge in elites, which benefit from time to grow. And to the extent these things are at least somewhat promising, there is substantial value of information to be had by investigating now (similar to increasing study of the climate to avoid nasty surprises).
2DanielVarga10yEveryone is allowed to believe they're saving the world. It is two other things, both quite obvious. First, we do not say it out loud if we don't want to appear kooky. Second, if someone really believes that he is literally saving the world, then he can be sure that he has a minor personality disorder [1], regardless of whether he will eventually save the world or not. Most great scientists are eccentric, so this is not a big deal, if you manage to incorporate it into your probability estimates while doing your job. I mean, this bias obviously affects your validity estimate for each and every argument you hear against hard AI takeoff. (I don't think your debaters so far did a good job bringing up such counterarguments, but that's beside the point.) [1] by the way, in this case (in your case) grandiosity is the correct term, not delusions of grandeur.
8Eliezer Yudkowsky10ySo you'd prohibit someone of accurate belief? I generally regard that as a reductio.
3Tyrrell_McAllister10yIf a billion people buy into a 1-in-a-billion raffle, each believing that he or she will win, then every one of them has a "prohibited" belief, even though that belief is accurate in one case.
6Paul Crowley10yStanislav Petrov had this disorder? In thinking he was making the world a safer place, Gorbachev had this disorder? It seems a stretch to me to diagnose a personality disorder based on an accurate view of the world.
3DanielVarga10yGorbachev was leading an actual superpower, so his case is not very relevant in a psychological analysis of grandiosity. At the time of the famous incident, Petrov was too busy to think about his status as a world-savior. And it is not very relevant here what he believed after saving the world. I didn't mean to talk about an accurate view of the world. I meant to talk about a disputed belief about a future outcome. I am not interested in the few minutes while Petrov may have had the accurate view that he is currently saving the world.

I don't quite understand your confusion. An AGI is a computer program, and friendliness is a property of a computer program. Yes, these concepts allude to mental concepts on our maps, but these mental concepts are reducible to properties of the nonmental substrates that are our brains. In fact, the goal of FAI research is to find the reduction of friendliness to nonmental things.

Imagine an AI as intelligent and well informed as an FAI, but one without much power - as a result of physical safeguards, say

There's some part of my brain that just processes "the Internet" as a single person and wants to scream "But I told you this a thousand times already!"

http://yudkowsky.net/singularity/aibox

2dclayh10yEliezer, while you're defending yourself from charges of self-aggrandizement, it troubles me a little bit that AI Box page states that your record is 2 for 2, and not 3 for 5 [http://lesswrong.com/lw/up/shut_up_and_do_the_impossible/].
4Eliezer Yudkowsky10yObviously I'm not trying to keep it a secret. I just haven't gotten around to editing.
1dclayh10yI'm sure that's the case, I'm just saying it looks bad. Presumably you'd like to be Caesar's wife?
2steven046110ySurely it's possible to imagine a successfully boxed AI.
3wedrifid10yI could imagine successfully beating Rybka at chess too. But it would be foolish of me to take any actions that considered it as a serious possibility. If motivated humans cannot be counted on to box an Eliezer then expecting a motivated, overconfident and prestige seeking AI creator to successfully box his AI creation is reckless in the extreme.
2steven046110yWhat Eliezer seemed to be objecting to was someone proposing a successfully boxed AI as an example of why "able to destroy humanity" can't be a part of the definition of "AI" (or more charitably, "artificial superintelligence"). For boxed AI to be such an example (as opposed to a good idea to actually strive toward), it only has to be not knowably impossible.
1ata10yI see your point there. But I think this discussion sort of went in an irrelevant direction, albeit probably my fault for not being clear enough. When I put "powerful enough to destroy humanity" in that criterion, I mainly meant "powerful" as in "really powerful optimization process", mathematical optimization power, not "power" as in direct influence over the world. We're inferring that the former will usually lead fairly easily to the latter, but they are not identical. So "powerful enough to destroy humanity" would mean something like "powerful enough to figure out a good subjunctive plan to do so given enough information about the world, even if it has no output streams and is kept in an airtight safe at the bottom of the ocean".

And FAI counts as not "supernatural" how?

In the ordinary sense that Richard Dawkins and James Randi use.

In any case, nuclear war, peak oil, global warming, overpopulation attracted a huge number of people who claimed that civilization will end unless this or that will be done.

"If we don't continue to practice agriculture or hunting and gathering, civilization will end."

There are plenty of true statements like that. Your argument needs people who said that such and such things needed to be done, and that they were the ones who we... (read more)

I agree about the benefits of a larger research community, although the feasibility of "collaborating with existing institutions" is in question, due to the extreme difficulty of communicating the problem statement. There are also serious concerns about the end-game, where it will be relatively easy to instantiate a random-preference AGI on the basis of tools developed in the course of researching FAI.

Although the instinct is to say "Secrecy in science? Nonsense!", it would also be an example of outside view, where one completes a pattern wh... (read more)

1cousin_it10yHey, three days have passed and I want that post!
1Vladimir_Nesov10yI have an excuse, I got a cold!
3cousin_it10yOkay hurry up then, you're wasting lives in our future light cone.
2wedrifid10y"Shut up and do the temporarily inconvenient!"
1cousin_it10yThree more days have passed.
1Vladimir_Nesov10yPlanning is the worst form of procrastination. I now have 7 (!) posts planned before the roadmap post I referred to (with the roadmap post closing the sequence), so I decided on writing a mini-sequence of 2-3 posts on LW about ADT first.
1multifoliaterose10yMaybe things could gradually change with more interface between people who are interested in FAI and researchers in academia. I agree with this and believe that this could justify secrecy, but I think that it's very important that we hold the people who we trust with the end-game to very high standards for demonstrated epistemic rationality and scrupulousness. I do not believe that the SIAI staff have met such standards. My belief in this regard is a major reason why I'm pursuing my current trajectory of postings.

An interesting post, well written, upvoted. The mere existence of such posts here constitutes a proof that LW is still far from Objectivism, not only because Eliezer is way more rational (and compassionate) than Ayn Rand, but mainly because the other people here are aware of the dangers of cultism.

However, I am not sure whether the right way to prevent cultish behaviour (whether the risk is real or not) is to issue warning like this to the leader (or any sort of warning, perhaps). The dangers of cultism emerge from simply having a leader; whatever the level of per... (read more)

6Paul Crowley10yThe case for devoting all of your altruistic efforts to a single maximally efficient cause seems strong to me, as does the case that existential risk mitigation is that maximally efficient cause. I take it you're familiar with that case (though see eg "Astronomical Waste" if not) so I won't set it all out again here. If you think I'm mistaken, actual counter-arguments would be more useful than emotional reactions.
3prase10yI don't object to devoting (almost) all efforts to a single cause generally. I do, however, object to such devotion in case of FAI and the Singularity. If a person devotes all his efforts to a single cause, his subjective feeling of importance of the cause will probably increase and most people will subsequently overestimate how important the cause is. This danger can be faced by carefully comparing the results of one's deeds with the results of other people's efforts, using a set of selected objective criteria, or measure it using some scale ideally fixed at the beginning, to protect oneself from moving the goalposts. The problem is, if the cause is put so far in the future and based so much on speculations, there is no fixed point to look at when countering one's own biases, and the risk of a gross overestimation of one's agenda becomes huge. So the reason why I dislike the mentioned suggestions (and I am speaking, for example, about the idea that it is a strict moral duty for everybody who can to support the FAI research as much as they can, which was implicitly present at least in the discussions about the forbidden topic) is not that I reject single-cause devotion in principle (although I like to be wary about it in most situations), but that I assign too low probability to the correctness of the underlying ideas. The whole business is based on future predictions of several tens or possibly hundreds of years in advance, which is historically a very unsuccessful discipline. And I can't help but include it in that reference class. Simultaneously, I don't accept the argument of very huge utility difference between possible outcomes, which should justify one's involvement even if the probability of success (or even probability that the effort makes sense) is extremely low. Pascal-wageresque reasoning is unreliable, even if formalised, because it needs careful and precise estimation of probabilities close to 1 or 0, which humans are provably bad at.
5Wei_Dai10yAssuming you're right, why doesn't rejection of Pascal-like wagers also require careful and precise estimation of probabilities close to 1 or 0?
2prase10yI use a heuristic which tells me to ignore Pascal-like wagers and to do whatever I would do if I hadn't learned about the wager (in first approximation). I don't behave like a utilitarian in this case, so I don't need to estimate the probabilities and utilities. (I think if I did, my decision would be fairly random, since the utilities and probabilities included would be almost certainly determined mostly by the anchoring effect).
6Perplexed10yI am not sure exactly what using this heuristic entails. I certainly understand the motivation behind the heuristic: * when you multiply an astronomical utility (disutility) by a minuscule probability, you may get an ordinary-sized utility (disutility), apparently suitable for comparison with other ordinary-sized utilities. Don't trust the results of this calculation! You have almost certainly made an error in estimating the probability, or the utility, or both. But how do you turn that (quite rational IMO) lack of trust into an action principle? I can imagine 4 possible precepts: * Don't buy lottery tickets * Don't buy insurance * Don't sell insurance * Don't sell back lottery tickets you already own. Is it rationally consistent to follow all 4 precepts, or is there an inconsistency?
4timtyler10yAnother red flag is when someone else helpfully does the calculation for you - and then expects you to update on the results. Looking at the long history of Pascal-like wagers, that is pretty likely to be an attempt at manipulation.
2timtyler10y"I believe SIAI’s probability of success is lower than what we can reasonably conceptualize; this does not rule it out as a good investment (since the hoped-for benefit is so large), but neither does the math support it as an investment (donating simply because the hoped-for benefit multiplied by the smallest conceivable probability is large would, in my view, be a form of falling prey to “Pascal’s Mugging”)." * http://blog.givewell.org/2009/04/20/the-most-important-problem-may-not-be-the-best-charitable-cause/
1ShardPhoenix10yWhat do those examples have to do with anything? In those cases we actually know the probabilities so they're not Pascal's-Wager-like scenarios.
1Perplexed10ySo, what is the probability that my house will burn? It may depend on whether I start smoking again. I hope the probability of both is low, but I don't know what it is. I'm not sure exactly what the definition of Pascal's-Wager-like should be. Is there a definition I should read? Should we ask Prase what he meant? I understood the term to mean anything involving small estimated probabilities and large estimated utilities.
2Paul Crowley10yWhich of the axioms of the Von Neumann–Morgenstern utility theorem [http://en.wikipedia.org/wiki/Von_Neumann%E2%80%93Morgenstern_utility_theorem] do you reject?
3Wei_Dai10yI think the theorem implicitly assumes logical omniscience, and using heuristics instead of doing explicit expected utility calculations should make sense in at least some types of situations for us. The question is whether it makes sense in this one. I think this is actually an interesting question. Is there an argument showing that we can do better than prase's heuristic of rejecting all Pascal-like wagers, given human limitations?
1prase10yIf I had to describe my actual choices, I don't know. No one necessarily, any of the axioms possibly. My inner decision algorithm is probably inconsistent in different ways, I don't believe for example that my choices always satisfy transitivity. What I wanted to say is that although I know that my decisions are somewhat irrational and thus sub-optimal, in some situations, like Pascal wagers, I don't find consciously creating a utility function and calculating the right decision to be an attractive solution. It would help me to be marginally more rational (as given by the VNM definition), but I am convinced that the resulting choices would be fairly arbitrary and probably would not reflect my actual preferences. In other words, I can't reach some of my preferences by introspection, and think that an actual attempt to reconstruct a utility function would sometimes do worse than a simple, although inconsistent, heuristic.
3CarlShulman10yThe best way to advance this goal is probably to write an interesting top-level post.
4prase10yI agree. However not everybody is able to.
1multifoliaterose10yThanks for correcting the misspelling! Totally agree about LW vs. Objectivism.

Let me put it this way then: Most LW readers don't like reading unproductive conversations. And it is hard to get more unproductive than one person saying "I believe in X!" and another saying "Yeah, well I believe ~X, so there!" You are welcome to do that, but don't be surprised if the rest of us decide to vote down such comments as things we don't want.

But why is the estimate that I gave obviously on the wrong order of magnitude?

The original statement was

I assign a probability of less than 10^(-9) to you succeeding in playing a critical role on the Friendly AI project that you're working on.

The way to estimate probabilities like that is to break them into pieces. This one divides naturally into two pieces: the probability that an AGI will be created in the not-too-distant future, and the probability that Eliezer will play a critical role if it is. For the former, I estimate a probability of... (read more)

There's no particular reason to doubt that a significant amount of the final data is encoded in the gestational environment.

To the contrary, there is every reason to doubt that. We already know that important pieces of the gestational environment (the genetic code itself, core metabolism, etc.) are encoded in the genome. By contrast, the amount of epigenetic information that we know of is minuscule. It is, of course, likely that we will discover more, but it is very unlikely that we will discover much more. The reason for this skepticism is that we ... (read more)

1timtyler10yI think you may have missed my devastating analysis of this issue a couple of years back: "So, who is right? Does the brain's design fit into the genome? - or not? The detailed form of proteins arises from a combination of the nucleotide sequence that specifies them, the cytoplasmic environment in which gene expression takes place, and the laws of physics. We can safely ignore the contribution of cytoplasmic inheritance - however, the contribution of the laws of physics is harder to discount. At first sight, it may seem simply absurd to argue that the laws of physics contain design information relating to the construction of the human brain. However there is a well-established mechanism by which physical law may do just that - an idea known as the anthropic principle. This argues that the universe we observe must necessarily permit the emergence of intelligent agents. If that involves coding the design of the brains of intelligent agents into the laws of physics then: so be it. There are plenty of apparently-arbitrary constants in physics where such information could conceivably be encoded: the fine structure constant, the cosmological constant, Planck's constant - and so on. At the moment, it is not even possible to bound the quantity of brain-design information so encoded. When we get machine intelligence, we will have an independent estimate of the complexity of the design required to produce an intelligent agent. Alternatively, when we know what the laws of physics are, we may be able to bound the quantity of information encoded by them. However, today neither option is available to us." * http://alife.co.uk/essays/how_long_before_superintelligence/
3Perplexed10yYou suggest that the human brain might have a high Kolmogorov complexity, the information for which is encoded, not in the human genome (which contains a mere 7 gigabits of information), but rather in the laws of physics, which contain arbitrarily large amounts of information, encoded in the exact values of physical constants. For example, the first 30 billion decimal digits of the fine structure constant contain 100 gigabits of information, putting the genome to shame. Do I have that right? Well, I will give you points for cleverness, but I'm not buying it. I doubt that it much matters what the constants are, out past the first hundred digits or so. Yes, I realize that the details of how the universe proceeds may be chaotic; it may involve sensitive dependence both on initial conditions and on physical constants. But I don't think that really matters. Physical constants haven't changed since the Cambrian, but genomes have. And I think that it is the change in genomes which led to the human brain, the dolphin brain, the parrot brain, and the octopus brain. Alter the fine structure constant in the 2 billionth decimal place, and those brain architectures would still work, and those genomes would still specify development pathways leading to them. Or so I believe.
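
The "100 gigabits" figure in the comment above follows from the fact that one decimal digit carries log2(10) bits, about 3.32. A small sketch of that conversion follows; the function name is illustrative, and the 7-gigabit genome figure is simply the number quoted in the comment, not an independent estimate.

```python
from math import log2

BITS_PER_DECIMAL_DIGIT = log2(10)  # about 3.32 bits per digit

def gigabits_in_decimal_digits(n_digits):
    """Information capacity, in gigabits, of n_digits arbitrary decimal digits."""
    return n_digits * BITS_PER_DECIMAL_DIGIT / 1e9

digits_gigabits = gigabits_in_decimal_digits(30e9)  # 30 billion digits
genome_gigabits = 7.0                               # figure quoted in the comment

print(round(digits_gigabits, 1))                    # 99.7
print(round(digits_gigabits / genome_gigabits, 1))  # 14.2
```

This only checks the capacity arithmetic. It says nothing about whether digits that far out have any physical effect, which is the point the comment goes on to dispute.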

I find it ironic that multifoliaterose said

I personally think that the best way to face the present situation is to gather more information about all existential risks rather than focusing on one particular existential risk

and then the next post, instead of delineating what he found out about other existential risks (or perhaps how we should go about doing that), is about how to save Eliezer.

Right, but the historical precedent for an amateur scientist even being at all involved in a substantial scientific breakthrough over the past 50 years is very weak.

Hold on - there are two different definitions of the word "amateur" that could apply here, and they lead to very different conclusions. The definition I think of first is that an amateur at something is someone who doesn't get paid for doing it, as opposed to a professional who makes a living at it. By this definition, amateurs rarely achieve anything, and if they do, they usuall... (read more)

might lead him to focus on inspiring others and on existential risk reduction advocacy (things that he has demonstrated capacity to do very well) rather than Friendly AI research

That would absolutely be a waste. If for some reason he was only to engage in advocacy from now on, it should specifically be Friendly AI advocacy. I point again to the huge gaping absence of other people who specialize in this problem and who have worthwhile ideas. The other "existential risks" have their specialized advocates. No-one else remotely comes close to fill...

I think there's a limit on how much you can disagree with other human beings-- unless you're claiming to be something superhuman.

At least for epistemic meanings of "superhuman", that's pretty much the whole purpose of LW, isn't it?

Did you see the link to this comment thread? I would like to see your response to the discussion there.

My immediate response is as follows: yes, dependency relations might concentrate most of the improbability of a religion to a relatively small subset of its claims. But the point is that those claims themselves ...

7Unknowns10yLet's pick an example. How probable do you think it is that Islam is a true religion? (There are several ways to take care of logical contradictions here, so saying 0% is not an option.) Suppose there were a machine--for the sake of tradition, we can call it Omega--that prints out a series of zeros and ones according to the following rule. If Islam is true, it prints out a 1 on each round, with 100% probability. If Islam is false, it prints out a 0 or a 1, each with 50% probability. Let's run the machine... suppose on the first round, it prints out a 1. Then another. Then another. Then another... and so on... it's printed out 10 1's now. Of course, this isn't so improbable. After all, there was a 1/1024 chance of it doing this anyway, even if Islam is false. And presumably we think Islam is more likely than this to be false, so there's a good chance we'll see a 0 in the next round or two... But it prints out another 1. Then another. Then another... and so on... It's printed out 20 of them. Incredible! But we're still holding out. After all, million to one chances happen every day... Then it prints out another, and another... it just keeps going... It's printed out 30 1's now. Of course, it did have a chance of one in a billion of doing this, if Islam were false... But for me, this is my lower bound. At this point, if not before, I become a Muslim. What about you? You've been rather vague about the probabilities involved, but you speak of "double digit negative exponents" and so on, even saying that this is "conservative," which implies possibly three digit exponents. Let's suppose you think that the probability that Islam is true is 10^-20; this would seem to be very conservative, by your standards. According to this, to get an equivalent chance, the machine would have to print out 66 1's. 
If the machine prints out 50 1's, and then someone runs in and smashes it beyond repair, before it has a chance to continue, will you walk away, saying, "There is a chance
9cousin_it10yThank you a lot for posting this scenario. It's instructive from the "heuristics and biases" point of view. Imagine there are a trillion variants of Islam, differing by one paragraph in the holy book or something. At most one of them can be true. You pick one variant at random, test it with your machine and get 30 1's in a row. Now you should be damn convinced that you picked the true one, right? Wrong. Getting this result by a fluke is 1000x more likely than having picked the true variant in the first place. Probability is unintuitive and our brains are mush, that's all I'm sayin'.
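The arithmetic in this exchange can be checked with a short odds calculation. This is a sketch under the thread's stated assumptions only: the machine prints a 1 with probability 1 if the hypothesis is true and with probability 1/2 otherwise, so each observed 1 doubles the odds. The function name and the use of exact fractions are illustrative choices, not anything from the original comments.

```python
from fractions import Fraction

def posterior_after_ones(prior: Fraction, n_ones: int) -> Fraction:
    """P(hypothesis | n_ones consecutive 1's) for the machine described above.

    Each 1 is twice as likely if the hypothesis is true (probability 1)
    as if it is false (probability 1/2), so every 1 doubles the odds.
    """
    prior_odds = prior / (1 - prior)
    posterior_odds = prior_odds * 2**n_ones
    return posterior_odds / (1 + posterior_odds)

# A prior of 10^-20: how far do 66 and 67 consecutive 1's move it?
tiny_prior = Fraction(1, 10**20)
after_66 = posterior_after_ones(tiny_prior, 66)
after_67 = posterior_after_ones(tiny_prior, 67)
print(float(after_66), float(after_67))

# One variant picked out of a trillion, followed by 30 consecutive 1's.
variant_prior = Fraction(1, 10**12)
after_30 = posterior_after_ones(variant_prior, 30)
fluke_to_true = (1 - after_30) / after_30  # odds of "fluke" versus "picked the true variant"
print(float(fluke_to_true))
```

With these inputs, 66 consecutive 1's leave the posterior a little under one half and the 67th pushes it past. In the trillion-variant case a fluke remains several hundred times more likely than having picked the true variant, the same order of magnitude as the "1000x" quoted above.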
1Unknowns10yI agree with this. But if the scenario happened in real life, you would not be picking a certain variant. You would be asking the vague question, "Is Islam true," to which the answer would be yes if any one of those trillion variants, or many others, were true. Yes, there are trillions of possible religions that differ from one another as much as Islam differs from Judaism, or whatever. But only a few of these are believed by human beings. So I still think I would convert after 30 1's, and I think this would be reasonable.
4cousin_it10yIf a religion's popularity raises your prior for it so much, how do you avoid Pascal's Mugging with respect to the major religions of today? Eternity in hell is more than 2^30 times worse than anything you could experience here; why aren't you religious already?
2Unknowns10yIt doesn't matter whether it raises your prior or not; eternity in hell is also more than 2^3000 times worse etc... so the same problem will apply in any case. Elsewhere I've defended Pascal's Wager against the usual criticisms, and I still say it's valid given the premises. But there are two problematic premises: 1) It assumes that utility functions are unbounded. This is certainly false for all human beings in terms of revealed preference; it is likely false even in principle (e.g. the Lifespan Dilemma). 2) It assumes that humans are utility maximizers. This is false in fact, and even in theory most of us would not want to self-modify to become utility maximizers; it would be a lot like self-modifying to become a Babyeater or a Super-Happy.
1Wei_Dai10yDo you have an answer for how to avoid giving in to the mugger in Eliezer's original Pascal's Mugging scenario [http://lesswrong.com/lw/kd/pascals_mugging_tiny_probabilities_of_vast/]? If not, I don't think your question is a fair one (assuming it's meant to be rhetorical).
1thomblake10yOddly, I think you meant "Pascal's Wager".
1FAWS10yPascal's Mugging [http://lesswrong.com/lw/kd/pascals_mugging_tiny_probabilities_of_vast/]. Pascal's Wager with something breaking symmetry (in this case observed belief of others).
3komponisto10yOf course I'm serious (and I hardly need to point out the inadequacy of the argument from the incredulous stare). If I'm not going to take my model of the world seriously, then it wasn't actually my model to begin with. Sewing-Machine's comment below [http://lesswrong.com/lw/2lr/the_importance_of_selfdoubt/2i1b?c=1] basically reflects my view, except for the doubts about numbers as a representation of beliefs. What this ultimately comes down to is that you are using a model of the universe according to which the beliefs of Muslims are entangled with reality to a vastly greater degree than on my model. Modulo the obvious issues about setting up an experiment like the one you describe in a universe that works the way I think it does, I really don't have a problem waiting for 66 or more 1's before converting to Islam. Honest. If I did, it would mean I had a different understanding of the causal structure of the universe than I do. Further below you say this, which I find revealing: As it happens, given my own particular personality, I'd probably be terrified. The voice in my head [http://lesswrong.com/lw/1j7/the_amanda_knox_test_how_an_hour_on_the_internet/] would be screaming. In fact, at that point I might even be tempted to conclude that expected utilities favor conversion, given the particular nature of Islam. But from an epistemic point of view, this doesn't actually change anything. As I argued in Advancing Certainty [http://lesswrong.com/lw/1mw/advancing_certainty/] , there is such a thing as epistemically shutting up and multiplying. Bayes' Theorem says the updated probability is one in a hundred billion, my emotions notwithstanding. This is precisely the kind of thing we have to learn to do in order to escape the low-Earth orbit of our primitive evolved epistemology -- our entire project here, mind you -- which, unlike you (it appears), I actually believe is possible.
4Wei_Dai10yHas anyone done a "shut up and multiply" for Islam (or Christianity)? I would be interested in seeing such a calculation. (I did a Google search and couldn't find anything directly relevant.) Here's my own attempt, which doesn't get very far. Let H = "Islam is true" and E = everything we've observed about the universe so far. According to Bayes: P(H | E) = P(E | H) P(H) / P(E) Unfortunately I have no idea how to compute the terms above. Nor do I know how to argue that P(H|E) is as small as 10^-20 without explicitly calculating the terms. One argument might be that P(H) is very small because of the high complexity of Islam, but since E includes "23% of humanity believe in some form of Islam", the term for the complexity of Islam seems to be present in both the numerator and denominator and therefore cancel each other out. If someone has done such a calculation/argument before, please post a link?
3FAWS10yActually it doesn't, human generated complexity is different from naturally generated complexity (for instance it fits into narratives, apparent holes are filled with the sort of justifications a human is likely to think of etc.). That's one of the ways you can tell stories from real events. Religious accounts contain much of what looks like human generated complexity.
3cousin_it10yP(E) includes the convincingness of Islam to people on average, not the complexity of Islam. These things are very different because of the conjunction fallacy. So P(H) can be a lot smaller than P(E).
3Wei_Dai10yI don't understand how P(E) does not include a term for the complexity of Islam, given that E contains Islam, and E is not so large that it takes a huge number of bits to locate Islam inside E.
1Furcas10yI don't think that's true; cousin_it had it right the first time. The complexity of Islam is the complexity of a reality that contains an omnipotent creator, his angels, Paradise, Hell, and so forth. Everything we've observed about the universe includes people believing in Islam, but not the beings and places that Islam says exist. In other words, E contains Islam the religion, not Islam the reality.
2PaulAlmond10yThe really big problem with such a reality is that it contains a fundamental, non-contingent mind (God's/Allah's, etc) - and we all know how much describing one of those takes - and the requirement that God is non-contingent means we can't use any simpler, underlying ideas like Darwinian evolution. Non-contingency, in theory selection terms, is a god killer: It forces God to incur a huge information penalty - unless the theist refuses even to play by these rules and thinks God is above all that - in which case they aren't even playing the theory selection game.
2Perplexed10yI don't see this. Why assume that the non-contingent, pre-existing God is particularly complex? Why not assume that the current complexity of God (if He actually is complex) developed over time as the universe evolved since the big bang? Or, just as good, assume that God became complex before He created this universe. It is not as if we know enough about God to actually start writing down that presumptive long bit string. And, after all, we don't ask the big bang to explain the coastline of Great Britain.
1PaulAlmond10yIf we do that, should we even call that "less complex earlier version of God" God? Would it deserve the title?
1Perplexed10ySure, why not? I refer to the earlier, less complex version of Michael Jackson as "Michael Jackson".
1Furcas10yAgreed. It's why I'm so annoyed when even smart atheists say that God was an ok hypothesis before evolution was discovered. God was always one of the worst possible hypotheses! Or, put more directly: Unless the theist is deluding himself. :)
1cousin_it10yI'm confused. In the comments to my post you draw a distinction [http://lesswrong.com/lw/2n4/the_prior_of_a_hypothesis_does_not_depend_on_its/2imc?c=1] between an "event" and a "huge set of events", saying that complexity only applies to the former but not the latter. But Islam is also a "huge set of events" - it doesn't predict just one possible future, but a wide class of them (possibly even including our actual world, ask any Muslim!), so you can't make an argument against it based on complexity of description alone. Does this mean you tripped on the exact same mine I was trying to defuse with my post? I'd be very interested in hearing a valid argument about the "right" prior we should assign to Islam being true - how "wide" the set of world-programs corresponding to it actually is - because I tried to solve this problem and failed.
2[anonymous]10y1. Here's a somewhat rough way of estimating probabilities of unlikely events. Let's say that an event X with P(X) = about 1-in-10 is a "lucky break." Suppose that there are L(1) ways that Y could occur on account of a single lucky break, L(2) ways that Y could occur on account of a pair of independent lucky breaks, L(3) ways that Y could occur on account of 3 independent lucky breaks, and so on. Then P(Y) is approximately the sum over all n of L(n)/10^n. I have the feeling that arguments about whether P(Y) is small versus extremely small are arguments about the growth rate of L(n). 2. I discussed the problem of estimating P("23% of humanity believes...") here [http://lesswrong.com/lw/2jd/open_thread_august_2010/2i0r?c=1]. I'd be grateful for thoughts or criticisms.
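The "lucky break" estimate in point 1 above can be written out directly. This is only a sketch of the formula as stated, P(Y) ≈ sum over n of L(n)/10^n; the function name and the example counts are hypothetical and chosen purely to show the mechanics.

```python
from fractions import Fraction
from typing import Mapping

def lucky_break_estimate(ways: Mapping[int, int]) -> Fraction:
    """Approximate P(Y) as the sum over n of L(n) / 10^n.

    ways[n] is L(n): the number of ways Y could occur on account of
    n independent lucky breaks, each with probability about 1 in 10.
    """
    return sum((Fraction(count, 10**n) for n, count in ways.items()), Fraction(0))

# Hypothetical counts, not taken from the thread.
example_ways = {3: 5, 6: 200}
estimate = lucky_break_estimate(example_ways)
print(float(estimate))
```

The sum stays small only while L(n) grows more slowly than 10^n, which is one way to read the remark that the disagreement is really about the growth rate of L(n).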
0gjm5yThere are some very crude sketches of shutting-up-and-multiplying, from one Christian and a couple of atheists, here [http://www.wall.org/~aron/blog/christianity-is-true/] (read the comments as well as the post itself), and I think there may be more with a similar flavour in other blog posts there (and their comments) from around the same time. (The author of the blog has posted a little on LW. The two skeptics responsible for most of the comments on that post have both been quite active here. One of them still is, and is in fact posting this comment right now :-).)
2RichardKennaway10yAt this point, if not before, I doubt Omega's reliability, not mine.
2Pavitra10yIt is a traditional feature of Omega that you have confidence 1 in its reliability and trustworthiness.
3RichardKennaway10yTraditions do not always make sense, neither are they necessarily passed down accurately. The original Omega, the one that appears in Newcomb's problem, does not have to be reliable with probability 1 for that problem to be a problem. Of course, to the purist who says that 0 and 1 are not probabilities, you've just sinned by talking about confidence 1, but the problem can be restated to avoid that by asking for one's conditional probability P(Islam | Omega is and behaves as described). In the present case, the supposition that one is faced with an overwhelming likelihood ratio raising the probability that Islam is true by an unlimited amount is just a blue tentacle scenario [http://yudkowsky.net/rational/technical]. Any number that anyone who agrees with the general anti-religious view common on LessWrong comes up with is going to be nonsense. Professing, say, 1 in a million for Islam on the grounds that 1 in a billion or 1 in a trillion is too small a probability for the human brain to cope with is the real cop-out, a piece of reversed stupidity with no justification of its own. The scenario isn't going to happen. Forcing your brain to produce an answer to the question "but what if it did?" is not necessarily going to produce a meaningful answer.
2Pavitra10yQuite true. But if you want to dispute the usefulness of this tradition, you should address the broader and older tradition of which it is an instance: that thought experiments should abstract away real-world details irrelevant to the main point. This is a pet peeve of mine, and I've wanted an excuse to post this rant for a while. Don't take it personally. That "purist" is as completely wrong as the person who insists that there is no such thing as centrifugal force [http://xkcd.com/123/]. They are ignoring the math in favor of a meme that enables them to feel smugly superior. 0 and 1 are valid probabilities in every mathematical sense: the equations of probability don't break down when passed p=0 or p=1 the way they do with genuine nonprobabilities like -1 or 2. A probability of 0 or 1 is like a perfect vacuum: it happens not to occur in the world that we happen to inhabit, but it is perfectly well-defined, we can do math with it without any difficulty, and it is extraordinarily useful in thought experiments. When asked to consider a spherical black body of radius one meter resting on a frictionless plane, you don't respond "blue tentacles [http://lesswrong.com/lw/it/semantic_stopsigns/]", you do the math.
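The claim above that the equations accept p=0 and p=1 but not values like 2 can be illustrated with Bayes' theorem itself. This is a minimal sketch; the function name is hypothetical, and the likelihoods (1 if true, 1/2 if false) are borrowed from the Omega machine discussed earlier in the thread.

```python
def bayes_update(prior: float, p_e_given_h: float, p_e_given_not_h: float) -> float:
    """Posterior P(H | E) from Bayes' theorem."""
    evidence = p_e_given_h * prior + p_e_given_not_h * (1.0 - prior)
    return p_e_given_h * prior / evidence

# Priors of 0 and 1 pass through unchanged; 2.0 is not a probability
# and produces a "posterior" outside the interval [0, 1].
for prior in (0.0, 0.5, 1.0, 2.0):
    print(prior, bayes_update(prior, 1.0, 0.5))
```

Priors of exactly 0 and 1 are fixed points of the update, while the input 2 yields a result above 1. The one case the formula leaves undefined is conditioning on evidence whose total probability is 0, where the division fails.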
1[anonymous]10yYou've asked us to take our very small number, and imagine it doubling 66 times. I agree that there is a punch to what you say -- no number, no matter how small, could remain small after being doubled 66 times! But in fact long ago Archimedes made a compelling case that there are such numbers. Now, it's possible that Archimedes was wrong and something like ultrafinitism is true. I take ultrafinitist ideas quite seriously, and if they are correct then there are a lot of things that we will have to rethink. But Islam is not close to the top of the list of things we should rethink first. Maybe there's a kind of meta claim here: conditional on probability theory being a coherent way to discuss claims like "Islam is true," the probability that Islam is true really is that small.

Um, that's not an informative answer to anyone but yourself. Is there any specific piece of evidence that became known to you, or became more significant to you, at that time?

Who else is nearly as good or better at Friendly AI development than Eliezer Yudkowsky?

I mean besides me, obviously.

Oh, I know it's not your fault, but seriously, have "the Internet" ask you the same question 153 times in a row and see if you don't get slightly frustrated with "the Internet".

2Perplexed10yYeah, after reading your "some part of my brain" thing a second time, I realized I had misinterpreted. Though I will point out that my question was not directed to you. You should learn to delegate the task of becoming frustrated with the Internet. I read the article (though not yet any of the transcripts). Very interesting. I hope that some tests using a gatekeeper committee are tried someday.

Those are narrow AI tasks, and the safety considerations are correspondingly narrow. FAI is the problem of creating a machine intelligence that is powerful enough to destroy humanity or the world but doesn't want to, and solving such a problem is nothing like building an autopilot system that doesn't crash the plane. Among people who think they're going to build an AGI, there often doesn't seem to be a deep understanding of the impact of such an invention (it's more like "we're working on a human-level AI, and we're going to have it on the market in 5...

  1. Inflation.

  2. The richest person on earth currently has a net worth of $53.5 billion.

  3. The greatest peak net worth in recorded history, adjusted for inflation, was Bill Gates' $101 billion, which was ten years ago. No one since then has come close. A 10-fold increase in <6 years strikes me as unlikely.

  4. In any case, your extrapolated curve points to 2116, not 2016.
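To put point 3 in annual terms, here is a small sketch of the constant growth rate that a 10-fold increase would require. It assumes smooth compounding and uses 6 years as the upper bound implied by "<6 years"; the function name is hypothetical.

```python
def required_annual_growth(multiple: float, years: float) -> float:
    """Constant annual growth rate needed to multiply a quantity by
    `multiple` over `years` years, assuming smooth compounding."""
    return multiple ** (1.0 / years) - 1.0

rate = required_annual_growth(10.0, 6.0)
print(f"{rate:.1%} per year")
```

Any shorter horizon would require a still higher rate.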

I am increasingly convinced that your comments on this topic are made in less than good faith.

"It's basically a modern version of a religious belief system and there's no purpose to it, like why, why must we have another one of these things ... you get an afterlife out of it because you'll be on the inside track when the singularity happens - it's got all the trappings of a religion, it's the same thing." - Jaron here.

I've encountered people who think Singularitarians think that, never any actual Singularitarians who think that.

8ata10yYeah, "people who think Singularitarians think that" is what I meant. I've actually met exactly one something-like-a-Singularitarian who did think something-like-that — it was at one of the Bay Area meetups, so you may or may not have talked to him, but anyway, he was saying that only people who invent or otherwise contribute to the development of Singularity technology would "deserve" to actually benefit from a positive Singularity. He wasn't exactly saying he believed that the nonbelievers would be left to languish when cometh the Singularity, but he seemed to be saying that they should. Also, I think he tried to convert me to Objectivism.

relatively few of the generally accepted laws of physics

"relatively few"? Name two.

Haven't there been a lot more than a million people in history that claimed saving the world, with 0 successes?

Can you name ten who claimed to do so via non-supernatural/extraterrestrial means? Even counting claims of the supernatural I would be surprised to learn there had been a million.

Take no pride in your confession that you too are biased; do not glory in your self-awareness of your flaws. This is akin to the principle of not taking pride in confessing your ignorance; for if your ignorance is a source of pride to you, you may become loathe to relinquish your ignorance when evidence comes knocking. Likewise with our flaws - we should not gloat over how self-aware we are for confessing them; the occasion for rejoicing is when we have a little less to confess.

There's something to what Eliezer is saying here: when people are too str...

But there is further information. We must expect Eliezer to make use of all of the information available to him when making such an implied estimation and similarly use everything we have available when evaluating credibility of any expressed claims.

2NancyLebovitz10yNitpick: Do you mean credulity or credibility?
1wedrifid10yThe one that makes sense. Thanks. :)

it's very important that those of us who aspire to epistemic rationality incorporate a significant element of "I'm the sort of person who engages in self-doubt because it's the right thing to do" into our self-image

I think most of us do. Your argument for this is compelling. However, I think Eliezer was just claiming that it's possible to overdo it - at least, that's the defensible core of his insight.

I've wondered if I'm obsessed with Eliezer's writings, and whether I esteem him too highly. Answers: no, and no.

Anything that has even a sl...
1multifoliaterose10yThanks for correcting my typos.
1Jonathan_Graehl10yYou're welcome - I've redacted my comment so it no longer mentions them.

I have no comment to add but I will say that this is well written and researched. It also prompted a degree of self reflection on my part. At least, that's what I told myself and I feel this warm glow inside. ;)

As of yet Eliezer's importance is just a stochastic variable yet to be realized, for all I know he could be killed in a car accident tomorrow or simply fail at his task of "saving the world" in numerous ways.

Up until now Vasili Arkhipov, Stanislav Petrov and a few other people I do not know the names of (including our earliest ancestors who managed to avoid being killed during their emigration out of Africa) trump Eliezer by a tiny margin of actually saving humanity, or at least civilization.

All that being said Eliezer is still pretty awesome by my standards. And he writes good fanfiction, too.

Eliezer took exception to my estimate linked in my comment here.

Less than 1 in 1 billion! :-) May I ask exactly what the proposition was? At the link you say "probability of ... you succeeding in playing a critical role on the Friendly AI project that you're working on". Now by one reading that probability is 1, since he's already the main researcher at SIAI.

Suppose we analyse your estimate in terms of three factors:

(probability that anyone ever creates Friendly AI) x (conditional probability SIAI contributed) x (conditional probability that Eliezer contributed)

Can you tell us where the bulk of the 10^-9 is located?
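The three-factor decomposition above can be made concrete with a short sketch. The function names and the example factor values are hypothetical; only the 10^-9 total comes from the thread.

```python
import math

def product_of_factors(factors):
    """Joint probability of a chain of factors, each conditional on the previous ones."""
    return math.prod(factors)

def equal_split(total: float, n_factors: int) -> float:
    """Per-factor probability if `total` were spread evenly over n_factors factors."""
    return total ** (1.0 / n_factors)

# If the 10^-9 were spread evenly over the three factors:
print(equal_split(1e-9, 3))

# One hypothetical uneven split that also multiplies out to 10^-9:
print(product_of_factors([0.1, 0.01, 1e-6]))
```

An even split would put each factor at about 1 in 1000. Any uneven split has to load a very small number onto at least one factor, which is what the question about where the bulk of the 10^-9 sits is probing.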

Eliezer took exception to my estimate linked in my comment here.

And he was right to do so, because that estimate was obviously on the wrong order of magnitude. To make an analogy, if someone says that you weigh 10^5kg, you don't have to reveal your actual weight (or even measure it) to know that 10^5 was wrong.

I'm not sure what you mean by "complete blueprints". I agree that the DNA isn't a complete blueprint, and that an alien civilization with a different chemistry would (probably) find it impossible to rebuild a human if they were just given its DNA. The gestational environment is essential; I just don't think it encodes much data on the actual working of the brain.

It seems to me that the interaction between the baby and the gestational environment is relatively simple, at least compared to organ development and differentiation. There...

I'm embarrassed to admit that I was reading while tired, and didn't even notice there was a link in the comment. However, even after reading that, rwallace's epiphany remains opaque to the rest of us. He explains that a change of attitude was necessary in order to accept whatever evidence of AI's difficulty he already had, but he explains nothing about what that evidence might be. It's still uninformative to anyone but himself.

It looks to me like Eliezer gave your post the most generous interpretation possible, i.e. that it actually contained an argument attempting to show that he's deluding himself, rather than just defining a reference class and pointing out that Eliezer fits into it. Since you've now clarified that your post did nothing more than that, there's not much left to do except suggest you read all of Eliezer's posts tagged 'FAI', and this.

Maybe I should have qualified my statement by saying "this estimate may be a gross overestimate or a gross underestimate."

It sounds, then, like you're averaging probabilities geometrically rather than arithmetically. This is bad!
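The difference between the two kinds of averaging is easy to see numerically. In this sketch the two candidate estimates are hypothetical, chosen so that one is a gross overestimate and the other a gross underestimate of 10^-9; the function names are illustrative.

```python
import math

def arithmetic_mean(probabilities):
    """Average of the probabilities themselves."""
    return sum(probabilities) / len(probabilities)

def geometric_mean(probabilities):
    """Average of the log-probabilities, mapped back to a probability."""
    return math.exp(sum(math.log(p) for p in probabilities) / len(probabilities))

# Two hypothetical candidate estimates, treated as equally plausible.
estimates = [1e-3, 1e-15]
print(arithmetic_mean(estimates))
print(geometric_mean(estimates))
```

If each estimate is equally likely to be the right one, the overall probability is the arithmetic mean, which is dominated by the larger estimate. The geometric mean instead splits the difference in orders of magnitude and comes out many orders of magnitude lower.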

1multifoliaterose10yI understand your position and believe that it's fundamentally unsound. I will have more to say about this later. For now I'll just say that the arithmetical average of the probabilities that I imagine I might ascribe to Eliezer's current strategy resulting in an FAI is 10^(-9).

You can reduce an AGI to the behavior of computer chips (or whatever fancy-schmancy substrate they end up running on), which are themselves just channels for the flow of electrons. Nothing mental there. Friendliness is a description of the utility function and decision theory of an AGI, both of which can be reduced to patterns of electrons on a computer chip.

It's all electrons floating around. We just talk about ridiculous abstract things like AGI and Friendliness because it makes the math tractable.

what's "it" in "it being virtually certain."

"it being virtually certain that there are three independent 1 in 1000 events required, or nine independent 1 in 10 events required, or something along those lines"

models of what, final probability of what?

Models of the world that we use to determine how likely it is that Eliezer will play a critical role through a FAI team. Final probability of that happening.

A billion is big compared to the relative probabilities we're rationally entitled to have between models where a serie...

A - his beliefs on MWI have no bearing on his relative importance wrt the future of the world.

B - when you say "defensible", you mean "accepted by the clear majority of scientists working in the field".

This question sounds disingenuous to me. There is a large gap between "10^-9 chance of Eliezer accomplishing it" and "so easy for the average machine learning PhD." Whatever else you think about him, he's proved himself to be at least one or two standard deviations above the average PhD in ability to get things done, and some dimension of rationality/intelligence/smartness.

CarlShulman is correct, but for reference, Richard Carrier's definition of "supernatural":

In short, I argue "naturalism" means, in the simplest terms, that every mental thing is entirely caused by fundamentally nonmental things, and is entirely dependent on nonmental things for its existence. Therefore, "supernaturalism" means that at least some mental things cannot be reduced to nonmental things.

You don't seem to want to state your beliefs clearly, and I don't have the patience to write more than this one post encouraging you to do so.

Do you believe that the difficulty of developing technology depends on the mind trying to develop it only when that mind happens to be human? Or that nothing can be smarter than the smartest human? Or what?

2rwallace10yI'm being as clear as I can without writing an essay in every comment. But I'll put it this way: 1. Nothing is currently smarter than the smartest human. 2. This is not going to change anytime soon. 3. While AI does have the potential to produce better tools than we have now, there are still going to be enormous gaps in the abilities of those tools. For example, suppose you had an AI that was great at writing code from formal specifications, but didn't know enough about the real world to know what code to write. Then you would have a tool that you might find useful, but that you could not sit back and let solve your problems for you. At the end of the day, the responsibility for solving your problems would still be yours. This is very different from the Singularitarian vision where creating a superintelligent AI is the last job we need to do.
2Risto_Saarelma10yMaybe you could write a full post about your views. I'd very much like to read good criticism of Singularitarianism, but so far your objections aren't very strong. The core assumptions in this comment, for example, seem to be not really visible. I'm guessing the idea is something like it'd be really, really hard to do an AI that can do everything a human does, and trying to leave real-world problem-solving to subhuman AIs won't work. But no-one's talking about going after problems in the physical world with a glorified optimizing compiler, so why do you bring up this as the main example? The starting point for a lot of current AGI thinking, as far as I've understood, is to make an AI with the ability to learn and some means to interact with the world. This AI is then expected to learn to act in the world like humans learn when they grow from newborns to adults. So is there some kind of basic difference in understanding here, when I'm thinking of AIs as learning semi-autonomous agents, and you're thinking of them as, I guess, some kind of pre-programmed unchanging procedures for doing specific things?
4rwallace10yYes, basically my claim is that an AI of the sort you're talking about is a job for the world over timescales of generations, not for a single team over timescales of years or decades; it's hard to prove a negative, and you are right that the comments I've been making here don't -- can't -- strongly justify that claim. I'll think about whether I can put together my reasoning into a full post.
2Jonathan_Graehl10yYour position is one that most people assign some probability mass to. However, I get the impression that you're extremely (over)confident in it. So I look forward to hearing your case.
1Mitchell_Porter10yThis requires two things: knowing what you want, and learning about the world. I don't see the fundamental problem in getting an AI to learn about the world. The informal human epistemic process has been analyzed into components, and these have been formalized and implemented in ways far more powerful than an unaided human can manage. It's a lot of work to put it all together in a self-consistent package, and to give it enough self-knowledge and world-knowledge to set it in motion, and it would require a lot of computing power. But I don't see any fundamental difficulty. What the AI wants is utterly contingent on initial conditions. But an AI that can represent the world and learn about it, can also represent just about any goal you care to give it, so there's no extra problem to solve here. (Except for Friendliness. But that is the specific problem of identifying a desirable goal, not the general problem of implementing goal-directed behavior.) Just reviewing this basic argument reinforces the prior impression that we are already drifting towards transhuman AI and that there's no fundamental barrier in the way. We already know enough for hard work alone to get us there - I mean the hard work of tens of thousands of researchers in many fields, not one person or one group making a super-duper effort. The other factor which seals our fate is distributed computing. Even if Moore's law breaks down, computers can be networked, and there are lots of computers. So, we are going to face something smarter than human, which means something that can outwit us, which means something that should win if its goals are ever in conflict with ours. And there is no law of nature to guarantee that its goals will be humanly benevolent. On the contrary, it seems like anything might serve as the goal of an AI, just as "any" numerical expression might be fed to a calculator for evaluation. 
What we don't know is how likely it is that the first transhuman AI's goals will be bad for

Agree with your principle but not exactly the particular expression or figures. A relative, not absolute, measure seems more appropriate. I think Eliezer has been careful to never give figures to success probabilities. But see 'shut up and do the impossible'.

I would perhaps change the claim to 'doing more than anyone else to save the world'. I'm not certain what self-evaluated probability could be so claimed by Eliezer. I would accept as credible something far higher than 10^-4, probably higher than 10^-3. Even at 10^-2 I wouldn't sneer. But the figure is...

Nothing is going to come along and solve our problems for us, and AI is not going to be a magical exception to the rule that developing technology is hard.

Do you think many people here think that "something is going to come along and solve our problem for us", or that "developing AI is easy"?

2rwallace10yYes. In particular, the SIAI is explicitly founded on the beliefs that 1. Superintelligent AI will solve all our problems. 2. Creating same is (unlike other, much less significant technological developments) so easy that it can be done by a single team within our lifetimes.
5Aleksei_Riikonen10yThe following summary of SIAI's position says otherwise: http://singinst.org/riskintro/index.html [http://singinst.org/riskintro/index.html] It seems you're confusing what you personally thought earlier with what SIAI currently thinks. (Though, technically you're partly right that what SIAI folks thought when said institution was founded is closer to what you say than their current position. But it's not particularly interesting what they thought 10 years ago if they've revised their position to be much better since then.)
4rwallace10yAh, thanks for the update; you're right, their claims regarding difficulty and timescale have been toned down quite a bit.
3Emile10yThat isn't really evidence that people here (currently) believe either of those. You're claiming people here believe things even though they go against some of Eliezer's writing (and I don't remember any cries of "No, Eliezer, you're wrong! Creating AI is easy!", but I might be mistaken), and even though quite a few commenters are telling you nobody here believes that.
4whpearson10yIt depends what you mean by easy and hard. From previous conversations I expect Mr Wallace is thinking that something easy is doable by a small group over 20-30 years and something hard takes a couple of generations of the whole of civilization's work.
1rwallace10yYes, that's how I am using the terms.

This post is a pretty accurate description of me a few years ago, when I was a Singularitarian. The largest attraction of the belief system, to me, was that it implied as an AI researcher I was not just a hero, but a superhero, potentially capable of almost single-handedly saving the world. (And yes, I loved those video games too.)

2cousin_it10yWhat's your current position?

A few unrelated points:

  1. I tend to agree with you on the first section, but I think I'm less confident about it than you are. :)
  2. What is a genuinely utilitarian lifestyle? Is there someone you can cite as living such a lifestyle?
  3. I'm not sure what you're talking about in the last sentence. Prevent what from happening to Eliezer? Failing to lose hope when he should? (He wrote a post about that, BTW.)
1multifoliaterose10ySorry to take so long to get back to you :) Obviously humans are extremely ill-suited for being utilitarians (just as humans would be extremely ill-suited for being paperclip maximizers even if they wanted to be.) When I refer to a "genuinely utilitarian lifestyle" I mean subject to human constraints. There are some people who do this much better than others - for example, Bill Gates and Warren Buffett have done much better than most billionaires. I think that with a better peer network, Gates and Buffett could have done still better (for example I would have liked to see them take existential risk into serious consideration with their philanthropic efforts). A key point here is that as I've said elsewhere [http://towardabetterworld.wordpress.com/2010/06/08/altruism-and-sacrifice/] I don't think that leading a (relatively) utilitarian lifestyle has very much at all to do with personal sacrifice, but rather with realigning one's personal motivational structure in a way that (at least for many people) does not entail a drop in quality of life. If you haven't already done so, see my post on missed opportunities for doing well by doing good [http://lesswrong.com/lw/2ha/missed_opportunities_for_doing_well_by_doing_good/?sort=controversial] . Thanks for the reference. I edited the end of my posting to clarify what I had in mind.
1Wei_Dai10yIf that's the kind of criteria you have in mind, why did you say "Eliezer appears to be deviating so sharply from leading a genuinely utilitarian lifestyle"? It seems to me that Eliezer has also done much better than most ... (what's the right reference class here? really smart people who have been raised in a developed country?) Which isn't to say that he couldn't do better, but your phrasing strikes me as rather unfair...
2multifoliaterose10yWhat I was getting at in my posting is that in exhibiting unwillingness to seriously consider the possibility that he's vastly overestimated his chances of building a Friendly AI it appears that Eliezer is deviating sharply from leading a utilitarian lifestyle (relative to what one can expect from humans). I was not trying to make a general statement about Eliezer's attainment of utilitarian goals relative to other humans. I think that there's a huge amount of uncertainty on this point to such an extent that it's meaningless to try to make a precise statement. The statement that I was driving at is a more narrow one. I think that it would be better for Eliezer and for the world at large if Eliezer seriously considered the possibility that he's vastly overestimated his chances of building a Friendly AI. I strongly suspect that if he did this, his strategy for reducing existential risk would change for the better. If his current views turn out to be right, he can always return to them later on. I think that the expected benefits of him reevaluating his position far outweigh the expected costs.
3Mitchell_Porter10yWhy? What sort of improvement would you expect? Remember that he is still the one person in the public sphere who takes the problem of Friendly AI (under any name) seriously enough to have devoted his life to it, and who actually has quasi-technical ideas regarding how to achieve it. All this despite the fact that for decades now, in fiction and nonfiction, the human race has been expressing anxiety about the possibility of superhuman AI. Who are his peers, his competitors, his predecessors? If I was writing the history of attempts to think about the problem, Chapter One would be Isaac Asimov with his laws of robotics, Chapter Two would be Eliezer Yudkowsky and the idea of Friendly AI, and everything else would be a footnote.
2wedrifid10yWe haven't heard Eliezer say how likely he believes it is that he creates a Friendly AI. He has been careful not to discuss that subject. If he thought his chances of success were 0.5% then I would expect him to take exactly the same actions. (ETA: With the insertion of 'relative' I suspect I would more accurately be considering the position you are presenting.)
3multifoliaterose10yRight, so in my present epistemological state I find it extremely unlikely that Eliezer will succeed in building a Friendly AI. I gave an estimate here [http://lesswrong.com/lw/2lr/the_importance_of_selfdoubt/2h9y?c=1] which proved to be surprisingly controversial. The main points that inform my thinking here are: 1. The precedent for people outside of the academic mainstream having mathematical/scientific breakthroughs in recent times is extremely weak. In my own field of pure math I know of only two people without PhDs in math or related fields who have produced something memorable in the last 70 years or so, namely Kurt Heegner [http://en.wikipedia.org/wiki/Kurt_Heegner] and Martin Demaine [http://en.wikipedia.org/wiki/Martin_Demaine]. And even Heegner and Demaine are (relatively speaking) quite minor figures. It's very common for self-taught amateur mathematicians to greatly underestimate the difficulty of substantive original mathematical research. I find it very likely that the same is true in virtually all scientific fields and thus have an extremely skeptical Bayesian prior against any proposition of the type "amateur intellectual X will solve major scientific problem Y." 2. From having talked with computer scientists and AI researchers, I have a very strong impression that the consensus is that AGI is way out of reach at present. See for example points #1 and #5 of Scott Aaronson's The Singularity is Far [http://scottaaronson.com/blog/?p=346]. The fact that Eliezer does not appear to have seriously contemplated or addressed the two points above and their implications diminishes my confidence in his odds of success still further.
5thomblake10yThat you have this impression greatly diminishes my confidence in your intuitions on the matter. Are you seriously suggesting that Eliezer has not contemplated AI researchers' opinions about AGI? Or that he hasn't thought about just how much effort should go into a scientific breakthrough? Someone please throw a few hundred relevant hyperlinks at this person.
4Wei_Dai10yRegarding your first point, I'm pretty sure Eliezer does not expect to solve FAI by himself. Part of the reason for creating LW was to train/recruit potential FAI researchers, and there are also plenty of Ph.D. students among SIAI visiting fellows. Regarding the second point, do you want nobody to start researching FAI until AGI is within reach?
1timtyler10yI don't think there's any such consensus. Most of those involved know that they don't know with very much confidence. For a range of estimates, see the bottom of: http://alife.co.uk/essays/how_long_before_superintelligence/ [http://alife.co.uk/essays/how_long_before_superintelligence/]
2multifoliaterose10yFor what it's worth, in saying "way out of reach" I didn't mean "chronologically far away," I meant "far beyond the capacity of all present researchers." I think it's quite possible that AGI is just 50 years away. I think that in the absence of plausibly relevant and concrete directions for AGI/FAI research, the chance of having any impact on the creation of an FAI through research is diminished by many orders of magnitude. If there are plausibly relevant and concrete directions for AGI/FAI research then the situation is different, but I haven't heard examples that I find compelling.
1timtyler10y"Just 50 years?" Shane Legg's explanation of why his mode is at 2025: http://www.vetta.org/2009/12/tick-tock-tick-tock-bing/ [http://www.vetta.org/2009/12/tick-tock-tick-tock-bing/] If 15 years is more accurate - then things are a bit different.
1multifoliaterose10yThanks for pointing this out. I don't have the subject matter knowledge to make an independent assessment of the validity of the remarks in the linked article, but it makes points that I had not seen before. I'd recur to CarlShulman's remark [http://lesswrong.com/lw/2l0/should_i_believe_what_the_siai_claims/2fd3?c=1] about selection bias here. I look forward to seeing the results of the hypothetical Bostrom survey and the SIAI collection of all public predictions. I agree. There's still an issue of a lack of concrete directions of research at present but if 15 years is accurate then I agree with Eliezer that we should be in "crunch" mode (amassing resources specifically directed at future FAI research).
1Will_Newsome10yAt any rate, most rationalists who have seriously considered the topic will agree that there is a large amount of probability mass 15 years into the future: large enough that even if the median estimate till AGI is 2050, we're still in serious crunch time. The tails are fat in both directions. (This is important because it takes away a lot of the Pascalian flavoring that makes people (justifiably) nervous when reasoning about whether or not to donate to FAI projects: 15% chance of FOOM before 2020 just feels very different to a bounded rationalist than a .5% chance of FOOM before 2020.) For what it's worth, Shane Legg is a pretty reasonable fellow who understands that AGI isn't automatically good, so we can at least rule out that his predictions are tainted by the thoughts of "Yay, technology is good, AGI is close!" that tend to cast doubt on the lack of bias in most AGI researchers' and futurists' predictions. He's familiar with the field and indeed wrote the book on Machine Super Intelligence. I'm more persuaded by Legg's arguments than most at SIAI, though, and although this isn't a claim that is easily backed by evidence, the people at SIAI are really freakin' good thinkers and are not to be disagreed with lightly.
1timtyler10yThe biggest optimist I have come across is Peter Voss. His estimate in 2009 was around 8 years [http://www.vimeo.com/3461663] - 7:00 in. However, he obviously has something to sell - so maybe we should not pay too much attention to his opinion - due to the signalling effects associated with confidence.
1[anonymous]10yEliezer addresses point 2 in the comments of the article you linked to in point 2. He's also previously answered the questions of whether he believes he personally could solve FAI and how far out it is -- here [http://lesswrong.com/lw/qu/a_premature_word_on_ai/], for example.
1multifoliaterose10yEdit: Should I turn my three comments starting here [http://lesswrong.com/lw/2lr/the_importance_of_selfdoubt/2k33?c=1] into a top level posting? I hesitate to do so in light of how draining I've found the process of making top level postings and especially reading and responding to the ensuing comments, but the topic may be sufficiently important to justify the effort.
1Wei_Dai10yWhat evidence do you have of this? One reason I doubt that it's true is that Eliezer has been relatively good at admitting flaws in his ideas, even when doing so implied that building FAI is harder than he previously thought. I think you could reasonably argue that he's still overconfident about his chances of successfully building FAI, but I don't see how you get "unwillingness to seriously consider the possibility".
1multifoliaterose10yEliezer was not willing to engage with my estimate here [http://lesswrong.com/lw/2lr/the_importance_of_selfdoubt/2h9y?c=1]. See his response [http://lesswrong.com/lw/2lr/the_importance_of_selfdoubt/2ha0?c=1]. For the reasons that I point out here [http://lesswrong.com/lw/2lr/the_importance_of_selfdoubt/2k3a?c=1], I think that my estimate is well grounded. Eliezer's apparent lack of willingness to engage with me on this point does not immediately imply that he's unwilling to seriously consider the possibility that I raise. But I do see it as strongly suggestive. As I said in response to ThomBlake [http://lesswrong.com/lw/2lr/the_importance_of_selfdoubt/2k3h?c=1], I would be happy to be pointed to any of Eliezer's writings which support the idea that Eliezer has given serious consideration to the two points that I raised to explain my estimate. Edit: I'll also add that given the amount of evidence that I see against the proposition that Eliezer will build a Friendly AI, I have difficulty imagining how he could be persisting in holding his beliefs without having failed to give serious consideration to the possibility that he might be totally wrong. It seems very likely to me that if he had explored this line of thought, he would have a very different world view than he does at present.
2Wei_Dai10yHave you noticed that many (most?) commenters/voters seem to disagree with your estimate [http://lesswrong.com/lw/2lr/the_importance_of_selfdoubt/2h9y?c=1]? That's not necessarily strong evidence that your estimate is wrong (in the sense that a Bayesian superintelligence wouldn't assign a probability as low as yours), but it does show that many reasonable and smart people disagree with your estimate even after seriously considering your arguments. To me that implies that Eliezer could disagree with your estimate even after seriously considering your arguments, so I don't think his "persisting in holding his beliefs" offers much evidence for your position that Eliezer exhibited "unwillingness to seriously consider the possibility that he's vastly overestimated his chances of building a Friendly AI".
4multifoliaterose10yYes. Of course, there's a selection effect here - the people on LW are more likely to assign a high probability to the proposition that Eliezer will build a Friendly AI (whether or not there's epistemic reason to do so). The people outside of LW who I talk to on a regular basis have an estimate in line with my own. I trust these people's judgment more than I trust LW posters' judgment simply because I have much more information about their positive track records for making accurate real world judgments than I do for the people on LW. Yes, so I agree that in your epistemological state you should feel this way. I'm explaining why in my epistemological state I feel the way I do.
2Wei_Dai10yIn your own epistemological state, you may be justified in thinking that Eliezer and other LWers are wrong about his chances of success, but even granting that, I still don't see why you're so sure that Eliezer has failed to "seriously consider the possibility that he's vastly overestimated his chances of building a Friendly AI". Why couldn't he have, like the other LWers apparently did, considered the possibility and then (erroneously, according to your epistemological state) rejected it?
1Alan10y1. Jeremy Bentham may be a candidate, or perhaps James Mill, father of J.S. Mill--though there's been some recent speculation that the former fell somewhere on the autism spectrum (no slight intended). By the way, if you're interested, check out the research on shifting modes of moral cognition, deontological vs. consequentialist, depending upon subject matter, featured in the work of David Pizarro, e.g. Further afield, one may check out what Taleb has to say about who has led a genuinely Popperian lifestyle.
1wedrifid10yOptimising one's lifestyle for the efficient acquisition of power to enable future creation of bulk quantities of paper-clips. For example.

Right, but the historical precedent for an amateur scientist even being at all involved in a substantial scientific breakthrough over the past 50 years is very weak.

What are we supposed to infer from that? That if you add an amateur scientist to a group of PhDs, that would substantially decrease their chance of making a breakthrough?

The impressions that I've gotten from my private correspondence with Eliezer and from his comments have given me a very strong impression that I would find him too difficult to work with for me to be able to do productive

...

As far as I can tell, Eliezer does have confidence in the idea that he is (at least nearly) the most important person in human history. Eliezer's silence only serves to further confirm my earlier impressions

I suppose you also believe that Obama must prove he's not a Muslim? And must do so again every time someone asserts that he is?

Let me say that Eliezer may have already done more to save the world than most people in history. This is going on the assumption that FAI is a serious existential risk. Even if he is doing it wrong and his work will never di...

2multifoliaterose10yI don't see the situation that you cite as comparable. Obama has stated that he's a Christian, and this seriously calls into question the idea that he's a Muslim. Has Eliezer ever said something which calls my interpretation of the situation into question? If so I'll gladly link a reference to it in my top level post. (As an aside, I agree with Colin Powell [http://www.youtube.com/watch?v=b2U63fXBlFo] that whether or not Obama is a Muslim has no bearing on whether he's fit to be president.) I definitely agree that some of what Eliezer has done has reduced existential risk. As I've said elsewhere [http://lesswrong.com/lw/2m5/transparency_and_accountability/2hr0?c=1], I'm grateful to Eliezer for inspiring me personally to think more about existential risk. However, as I've said [http://lesswrong.com/lw/2l8/existential_risk_and_public_relations/], in my present epistemological state I believe that he's also had (needless) negative effects on existential risk on account of making strong claims with insufficient evidence. See especially my responses to komponisto's comment [http://lesswrong.com/lw/2l8/existential_risk_and_public_relations/2g2s?c=1]. I may be wrong about this. In any case, I would again emphasize that my most recent posts should not be interpreted as personal attacks on Eliezer. I'm happy to support Eliezer to the extent that he does things that I understand to lower existential risk. My conscious motivation in making my most recent string of posts is given in my Transparency and Accountability [http://lesswrong.com/lw/2m5/transparency_and_accountability/] posting. I have no conscious awareness of having a motivation of the type that you describe. Of course, I may be deluded about this (just as all humans may be deluded about possessing any given belief). In line with my top level posting, I'm interested in seriously considering the possibility that my unconscious motivations are working against my conscious goals.
However, I see your own impressio
2Eneasz10yDoes whether Eliezer is over-confident or not have any bearing on whether he's fit to work on FAI? From the comment: The claim is not credible. I've seen a few examples given, but with no way to determine if the people "repelled" would have ever been open to mitigating existential risk in the first place. I suspect anyone who actually cares about existential risk wouldn't dismiss an idea out of hand because a well-known person working to reduce risk thinks his work is very valuable. It is unlikely to be their true rejection [http://lesswrong.com/lw/wj/is_that_your_true_rejection/]. The latest post made this clear, and cheers for that. But the previous ones are written as attacks on Eliezer. It's hard to see a diatribe against someone describing them as a cult leader who's increasing existential risk and would do best to shut up and not interpret it as a personal attack. Fair enough, can't blame you for that. I'm happy with my enthusiasm.
2multifoliaterose10yOh, I don't think so, see my response to Eliezer here [http://lesswrong.com/lw/2m5/transparency_and_accountability/2hzd?c=1]. Yes, so here it seems like there's enough ambiguity as to how the publicly available data is properly interpreted so that we may have a legitimate difference of opinion on account of having had different experiences. As Scott Aaronson mentioned in the blogging heads conversation, humans have their information stored in a form (largely subconscious) such that it's not readily exchanged. All I would add to what I've said is that if you haven't already done so, see the responses to michaelkeenan's comment here [http://lesswrong.com/lw/2l8/existential_risk_and_public_relations/2fv1?c=1] (in particular those by myself, bentarm and wedrifid). If you remain unconvinced, we can agree to disagree without hard feelings :-)
[anonymous]10y 3

I think after somewhere between 30 and 300 coin flips, I would convert. With more thought and more details about what package of claims is meant by "Islam," I could give you a better estimate. Escape routes that I'm not taking: I would start to suspect Omega was pulling my leg, I would start to suspect that I was insane, I would start to suspect that everything I knew was wrong, including the tenets of Islam. If answers like these are copouts -- if Omega is so reliable, and I am so sane, and so on -- then it doesn't seem like much of a bullet ...

Yes. Because there is always the possibility that some smart geek will say "'moon-onna-stick', huh? I bet I could do that. I see a clever trick." Or maybe some other geek will say "Would you settle for Sputnik-on-a-stick?" and the User will say "Well, yes. Actually, that would be even better."

At least that is what they preach in the Process books.

CEV is a bizarre wishlist, apparently made with minimal consideration of implementation difficulties ...

It is what the software professionals would call a preliminary requirements document. You are not supposed to worry about implementation difficulties at that stage of the process. Harsh reality will get its chance to force compromises later.

I think CEV is one proposal to consider, useful to focus discussion. I hate it, myself, and suspect that the majority of mankind would agree. I don't want some machine that I have never met and don't trust to b...

1timtyler10yThat seems unlikely to help. Luddites have never had any power. Becoming a Luddite usually just makes you more xxxxxd.

Karma scores in this thread suggest it falls in reference class of "arguing against groupthink", which ironically increases estimates of Eliezer being a crackpot, and lesswrong turning into a cult, possibly via evaporative cooling.

No, that's really not borne out by the evidence. Multifoliaterose's posts have been strongly upvoted, it seems to me, by a significant group of readers who see themselves as defenders against groupthink. It's just that you have been voted down for refusing to see a distinction that's clearly there, between "here ...

2taw10yIs there a way to find a random sample of threads with heavy downvoting? My experience on reddit suggests it's usually groupthink.
1orthonormal10ySet your preferences to only hide comments below -5. Go to an old Open Thread or a particularly large discussion, and search for "comment score below threshold".

Anyway, the huge modern wealth inequalities are well established - and projecting them into the future doesn't seem especially controversial.

Projecting anything into a future with non-human intelligences is controversial. You have made an incredibly large assumption without realizing it. Please update.

That FAI will significantly change things is a pretty conclusive antiprediction. Status quo hath no moral power.

1rwallace10yAgreed. We aren't working on new technology with the intent of letting it gather dust while people continue to suffer and die.

The word "safety" as you used it here has nothing to do with our concern. If your sense of "safety" is fully addressed, nothing changes.

I think that I know the scientific community better than you, and have confidence that if creating an AGI was as easy as you seem to think it is (how easy I don't know because you didn't give a number) then there would be people in the scientific community who would be working on AGI.

Um, and there aren't?

1multifoliaterose10yGive some examples. There may be a few people in the scientific community working on AGI, but my understanding is that basically everybody is doing narrow AI.
5Vladimir_Nesov10yWhat is currently called the AGI field will probably bear no fruit, perhaps except for the end-game when it borrows then-sufficiently powerful tools from more productive areas of research (and destroys the world). "Narrow AI" develops the tools that could eventually allow the construction of random-preference AGI.
4Nick_Tarleton10yThe folks here [http://agi-conf.org/], for a start.

AGI researchers who are not concerned with Friendliness are trying to destroy human civilization. They may not believe that they are doing so, but this does not change the fact of the matter. If FAI is important, only people who are working on FAI can be expected to produce positive outcomes with any significant probability.

7Morendil10y"Trying to" normally implies intent. I'll grant that someone working on AGI (or even narrower AI) who has become aware of the Friendliness problem, but doesn't believe it is an actual threat, could be viewed as irresponsible - unless they have reasoned grounds to doubt that their creation would be dangerous. Even so, "trying to destroy the world" strikes me as hyperbole. People don't typically say that the Manhattan Project scientists were "trying to destroy the world" even though some of them thought there was an outside chance [http://en.wikipedia.org/wiki/Manhattan_Project#cite_ref-14] it would do just that. On the other hand, the Teller report on atmosphere ignition should be kept in mind by anyone tempted to think "nah, those AI scientists wouldn't go ahead with their plans if they thought there was even the slimmest chance of killing everyone".

Eliezer has not proved himself to be at the same level as the average machine learning PhD at getting things done.

He actually stated that himself several times.

So I do understand that, and I did set out to develop such a theory, but my writing speed on big papers is so slow that I can't publish it. Believe it or not, it's true.

Yes, ok, this does not mean his intellectual power isn't on par, only that his ability to function in an academic environment isn't.

As far as I know he has no experience with narrow AI research.

Well...

I tried - once - going to an

...

Try peak oil/anti-nuclear/global warming/etc. activists then? They tend to claim their movement saves the world, not themselves personally, but I'm sure I could find a sufficient number of them who also had some personality cult thrown in.

1xamdam10ySure, but that would 1) reduce your 1/100000 figure, esp. if you take only the leaders of the said movement. And I would not find claims of saving the world by anti-nuke scientists in say the 1960s preposterous. I think that if you accept that AGI is "near", that FAI is important to try in order to prevent it, and that EY was at the very least the person who brought the spotlight to the problem (which is a fact), you can end up thinking that he might actually make a difference.
5Paul Crowley10yYeah, I'm tickled by the estimate that so far 0 people have saved the world. How do we know that? The world is still here, after all.
1Morendil10yEliezer has already placed a Go stone [http://lesswrong.com/lw/jq/926_is_petrov_day/] on that intersection, it turns out.
2CarlShulman10yAs the comments discuss, that was not an extinction event, barring further burdensome assumptions about nuclear winter or positive feedbacks of social collapse.
2taw10yI already did, there was a huge number of such movements, most of them highly obscure (not unlike Eliezer). I'd expect some power law distribution in prominence, so for every one we've heard about there'd be far more we didn't. I don't, and the link from AGI to FAI is as weak as from oil production statistics to civilizational collapse peakoilers promised.

Generally speaking, your argument isn't very persuasive unless you believe that the world is doomed without FAI and that direct FAI research is the only significant contribution you can make to saving it.

The argument I gave doesn't include justification of things it assumes (that you referred to). It only serves to separate the issues with claims about a person from issues with claims about what's possible in the world. Both kinds of claims (assumptions in the argument I gave) could be argued with, but necessarily separately.

A million? The only source of that quantity of would-be saviours I can think of is One True Way proselytising religions, but those millions are not independent -- Christianity and Islam are it.

There has been at least one technological success, so that's a success rate of 1 out of 3, not 0 out of a million.

But the whole argument is wrong. Many claimed to fly and none succeeded -- until someone did. Many claimed transmutation and none succeeded -- until someone did. Many failed to resolve the problem of Euclid's 5th postulate -- until someone did. That no-on...

4taw10yJust for a starter:
* http://en.wikipedia.org/wiki/List_of_messiah_claimants
* http://en.wikipedia.org/wiki/List_of_people_considered_to_be_deities
* http://en.wikipedia.org/wiki/Category:Deified_people
* http://en.wikipedia.org/wiki/Jewish_Messiah_claimants
And for every notable prophet or peace activist or whatever there are thousands forgotten by history. And if you count Petrov - it's not obvious why as he didn't save the world - in any case he wasn't claiming that he's going to save the world earlier, so P(saved the world|claimed to be world-savior) is less than P(saved the world|didn't claim to be world-savior). You seem to be horribly confused here. I'm not arguing that nobody will ever save the world, just that a particular person claiming to is extremely unlikely. Given how low the chance is, I'll pass.
3orthonormal10yYou should count Bacon, who believed himself (accurately) to be taking the first essential steps toward understanding and mastery of nature for the good of mankind. If you don't count him on the grounds that he wasn't concerned with existential risk, then you'd have to throw out all prophets who didn't claim that their failure would increase existential risk.
2RichardKennaway10yI'll give you more than two, but that still doesn't amount to millions, and not all of those claimed to be saving the world. But now we're into reference class tennis. Is lumping Eliezer in with people claiming to be god more useful than lumping him in with people who foresee a specific technological existential threat and are working to avoid it? Of course, but the price of the Spectator's Argument is that you will be wrong every time someone does save the world. That may be the trade you want to make, but it isn't an argument for anyone else to do the same.

Honestly, I don't think Eliezer would look overly eccentric if it weren't for LessWrong/Overcomingbias. Comp sci is notoriously eccentric, AI research possibly more so. The stigma against Eliezer isn't from his ideas, it isn't from his self confidence, it's from his following.

Kurzweil is a more dulled case: he has good ideas, but is clearly sensational, he has a large following, but that following isn't nearly as dedicated as the one to Eliezer (not necessarily to Eliezer himself, but to his writings and the "practicing of rationality"). And the ...

2ata10yWould you include SL4 there too? I think there were discussions there years ago (well before OB, and possibly before Kurzweil's overloaded Singularity meme complex became popular) about the perception of SIAI/Singularitarianism as a cult. (I wasn't around for any such discussions, but I've poked around in the archives from time to time. Here is one example. [http://www.sl4.org/archive/0508/12003.html])

In high school I went through a period when I believed that I was a messianic figure whose existence had been preordained by a watchmaker God who planned for me to save the human race. It's appropriate to say that during this period of time I suffered from extreme delusions of grandeur. I viscerally understand how it's possible to fall into an affective death spiral.

Not that the two are exclusive, but this sounds an awful lot like a manic episode. I assume you gave that due consideration?

So why was this post voted down so far? It appears to be a relevant and informative link to a non-crank source, with no incivility that I could see.

3Paul Crowley10yWith an introduction like that, the link should go to a recent announcement in a major scientific journal by a lot of respected people based on overwhelming evidence, not this one guy writing a non-peer-reviewed argument about an experiment ten years ago that AFAICT most physicists see as perfectly consistent with our existing understanding of QM.
3wedrifid10yOverconfidence in the assertion. Presumption of a foregone conclusion. It was a relevant link and I enjoyed doing the background reading finding out just how seriously relevant authorities take this fellow's stance. He is not a crank but he is someone with a large personal stake. The claim in the article seems to have an element of spin in the interpretations of interpretations as it were. I did lower my confidence in how well I grasp QM but much of that confidence was restored once I traced down some more expert positions and scanned some wikipedia articles. I focussed in particular on whether MW is a 'pure' interpretation. That is, whether it does actually deviate from the formal math.
0endoself10yIt is a source targeted at the general public, which unfortunately does not know enough to hire a competent columnist. John Cramer has used the wrong equations to arrive at an incorrect description of the Afshar experiment, which he uses to justify his own interpretation of QM, which he wants to be correct. The experiment is not in conflict with the known laws of physics. In general, I advise you to mistrust reports of recent developments in physics, if you have no physics training. I check a number of popular sources occasionally and about half of the articles are either wrong or misleading. For example, you may have recently heard about Erik Verlinde's theories about entropic gravity. If gravity were an entropic force, gravitational field would cause extremely rapid decoherence, preventing, for example, the standard double-slit experiment. This is obviously not observed, yet this theory is one of the more well-known ones among physics fans.
0Strange710yIncivility gets most of the big downvotes, and genuine insight gets the big upvotes, but I've noticed that the +1s and -1s tend to reflect compliance with site norms more than skill. This is worrying, of course, but I'm not equipped to fix it.
1TheOtherDave10yIf the stated rule for voting is "upvote what you want more of; downvote what you want less of," and the things that are getting upvoted are site norms and the things that are getting downvoted aren't, one interpretation is that the system is working properly: they are site norms precisely because they are the things people want more of, which are therefore getting upvoted.
0wedrifid10y:P Skill? What is this skill of which you speak? Ignore it and write comments worth +5. :)
0Strange710yIt's easier to write five yes-man quotes for +1 each than one +5 comment, which seems like a flawed incentive system.
1wedrifid10yThat isn't my experience. When in the mood to gain popularity +5 comments are easy to spin while bulk +1s take rather a lot of typing. I actually expect that even trying to get +1s I would accidentally get about at least 1/5th as many +5s as +1s. Edit: I just scanned back through the last few pages of my comments. I definitely haven't been in a 'try to appear deep and insightful' kind of mood and even so more karma came from +5s than +1s. I was surprised because I actually thought my recent comments may have been an exception.
2Paul Crowley10yThis is what I find, scanning back over my last 20 comments. My last 30 include a +19 so I didn't even bother. And of course karma is a flawed incentive system. It's not meant as an incentive system.
0wedrifid10yI actually ignored everything that wasn't exactly a +5 to make the world that much less convenient. :P

This is a tiny minority opinion, based on math that is judged incorrect by the overwhelming majority of experts.

0[anonymous]10yCan someone link to a good explanation of all this. Or write one?

In general, when one individual asserts that something seems very likely to them it isn't helpful to simply assert that the opposite seems extremely likely without giving some minimal reasoning for why you think that will be the case.

How do you support this? Have you done a poll of mainstream scientists (or better yet - the 'best' ones)?

I have not done a poll of mainstream scientists. Aside from Shane Legg, the one mainstream scientist who I know of who has written on this subject is Scott Aaronson in his The Singularity Is Far article.

I was not claiming that I have strong grounds for confidence in my impressions of expert views. But it is the case that if there's a significant probability that we'll see AGI over the next 15 years, mainstream scientists are apparently oblivious to this...

1jacob_cannell10yYes, from my reading of Shane Legg I think his prediction is a reasonable inside view and close to my own. But keep in mind it is also something of a popular view. Kurzweil's latest tome was probably not much new news for most of its target demographic (silicon valley). I've read Aaronson's post and his counterview seems to boil down to generalized pessimism, which I don't find to be especially illuminating. However, he does raise the good point about solving subproblems first. Of course, Kurzweil spends a good portion of TSIN summarizing progress in sub-problems of reverse engineering the brain. There appears to be a good deal of neuroscience research going on right now, but perhaps not nearly enough serious computational neuroscience and AGI research as we may like, but it is still proceeding. MIT's lab is no joke. There is some sort of strange academic stigma though as Legg discusses on his blog - almost like a silent conspiracy against serious academic AGI. Nonetheless, there appears to be no stigma against the precursors, which is where one needs to start anyway. I do not think we can infer their views on this matter based on their behaviour. Given the general awareness of the meme I suspect a good portion of academics in general have heard of it. That doesn't mean that anyone will necessarily change their behavior. I agree this seems really odd, but then I think - how have I changed my behavior? And it dawns on me that this is a much more complex topic. For the IEEE singularity issue - just google it .. something like "IEEE Singularity special issue". I'm having slow internet atm. Because any software problem can become easy given enough hardware. For example, we have enough neuroscience data to build reasonably good models of the low level cortical circuits today. We also know the primary function of perhaps 5% of the higher level pathways. For much of that missing 95% we have abstract theories but are still very much in the dark.
With enough comput

I find it fairly likely that the class will expand dramatically if there's a breakthrough that brings AGI within reach.

I should hope not! If that happens, it means the person who made the breakthrough released it to the public. That would be a huge mistake, because it would greatly increase the chances of an unfriendly AI being built before a friendly one.

Despite factors (i) and (ii), putting all of the information that I have together, my estimate of 10^(-9) still feels about right to me.

That's only because you said it in public and aren't willi...

1timtyler10yYou are so concerned about the possibility of failure that you want to slow down research, publication and progress in the field - in order to promote research into safety? Do you think all progress should be slowed down - or just progress in this area? The costs of stupidity are a million road deaths a year, and goodness knows how many deaths in hospitals. Intelligence would have to be pretty damaging to outweigh that. There is an obvious good associated with publication - the bigger the concentration of knowledge about intelligent machines there is in one place, the greater wealth inequality is likely to result, and the harder it would be for the rest of society to deal with a dominant organisation. Spreading knowledge helps spread out the power - which reduces the chance of any one group of people becoming badly impoverished. Such altruistic measures may help to prevent a bloody revolution from occurring.
1multifoliaterose10yTwo points: 1. It seems very likely to me that there's a string of breakthroughs which will lead to AGI and that it will gradually become clear to people that they should be thinking about friendliness issues. 2. Even if there's a single crucial breakthrough, I find it fairly likely that the person who makes it will not have friendliness concerns in mind. I believe that the human brain is extremely poorly calibrated to determining probabilities through the explicit process that you describe and that the human brain's intuition is often more reliable for such purposes. My attitude is in line with Holden's comments 14 [http://blog.givewell.org/2010/06/29/singularity-summit/#comment-156676] and 16 [http://blog.givewell.org/2010/06/29/singularity-summit/#comment-156859] on the GiveWell Singularity Summit thread [/]. In line with the last two paragraphs of one of my earlier comments [http://lesswrong.com/lw/2lr/the_importance_of_selfdoubt/2k45?c=1], I find your quickness to assume that my thinking on these matters stems from motivated cognition disturbing. Of course, I may be exhibiting motivated cognition, but the same is true of you, and your ungrounded confidence in your superiority to me is truly unsettling. As such, I will cease to communicate further with you unless you resolve to stop confidently asserting that I'm exhibiting motivated cognition.

Selecting a random English sentence will introduce a bias towards concepts that are easy to express in English.

My answer (for why I don't believe in a popular religion as a form of giving in to a Pascal's Mugging) would be that I'm simultaneously faced with a number of different Pascal's Muggings, some of which are mutually exclusive, so I can't just choose to give in to all of them. And I'm also unsure of what decision theory/prior/utility function I should use to decide what to do in the face of such Muggings. Irreversibly accepting any particular Mugging in my current confused state is likely to be suboptimal, so the best way forward at this point seems to be to work on the relevant philosophical questions.

0endoself10yThat's what I think too! You're only the second other person I have seen make this explicit, so I wonder how many people have even considered this. Do you think more people would benefit from hearing this argument?
0Wei_Dai10ySure, why do you ask? (If you're asking because I've thought of this argument but haven't already tried to share it with a wider audience, it probably has to do with reasons, e.g., laziness, that are unrelated to whether I think more people would benefit from hearing it.)
0endoself10yI was considering doing a post on it, but there are many posts that I want to write, many of which require research, so I avoided implying that it would be done soon/ever.

A Christian faced with an analogous Christian prophet would denounce him as the Antichrist.

This is sect-dependent. The Mormons would probably be quite happy to accept one provided he attained prophet-hood through church-approved channels.

The conjunction fallacy is the assignment of a higher probability to some statement of the form A&B than to the statement A. It is well established that for certain kinds of A and B, this happens.

The fallacy in your proof that this cannot happen is that you have misstated what the conjunction fallacy is.

My point in mentioning it is that people committing the fallacy believe a logical impossibility. You can't get much more improbable than a logical impossibility. But the conjunction fallacy experiments demonstrate that is common to believe such things.

T...
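
For readers who want the probability-theory side of this spelled out: a minimal sketch, assuming a made-up joint distribution over two statements A and B (the four numbers below are purely illustrative, not taken from any experiment). It shows why assigning P(A&B) > P(A) is incoherent: the outcomes where both hold are a subset of the outcomes where A holds.

```python
from fractions import Fraction

# Hypothetical joint distribution over (A, B); the weights are illustrative only.
joint = {
    (True, True): Fraction(1, 10),
    (True, False): Fraction(2, 10),
    (False, True): Fraction(3, 10),
    (False, False): Fraction(4, 10),
}

def prob(event):
    """Sum the probability of every outcome (a, b) for which event(a, b) is true."""
    return sum(p for (a, b), p in joint.items() if event(a, b))

p_a = prob(lambda a, b: a)              # P(A)
p_a_and_b = prob(lambda a, b: a and b)  # P(A & B)

# Every outcome counted in P(A & B) is also counted in P(A),
# so the conjunction can never be the more probable of the two.
print(p_a, p_a_and_b, p_a_and_b <= p_a)
```

Whatever weights are put in the table, the last comparison comes out the same way, which is the sense in which the judgment measured in the experiments is a logical impossibility rather than merely a bad estimate.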

Complete agreement, but downvoted for making comments that don't promote paperclips.

2thomblake10yI think Clippy was just testing whether ve'd successfully promoted that to a community norm.

Yes, there are trillions of possible religions that differ from one another as much as Islam differs from Judaism, or whatever. But only a few of these are believed by human beings.

Privileging the hypothesis! That they are believed by human beings doesn't lend them probability.

No. It doesn't lend probability, but it seems like it ought to lend something. What is this mysterious something? Let's call it respect.

Privileging the hypothesis is a fallacy. Respecting the hypothesis is a (relatively minor) method of rationality.

We respect the hypotheses t...

4Vladimir_Nesov10yNo, it's a method of anti-epistemic horror.
3FAWS10yYou can dispense with this particular concept of respect since in both your examples you are actually supplied with sufficient Bayesian evidence to justify evaluating the hypothesis, so it isn't privileged. Whether this is also the case for believed in religions is the very point contested.

Well, it does to the extent that lack of believers would be evidence against them. I'd say that Allah is considerably more probable than a similarly complex and powerful god who also wants to be worshiped and is equally willing to interact with humans, but not believed in by anyone at all. Still considerably less probable than the prior of some god of that general sort existing, though.

What about the conjunction fallacy?

E includes C implies that K(C) <= K(E) + K(information needed to locate C within E). In this case K(information needed to locate C within E) seems small enough not to matter to the overall argument, which is why I left it out. (Since you said "this is a small point" I guess you probably understand and agree with this.)

1[anonymous]10yActually no I hadn't thought of that. But I wonder if the amount of information it takes to locate "lots of people are muslims" within E is as small as you say. My particular E does not even contain that much information about Islam, and how people came to believe it, but it does contain a model of how people come to believe weird things in general. Is that a misleading way of putting things? I can't tell.

You can edit comments after submitting them -- when logged in, you should see an edit button.

By the way, I'm reading your part 15, section 2 now.

Are you aware that most species that have ever lived have indeed been wiped out? Not thinking about such possibilities worked well for them, eh?

EDIT: And of course we can also present scholarly analyses of why extinction in the case of our species is not particularly unlikely: http://www.nickbostrom.com/fut/evolution.html

on what evidence would you take seriously someone's claim to be doing effective work against an existential threat?

Eliezer's claims are not that he's doing effective work, his claims are pretty much of being a messiah saving humanity from super-intelligent paperclip optimizers. That requires far more evidence. Ridiculously more, because you not only have to show that his work reduces some existential threat, but at the same time it doesn't increase some other threat to a larger degree (pro-technology vs anti-technology crowds suffer from this - it's not o...

No, the Permanent Mission of the Russian Federation to the United Nations disagrees with this story, and Wikipedia quotes that disagreement. The very next section explains why that disagreement may be incorrect.

I've never gotten that impression. What I've gotten is that evolutionary pressures will, in the long term, still exist--even if technological self-modification leads to a population that's 99.99% satisfied to live within strict resource consumption limits, unless they harshly punish defectors the .01% with a drive for replication or expansion will overwhelm the rest within a few millennia, until the average income is back to subsistence. This doesn't depend on human preferences, just the laws of physics and natural selection.

1WrongBot10yWhat evolutionary pressures? Even making the incredible assumption that we will continue to use sequences of genes as a large part of our identities, what's to stop a singleton of some variety from eliminating drives for replication or expansion entirely? I feel uncomfortable speculating about a post-machine-intelligence future even to this extent; this is not a realm in which I am confident about any proposition. Consequently, I view all confident conclusions with great skepticism.
4khafra10yYou're still not getting the breadth and generality of Hanson's model. To use recent LW terminology, it's an anti-prediction. It doesn't matter whether agents perpetuate their strategies by DNA mixing, binary fission, cellular automata, or cave paintings. Even if all but a tiny minority of posthumans self-modify not to want growth or replication, the few that don't will soon dominate the light-cone. A singleton, like I'd mentioned, is one way to avert this. Universal extinction and harsh, immediate punishment of expansion-oriented agents are the only others I see.
1WrongBot10yYou (or Robin, I suppose) are just describing a many-agent prisoner's dilemma. If TDT agents beat the dilemma by cooperating with other TDT agents, then any agents that started out with a different decision theory will have long since self-modified to use TDT. Alternately, if there is no best decision theoretical solution to the prisoner's dilemma, then we probably don't need to worry about surviving to face this problem.

Counter: superintelligent agents won't need actually-existing humans to have good models of other alien races.

Counter to the counter: humans use up only a tiny fraction of the resources available in the solar system and surroundings, and who knows, maybe the superintelligence sees a tiny possibility of some sort of limit to the quality of any model relative to the real thing.

One possible counter to the counter to the counter: but when the superintelligence in question is first emerging, killing humanity may buy it a not-quite-as-tiny increment of probability of not being stopped in time.

[anonymous]10y 2

Why are people boggling at the 1-in-a-billion figure? You think it's not plausible that there are three independent 1-in-a-thousand events that would have to go right for EY to "play a critical role in Friendly AI success"? Not plausible that there are nine 1-in-10 events that would have to go right? Don't I keep hearing "shut up and multiply" around here?

Edit: Explain to me what's going on. I say that it seems to me that events A, B are likely to occur with probability P(A), P(B). You are allowed to object that I must have made a mi...

The 1-in-a-billion follows not from it being plausible that there are three such events, but from it being virtually certain. Models without such events will end up dominating the final probability. I can easily imagine that if I magically happened upon a very reliable understanding of some factors relevant to future FAI development, the 1 in a billion figure would be the right thing to believe. But I can easily imagine it going the other way, and absent such understanding, I have to use estimates much less extreme than that.

1multifoliaterose10yYes, this is of course what I had in mind.
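
The arithmetic in this exchange can be checked directly. What follows is a minimal sketch, not anyone's actual estimate: the three 1-in-1000 events and nine 1-in-10 events come from the comment above, while the 99%/1% credences and the 1-in-1000 alternative model are hypothetical numbers chosen only to illustrate the point that models without such events can dominate the final probability.

```python
from fractions import Fraction

def conjunction(probabilities):
    """Probability that every one of a set of independent events occurs."""
    result = Fraction(1)
    for p in probabilities:
        result *= p
    return result

def mixture(models):
    """Overall probability from (credence, estimate) pairs; credences must sum to 1."""
    assert sum(credence for credence, _ in models) == 1
    return sum(credence * estimate for credence, estimate in models)

# Both decompositions from the thread give one in a billion.
three_rare = conjunction([Fraction(1, 1000)] * 3)
nine_unlikely = conjunction([Fraction(1, 10)] * 9)

# Hypothetical: 99% credence in the three-rare-events model,
# 1% credence in some other model that gives 1 in 1000.
blended = mixture([
    (Fraction(99, 100), three_rare),
    (Fraction(1, 100), Fraction(1, 1000)),
])

print(three_rare, nine_unlikely, blended, float(blended))
```

Under these assumed credences the blended figure is about 1 in 100,000, roughly four orders of magnitude above 1 in a billion, and almost all of it comes from the model held with only 1% credence.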

Perhaps compare a doomsday cult with a drug addict:
The outside view (e.g. of family and practitioners) looks one way - while the inside view often looks pretty different.

That's not what "inside view" means. The way you seem to intend it, it admittedly is a useless tool, but having it as an option in the false dichotomy together with reference class tennis is transparently disingenuous (or stupid).

Not knowing that a problem exists is pretty different from acknowledging it and working on it.

That's not true for any reasonable definition of "belief," least of all a Bayesian one. If all the raffle participants believed "I am likely to win," or "I am certain to win," then they are all wrong and they will all remain wrong after one of them wins. If all the raffle participants believed "I have a one in a billion chance to win," then they are all correct and they will all remain correct.

???

Of course. But no English speaker would utter the phrase "I will win this raffle" as a gloss for "I hav...

I wasn't making an analogy. I am surprised by that interpretation. I was providing a counterexample to the claim that it is absurd to prohibit accurate beliefs. One of my raffle-players has an accurate belief, but that player's belief is nonetheless prohibited by the norms of rationality.

I'm wondering where people said AI development was going to be easy.

2wedrifid10yIndeed. There was a post "shut up and do the impossible" for a reason!
2NancyLebovitz10yAnd I'm wondering where it was said that superintelligent AI will solve all our problems.
1rwallace10yThe original idea of the SIAI was that when they (or someone else) implement superintelligent AI, it takes over the world and implements CEV (or turns everyone into paperclips or whatever). Is their current position on that also more moderate?

The mechanism that determines human action is that we do what makes us feel good (at the margin) and refrain from doing what makes us feel bad (at the margin).

"The" mechanism? Citation needed.

a fundamental mechanism of the human brain which was historically correlated with gaining high status is to make us feel good when we have high self-image and feel bad when we have low self-image.

Better, but still unsupported and unclear. What was correlated with what?

[anonymous]10y 1

Since it redirects, the relevant history page is the technological singularity history page. Namely, this one. And there was indeed a recent change to the first sentence. See for example this comparison.

Wait, did you really mean "no, the page has always redirected there" instead of "no, the page does not, in fact, redirect there"?

2AdeleneDawner10y"A page that is extremely similar to X" implies "a page that is not X", assuming normal use of the English language. The rapture of the nerds page has always led to the technological singularity page, and the technological singularity page is not a page that is not the technological singularity page. Reading the relevant comment with the strictest possible definitions of all the terms, it's technically correct, but the way that the comment is structured implies an interpretation other than the one that is true, and it could easily have been structured in a way that wouldn't imply such an interpretation.
1CuSithBell10yHuh. Put like that, I guess I understand now, but it seems as though your refutation could also have been more clear on that point. Thanks for the disentangling!
0timtyler10yThe pages are subtly different - in the way I described in detail in my original comment [http://lesswrong.com/lw/2lr/the_importance_of_selfdoubt/462q]. Count the words in the first sentence - the one starting: "A technological singularity is..." to see the difference. My guess is that a Wikipedia "redirect" allows for a prefix header to be prepended, which would explain the difference.
1AdeleneDawner10yAll four versions of the page - redirect and not, secure and not - start with the same two sentences for me: "A technological singularity is a hypothetical event. It will occur if technological progress becomes so rapid and the growth of super-human intelligence so great that the future (after the singularity) becomes qualitatively different and harder to predict." I suspect you have a cache issue.
0timtyler10yThat seems likely. I used http://hidemyass.com/proxy/ [http://hidemyass.com/proxy/] - and it gives a more consistent picture.
1[anonymous]10yMuch time could have been saved had you copied and pasted the two diverging sentences rather than asking people to count the words. For indeed there was a recent change in the page, and if this was the source of the difference, then had you provided the exact sentences then the cause could have been determined quickly, avoiding a lot of back and forth. Copying and pasting from a comparison, the slightly earlier version is: The slightly more recent version is: The rest of the earlier sentence was split off into separate sentences.
0wedrifid10yNot that I am necessarily one to talk but much time could have been saved if nobody argued about such an irrelevant technicality. ;)
1[anonymous]10yIt was the key and only evidence in an accusation of lying, which is a pretty damn serious accusation that should neither be taken lightly nor made lightly. The evidence was small but the role it played in the accusation made it important. If your point is that the accuser should have held their tongue so to speak, you may be right. But they didn't, and so the question took on importance.
2wedrifid10yYes, responding to accusations of lying is important. Making them, not so much. :)
0AdeleneDawner10y*nods* *edits ancestral comment*

Harumpf.

Adelene, you are still being very discourteous!

I recommend that you calm down, try to be polite - and go a bit easier in the future on the baseless accusations and recriminations.

Before you start flinging accusations around, perhaps check, reconsider - or get a second opinion?

To clarify, for me, http://en.wikipedia.org/wiki/Rapture_of_the_Nerds still gives me:

Technological singularity

From Wikipedia, the free encyclopedia

(Redirected from Rapture of the Nerds)

1Morendil10yMaybe Adelene meant that "now" is an untruth, in that it implies a change occurring between the timestamp of the comment you reply to and the reply itself. A truthful observation would be "RotN has always redirected to a page that, etc."
0timtyler10yThe implication that you refer to is based on a simple misunderstanding of my comment - and does not represent a "lie" on my part.

Interesting, but I don't think that's the right characterization of the content of the link. It's John Cramer (proponent of the transactional interpretation) claiming that the Afshar Experiment's results falsify both Copenhagen and MWI. I think you're better off reading about the experiment directly.

As for your "not-so-fair response" - I seriously doubt that you know enough about academia to have any confidence in this view. I think that first hand experience is crucial to developing a good understanding of the strengths and weaknesses of academia.

I definitely don't have the necessary first-hand-experience: I was reporting second-hand the impressions of a few people who I respect but whose insights I've yet to verify. Sorry, I should have said that. I deserve some amount of shame for my lack of epistemic hygiene there.

(I say this with a

...

A fair response to this requires a post that Less Wrong desperately needs to read: People Are Crazy, the World Is Mad. Unfortunately this requires that I convince Michael Vassar or Tom McCabe to write it. Thus, I am now on a mission to enlist the great power of Thomas McCabe.

(A not-so-fair response: you underestimate the extent to which academia is batshit insane just like nearly every individual in it, you overestimate the extent to which scientists ever look outside of their tiny fields of specialization, you overestimate the extent to which the most rat...

I personally would be interested in working with Eliezer if he appeared to me to be well grounded. The impressions that I've gotten from my private correspondence with Eliezer and from his comments have given me a very strong impression that I would find him too difficult to work with for me to be able to do productive FAI research with him.

My reading of this is that before you corresponded privately with Eliezer, you were

  1. interested in personally doing FAI research
  2. assigned high enough probability to Eliezer's success to consider collaborating with hi
...
3multifoliaterose10ySo, the situation is somewhat different than the one that you describe. Some points of clarification.
• I first came across Overcoming Bias in 2008. Eliezer was recommended to me by a friend who I respect a great deal. My reaction to the first postings that I read by Eliezer was strong discomfort with his apparent grandiosity and self-absorption. This discomfort was sufficiently strong for me to lose interest despite my friend's endorsement.
• I started reading Less Wrong in earnest in the beginning of 2010. This made it clear to me that Eliezer has a lot to offer and that it was unfortunate that I had been pushed away by my initial reaction.
• I never assigned a very high probability to Eliezer making a crucial contribution to an FAI research project. My thinking was that the enormous positive outcome associated with success might be sufficiently great to justify the project despite the small probability.
• I didn't get much of a chance to correspond privately with Eliezer at all. He responded to a couple of my messages with one-line dismissive responses and then stopped responding to my subsequent messages. Naturally this lowered the probability that I assigned to being able to collaborate with him. This also lowered my confidence in his ability to attract collaborators in general.
• If Eliezer showed strong ability to attract and work well with collaborators (including elite academics who are working on artificial intelligence research) then I would find it several orders of magnitude more likely that he would make a crucial contribution to an FAI research project. For concreteness I'll throw out the number 10^(-6).
• I feel that the world is very complicated and that randomness plays a very large role. This leads me to assign a very small probability to the proposition that any given individual will play a crucial role in eliminating existential risk.
• I freely acknowledge that I may be influenced by emotional factors.
I make an honest effort at being level
8komponisto10yI'd be really interested to know which posts these were, because it would help me to distinguish between the following interpretations: (1) First impressions really do matter: even though you and I are probably very similar in many respects, we have different opinions of Eliezer simply because in the first posts of his I read, he sounded more like a yoga instructor than a cult leader; whereas perhaps the first thing you read was some post where his high estimation of his abilities relative to the rest of humanity was made explicit, and you didn't have the experience of his other writings to allow you to "forgive" him for this social transgression. (2) We have different personalities, which cause us to interpret people's words differently: you and I read more or less the same kind of material first, but you just interpreted it as "grandiose" whereas I didn't. What's interesting in any case is that I'm not sure that I actually disagree with you all that much about Eliezer having a small chance of success (though I think you quantify it incorrectly with numbers like 10^(-9) or 10^(-6) -- these are way too small). Where we differ seems to be in the implications we draw from this. You appear to believe that Eliezer and SIAI are doing something importantly wrong, that could be fixed by means of a simple change of mindset, and that they shouldn't be supported until they make this change. By contrast, my interpretation is that this is an extremely difficult problem, that SIAI is basically the first organization that has begun to make a serious attempt to address it, and that they are therefore worthy of being supported so that they can increase their efforts in the directions they are currently pursuing and potentially have a larger impact than they otherwise would. 
I've been meaning to ask you: given your interest in reducing existential risk, and your concerns about SIAI's transparency and their general strategy, have you considered applying to the Visiting Fellows pr
2multifoliaterose10yRight, so the first posts that I came across were Eliezer's Coming of Age [http://wiki.lesswrong.com/wiki/Yudkowsky%27s_coming_of_age] posts which I think are unrepresentatively self-absorbed. So I think that the right interpretation is the first that you suggest. Since I made my top level posts, I've been corresponding with Carl Shulman who informed me of some good things that SIAI has been doing that have altered my perception of the institution. I think that SIAI may be worthy of funding. Regardless of the merits of SIAI's research and activities, I think that in general it's valuable to promote norms of Transparency and Accountability [http://lesswrong.com/lw/2m5/transparency_and_accountability/?sort=controversial]. I would certainly be willing to fund SIAI if it were strongly recommended by a highly credible external charity evaluator like GiveWell. Note also a comment which I wrote in response to Jasen [http://lesswrong.com/lw/2m5/transparency_and_accountability/2hko?c=1]. I would like to talk more about these things - would you like to share email addresses? PM me if so. At this point I worry that I've alienated the SIAI people to such an extent that they might not be happy to have me. But I'd certainly be willing if they're favorably disposed toward me. I'll remark that back in December after reading Anna Salamon's posting on the SIAI Visiting Fellows program [http://intelligence.org/aboutus/opportunities/visiting-fellow] I did send Anna Salamon a long email expressing some degree of interest and describing some of my concerns without receiving a response. I now find it most plausible that she just forgot about it and that I should have tried again, but maybe you can understand from this how I got the impression that becoming an SIAI Visiting Fellow was not a strong option for me.
2komponisto10yDone. As it happens, the same thing happened to me; it turned out that my initial message had been caught in a spam filter. I eventually ended up visiting for two weeks, and highly recommend the experience.
4Wei_Dai10yThis, along with your other estimate of 10^(-9), implies that your probability for Eliezer being able to eventually attract and work well with collaborators is currently 1/1000. Does that really seem reasonable to you (would you be willing to bet at those odds?), given other evidence besides your private exchange with Eliezer? Such as:
* Eliezer already had a close collaborator, namely Marcello
* SIAI has successfully attracted many visiting fellows
* SIAI has successfully attracted top academics to speak at their Singularity Summit
* Eliezer is currently writing a book on rationality, so presumably he isn't actively trying to recruit collaborators at the moment
* Other people's reports of not finding Eliezer particularly difficult to work with
It seems to me that rationally updating on Eliezer's private comments couldn't have resulted in such a low probability. So I think a more likely explanation is that you were offended [http://lesswrong.com/lw/13s/the_nature_of_offense/] by the implications of Eliezer's dismissive attitude towards your comments. (Although, given Eliezer's situation, it would probably be a good idea for him to make a greater effort to avoid offending potential supporters, even if he doesn't consider them to be viable future collaborators.) Your responses to me seem pretty level-headed and sober. I hope that means you don't find my comments too hostile.
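
The arithmetic behind the 1/1000 figure can be sketched as follows. This is only an illustration, assuming the two estimates quoted in the thread (10^(-9) overall and 10^(-6) conditional on strong collaboration) and the law of total probability; the variable names are invented for the example.

```python
# Figures quoted in the thread (taken as given, not derived here).
p_success = 1e-9               # P(crucial contribution to FAI)
p_success_given_collab = 1e-6  # P(crucial contribution | attracts and works well with collaborators)

# Law of total probability:
#   p_success = p_success_given_collab * p_collab
#             + p_success_given_no_collab * (1 - p_collab)
# The second term is non-negative, so p_collab can be at most the ratio below.
# It equals the ratio if success without collaborators is treated as impossible.
p_collab_upper_bound = p_success / p_success_given_collab

print(f"Implied P(collaborators) is at most {p_collab_upper_bound:.0e}")
```

On these assumptions the implied probability of attracting collaborators is bounded above by the ratio of the two estimates, which is where the one-in-a-thousand figure comes from.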

How about "people who have publicly declared an intention to try to build an FAI"? That seems like a much more relevant reference class, and it's tiny. (I'm not sure how tiny, exactly, but it's certainly smaller than 10^3 people right now.) And if someone else makes a breakthrough that suddenly brings AGI within reach, they'll almost certainly choose to recruit help from that class.

That should be a very easy claim to prove, actually. If someone really were the sysadmin of the universe, they could easily do a wide variety of impossible things that anyone could verify. For example, they could write their message in the sky with a special kind of photon that magically violates the laws of physics in an obvious way (say, for example, it interacts with all elements normally except one which it inexplicably doesn't interact with at all). Or find/replace their message into the genome of a designated species. Or graffiti it onto every la... (read more)

Fair enough. But if we're doing that, I think the original question with the Omega machine abstracts too much away. Let's consider the kind of evidence that we would actually expect to see if Islam were true.

Let us stipulate that, on the 1st of Muḥarram, a prominent ayatollah claims to have suddenly become a prophet. They go on television and answer questions on all topics. All verifiable answers they give, including those to NP-complete questions submitted for experimental purposes, turn out to be true. The new prophet asserts the validity of the Qur'an a... (read more)

4PaulAlmond10yI'll give a reworded version of this, to take it out of the context of a belief system with which we are familiar. I'm not intending any mockery by this: It is to make a point about the claims and the evidence: "Let us stipulate that, on Paris Hilton's birthday, a prominent Paris Hilton admirer claims to have suddenly become a prophet. They go on television and answer questions on all topics. All verifiable answers they give, including those to NP-complete questions submitted for experimental purposes, turn out to be true. The new prophet asserts that Paris Hilton is a super-powerful being sent here from another world, co-existing in space with ours but at a different vibrational something or whatever. Paris Hilton has come to show us that celebrity can be fun. The entire universe is built on celebrity power. Madonna tried to teach us this when she showed us how to Vogue but we did not listen and the burden of non-celebrity energy threatens to weigh us down into the valley of mediocrity when we die instead of ascending to a higher plane where each of us gets his/her own talkshow with an army of smurfs to do our bidding. Oh, and Sesame Street is being used by the dark energy force to send evil messages into children's feet. (The brain only appears to be the source of consciousness: Really it is the feet. Except for people with no feet. (Ah! I bet you thought I didn't think of that.) Today's lucky food: custard." There is a website where you can suggest questions to put to the new prophet. Not all submitted questions get answered, due to time constraints, but interesting ones do get in reasonably often. Are there any questions you'd like to ask?" The point I am making here is that the above narrative is absurd, and even if he can demonstrate some unusual ability with predictions or NP problems (and I admit the NP problems would really impress me), there is nothing that makes that explanation more sensible than any number of other stupid explanations. 
Nor does he ha
1PaulAlmond10yYes - I would ask this question: "Mr Prophet, are you claiming that there is no other theory to account for all this that has less intrinsic information content than a theory which assumes the existence of a fundamental, non-contingent mind - a mind which apparently cannot be accounted for by some theory containing less information, given that the mind is supposed to be non-contingent?" He had better have a good answer to that: Otherwise I don't care how many true predictions he has made or NP problems he has solved. None of that comes close to fixing the ultra-high information loading in his theory.
[-][anonymous]10y 1

Why do you have at most 99.999999999% certainty that they are not? Where does that number one-minus-a-billionth come from?

For simplicity we may assume P(E|H) to be near-certainty: if there is an attention-seeking god, we'd know about it. This leaves P(E) and P(H), and P(H|E) is tiny exactly for the reason you named: P(H) is much smaller than P(E), because H is optimized for meme-spreading to a great extent, which makes the probability of gaining popularity, P(E), comparatively much higher for a given complexity (which translates into P(H)).

Thus, just arguing from complexity indeed misses the point, and the real reason for improbability of cultish claims is that they are highly optim... (read more)

Also... if you haven't been to Australia, is it privileging the hypothesis to accept the word of those who say that it exists? There are trillions of possible countries that could exist that people don't believe exist...

And don't tell me they say they've been there... religious people say they've experienced angels etc. too.

And so on. People's beliefs in religion may be weaker than their belief in Australia, but it certainly is not privileging a random hypothesis.

Privileging the hypothesis!

Begging the question!

This whole discussion is about this very point. Downvoted for contradicting my position without making an argument.

I don't think Pascal recognized any potential symmetry in the first place, or he would have addressed it properly.

I just realized that you may have misunderstood my original point completely. Otherwise you wouldn't have said this: "I thought the salient feature of Islam was that many people believed it, not that it has less complexity than I thought, or more evidence in its favor than I thought."

I only used the idea of complexity because that was komponisto's criterion for the low probability of such claims. The basic idea is people believe things that their priors say do not have too low a probability: but as I showed in the post on Occam's razor, everyone... (read more)

Nitpick: c is a dimensioned quantity, so changes in it aren't necessarily meaningful.

1WrongBot10y*Blink.* *Reads Wikipedia.* Would I be correct in thinking that one would need to modify the relationship of c to some other constant (the physics equations that represent some physical law?) for the change to be meaningful? I may be failing to understand the idea of dimension. Thank you for the excuse to learn more math, by the way.
2Psy-Kosh10yYes, you would be correct, at least in terms of our current knowledge. In fact, it's not that unusual to choose units so that you can set c = 1 (i.e., to make it unitless). This way units of time and units of distance are the same kind, velocities are dimensionless geometric quantities, etc. You might want to think of "c" not so much as a speed as a conversion factor between distance type units and time type units.
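
The "conversion factor" reading of c can be made concrete with a small sketch. The SI value of c below is exact by definition of the metre; the helper names and the sample velocity are hypothetical, chosen only for illustration.

```python
# Speed of light in SI units; exact by definition of the metre.
C_M_PER_S = 299_792_458

def seconds_to_metres(t_seconds: float) -> float:
    """Treat c as a conversion factor: express a time interval as a length."""
    return t_seconds * C_M_PER_S

def dimensionless_velocity(v_m_per_s: float) -> float:
    """Express a velocity as a pure number (a fraction of c), as in units where c = 1."""
    return v_m_per_s / C_M_PER_S

one_second_as_length = seconds_to_metres(1.0)   # one light-second, in metres
beta = dimensionless_velocity(29_979_245.8)     # a velocity of one tenth of c

print(one_second_as_length, beta)
```

In this picture, changing the numerical value of c only rescales the conversion between seconds and metres, which is why a change in c alone is not obviously a physical change.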

Would you actually go as far as maintaining that, if a change were to happen tomorrow to the 1,000th decimal place of a physical constant, it would be likely to stop brains from working, or are you just saying that a similar change to a physical constant, if it happened in the past, would have been likely to stop the sequence of events which has caused brains to come into existence?

[-][anonymous]10y 1

Let's consider two situations:

  1. For each 80-digit binary number X, let N(X) be the assertion "Unknowns picked an 80-digit number at random, and it was X." In my ledger of probabilities, I dutifully fill in, for each of these statements X, 2^{-80} in the P column. Now for a particular 80-digit number Y, I am told that "Unknowns claims he picked an 80-digit number at random, and it was Y" -- call that statement U(Y) -- and am asked for P(N(Y)|U(Y)).

My answer: pretty high by Bayes' formula. P(U|N(Y)) is pretty high because Unknowns is ... (read more)

What?!? What makes you think that?

Sensitive dependence on initial conditions is an extremely well-known phenomenon. If you change the laws of physics a little bit, the result of a typical game of billiards will be different. This kind of phenomenon is ubiquitous in nature, from the orbit of planets, to the paths rivers take.

If a butterfly's wing flap can cause a tornado, I figure a small physical constant jog could easily make the difference between intelligent life emerging, and it not doing so billions of years later.

Sensitive dependence on initial co... (read more)

1Kingreaper10yDid you miss this bit: Sensitivity to initial conditions is one thing. Sensitivity to 1 billion SF in a couple of decades?

Artificial wombs

I agree that Kurzweil could have acknowledged P.Z.Myers' expertise a bit more, especially the "nobody in my field expects a brain simulation in the next ten years" bit.

50 MB - that's still a hefty amount of code, especially if it's 50 MB of compiled code and not 50 MB of source code (comparing the size of the source code to the size of the compressed DNA looks fishy to me, but I'm not sure Kurzweil has been actually doing that - he's just been saying "it doesn't require trillions of lines of code").

Is the size of gcc the source code or the compiled version? I didn't see that info on Wikipedia, and don't have gcc on this machine.

2timtyler10yAs I see it, Myers delivered a totally misguided rant. When his mistakes were exposed he failed to apologise. Obviously, there is no such thing as bad publicity.
1RobinZ10yI'm looking at gcc-4.5.0.tar.gz [ftp://gcondra.cs.washington.edu/gnu/gcc/gcc-4.5.0/].
2Emile10yThat includes the source code, the binaries, the documentation, the unit tests, changelogs ... I'm not surprised it's pretty big! I consider it pretty likely that it's possible to program a human-like intelligence with a compressed source code of less than 50 MB. However, I'm much less confident that the source code of the first actual human-like intelligence coded by humans (if there is one) will be that size.

Note that I said there should be a lower bound on the probability for things that people believe, and even made it specific: something on the order of one in a billion. But I don't recall saying (you can point it out if I'm wrong) that there is a lower bound on the probability of things that are raised as possibilities. Rather, I merely said that the probability is vastly increased.

To the comment here, I responded that raising the possibility raised the probability of the thing happening by orders of magnitude. But I didn't say that the resulting probabili... (read more)

[-][anonymous]10y 1

It is always profitable to give different concepts different names.

Let GM be the assertion that I'll one day play guitar on the moon. Your claim is that this ratio

P(GM|I raised GM as a possibility)/P(GM)

is enormous. Bayes' theorem says that this is the same as

P(I raised GM as a possibility|GM)/P(I raised GM as a possibility)

so that this second ratio is also enormous. But it seems to me that both numerator and denominator in this second ratio are pretty medium-scale numbers--in particular the denominator is not minuscule. Doesn't this defeat your idea?
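
The identity used in this argument can be checked numerically. A minimal sketch, assuming a made-up joint distribution over GM and R ("I raised GM as a possibility"); none of the four numbers comes from the thread, and they only need to sum to 1.

```python
# Hypothetical joint probabilities over (GM, R); values are illustrative only.
p_gm_and_r = 1e-12
p_gm_and_not_r = 1e-12
p_not_gm_and_r = 0.3
p_not_gm_and_not_r = 1.0 - p_gm_and_r - p_gm_and_not_r - p_not_gm_and_r

# Marginals.
p_gm = p_gm_and_r + p_gm_and_not_r
p_r = p_gm_and_r + p_not_gm_and_r

# Conditionals.
p_gm_given_r = p_gm_and_r / p_r
p_r_given_gm = p_gm_and_r / p_gm

ratio_1 = p_gm_given_r / p_gm   # P(GM | R) / P(GM)
ratio_2 = p_r_given_gm / p_r    # P(R | GM) / P(R)

# Since P(R | GM) <= 1, both ratios are capped by 1 / P(R).
cap = 1.0 / p_r

print(ratio_1, ratio_2, cap)
```

Both ratios are the same quantity, and neither can exceed 1 / P(R). So the update from raising a possibility can only be enormous if the probability of raising it was itself tiny, which is the point the comment presses.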

Ok, of course it's fictional - hasn't happened yet!

Still, when I imagine something that is smarter than man who created it, it seems it would be able to improve itself. I would bet on that; I do not see a strong reason why this would not happen. What about you? Are you with Hanson on this one?

I set a lower bound of one in a billion on the probability of "a natural language claim that a significant number of people accept as likely true". The number of such mutually exclusive claims is surely far less than a billion, so the math issue will resolve easily.

Yes, it is easy to find more than a billion claims, even ones that some people consider true, but they are not mutually exclusive claims. Likewise, it is easy to find more than a billion mutually exclusive claims, but they are not ones that people believe to be true, e.g. no one expects 1000 heads in a row, no one expects a sequence of five hundred successive heads-tails pairs, and so on.
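
The consistency condition being appealed to here is simple to state in code. A minimal sketch, assuming only that the probabilities of mutually exclusive claims sum to at most 1; the function names are invented for the example.

```python
from fractions import Fraction

LOWER_BOUND = Fraction(1, 10**9)  # the one-in-a-billion floor proposed above

def max_exclusive_claims(lower_bound: Fraction) -> int:
    """Largest number of mutually exclusive claims that can each have
    probability >= lower_bound while their total stays at or below 1."""
    return int(1 / lower_bound)

def floor_is_consistent(n_claims: int, lower_bound: Fraction) -> bool:
    """True if n mutually exclusive claims can all sit at or above the floor."""
    return n_claims * lower_bound <= 1

print(max_exclusive_claims(LOWER_BOUND))
print(floor_is_consistent(10**6, LOWER_BOUND))    # a million exclusive claims
print(floor_is_consistent(2**1000, LOWER_BOUND))  # every sequence of 1000 coin flips
```

The floor is coherent only as long as the set of mutually exclusive, widely believed claims stays under a billion. The coin-flip sequences show why the restriction to claims that people actually believe is doing the work.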

I didn't downvote you.

Thanks for this link. Sounds kind of scary. American political conservatives will be thrilled. "I'm from the CEV and I'm here to help you."

Incidentally, there should be an LW wiki entry for "CEV". The acronym is thrown around a lot in the comments, but a definition is quite difficult to find. It would also be nice if there were a top-level posting on the topic to serve as an anchor-point for discussion. Because discussion is sorely needed.

It occurs to me that it would be very desirable to attempt to discover the CEV of humanity lo... (read more)

Thinking about such things is the necessary first step to preventing such new species from arising that would make you extinct. So yes, if they had thought about these things competently enough, and otherwise been competent enough, it would have enabled them to survive.

Doesn't seem very smart of you to argue against thinking. If you don't think, you're certainly even more screwed than with thinking.

This is different from things like Pascal's wager where the actual probability may vary by many orders of magnitude from our best estimate.

According to the Bayesians, our best estimate is the actual probability. (According to the frequentists, the probabilities in Pascal's wager are undefined.)

What parent means by "We know the probability to a reasonable level of accuracy - eg consider actuarial tables" is that it is possible for a human to give a probability without having to do or estimate a very hairy computation to compute a prior probabil... (read more)

"Karma" is the only definitionally supernatural item on that list - it is defined to be not reducible to nonmental mechanism. The others are merely elements of belief systems which contain elements that are supernatural (e.g. God).

Yes, the concept of "karma" can be reduced to naturalistic roots if you accept metaphysical naturalism, but the actual thing cannot be. It's the quotation which you can reduce, not the referent.

[-][anonymous]10y 1

Bacon doesn't seem to have any special impact on anything

Man, I hope you don't mean that.

He believed that the scientific method he developed and popularized would improve the world in ways that were previously unimaginable. He was correct, and his life accelerated the progress of the scientific revolution.

The claim may be weaker than a claim to help with existential risk, but it still falls into your reference class more easily than a lot of messiahs do.

You know, that is the first time I have seen a definition of FAI. Is that the "official" definition or just your own characterization?

My own characterization. It's more of a bare minimum baseline criterion for Friendliness, rather than a specific definition or goal; it's rather broader than what the SIAI people usually mean when they talk about what they're trying to create. CEV is intended to make the world significantly better on its own (but in accordance with what humans value and would want a superintelligence to do), rather than just bei... (read more)

No, in general p(n beings similar to A can do X) does not equal n multiplied by p(A can do X).

Yes, strictly speaking we'd need even more, if that.

No. There is a very small chance that I will be able to move my couch down the stairs alone. But it's fairly likely that I and my friend will be able to do it together.

Similarly, 10^5 Eliezer-level researchers would together constitute a research community that could do things that Eliezer himself has less than probability 10^(-5) of doing on his own.
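
The couch example can be turned into a toy model. This is purely an illustrative assumption, not anything claimed in the thread: each person's available effort is an independent draw from Uniform(0, 1), and the task succeeds when the combined effort reaches a threshold.

```python
def p_alone(threshold: float) -> float:
    """P(U >= threshold) for a single effort U ~ Uniform(0, 1)."""
    return 1.0 - threshold

def p_pair(threshold: float) -> float:
    """P(U1 + U2 >= threshold) for two independent Uniform(0, 1) efforts.
    Valid for 0 <= threshold <= 1: the failure region is a triangle of area t^2 / 2."""
    return 1.0 - threshold**2 / 2.0

t = 0.95                      # assumed effort needed to move the couch
solo = p_alone(t)             # chance of managing it alone
naive_pair = 2 * solo         # the "n times p" guess for two people
actual_pair = p_pair(t)       # chance when efforts combine

print(solo, naive_pair, actual_pair)
```

Under this toy model the pair's chance of success is several times larger than twice the individual chance, so multiplying an individual probability by the number of collaborators can badly understate what a group can do.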

2Vladimir_Nesov10yAgreed, I was not thinking clearly. The original comment [http://lesswrong.com/lw/2lr/the_importance_of_selfdoubt/2hb5?c=1] stands, since what you suggest is one way to dissolve the apparent inconsistency, but my elaboration [http://lesswrong.com/lw/2lr/the_importance_of_selfdoubt/2hc2?c=1] was not lucid.

How, in a post-AGI world, would you define wealth? Computational resources? Matter?

I don't think there's any foundation for speculation on this topic at this time.

2khafra10yUnless we get a hard-takeoff singleton, which is admittedly the SIAI expectation, there will be massive inequality, with a few very wealthy beings and average income barely above subsistence. Thus saith Robin Hanson [http://www.google.com/search?q=wealth+future+subsistence+site:overcomingbias.com] , and I've never seen any significant holes poked in that thesis.
1Vladimir_Nesov10yControl, owned by preferences.

As far as I know he has no experience with narrow AI research. I see familiarity with narrow AI as a prerequisite to AGI research.

Most things can be studied through the use of textbooks. Some familiarity with AI is certainly helpful, but it seems that most AI-related knowledge is not on the track to FAI (and most current AGI stuff is nonsense or even madness).

1multifoliaterose10yThe reason that I see familiarity with narrow AI as a prerequisite to AGI research is to get a sense of the difficulties present in designing machines to complete certain mundane tasks. My thinking is the same as that of Scott Aaronson in his The Singularity Is Far [http://scottaaronson.com/blog/?p=346] posting: "there are vastly easier prerequisite questions that we already don’t know how to answer."
2Vladimir_Nesov10yFAI research is not AGI research, at least not at present, when we still don't know what it is exactly that our AGI will need to work towards, how to formally define human preference [http://causalityrelay.wordpress.com/2010/04/18/preference-of-programs/].
1multifoliaterose10ySo, my impression is that you and Eliezer have different views of this matter. My impression is that Eliezer's goal is for SIAI to actually build an AGI unilaterally. That's where my low probability was coming from. It seems much more feasible to develop a definition of friendliness and then get governments to mandate that it be implemented in any AI or something like that. As I've said, I find your position sophisticated and respect it. I have to think more about your present point - reflecting on it may indeed alter my thinking about this matter.
6Vladimir_Nesov10yStill, build AGI eventually, and not now. Expertise in AI/AGI is of low relevance at present. It seems obviously infeasible to me that governments will chance upon this level of rationality. Also, we are clearly not on the same page if you say things like "implement in any AI". Friendliness is not to be "installed in AIs", Friendliness is the AI (modulo initial optimizations necessary to get the algorithm going and self-optimizing, however fast or slow that's possible). The AGI part of FAI is exclusively about optimizing the definition of Friendliness (as an algorithm), not about building individual AIs with standardized goals. See also this post [http://causalityrelay.wordpress.com/2010/01/24/fai-vector-for-human-preference/] for a longer explanation of why weak-minded AIs are not fit to carry the definition of Friendliness. In short, such AIs are (in principle) as much an existential danger as human AI researchers.
2Wei_Dai10yI wonder if we systematically underestimate the level of rationality of major governments. Historically, they haven't done that badly. From an article about RAND [http://www.atharosama.com/the-original-think-tank-an-article-about-rand-my-almamater] : (Huh, this is the first time I've heard of the Delphi Method [http://en.wikipedia.org/wiki/Delphi_method].) Many of the big names in game theory (von Neumann, Nash, Shapley, Schelling) worked for RAND at some point, and developed their ideas there.
1gwern10yRAND has a lot of good work (I like their recent reports on Iran), but keep in mind that big misses can undo a lot of their credit; for example, even RAND acknowledges (in their retrospective published this year or last) that they screwed up massively with Vietnam.
1mattnewport10yThis is not really a relevant example in the context of Vladimir_Nesov's comment. Certain government-funded groups (often within the military, interestingly) have on occasion shown decent levels of rationality. However, the suggestion that he was replying to, to "develop a definition of friendliness and then get governments to mandate that it be implemented in any AI or something like that," requires rational government policy making / law making rather than rare pockets of rationality within government-funded institutions. That is something that is essentially non-existent in modern democracies.
2Vladimir_Nesov10yIt's not adequate to "get governments to mandate that [Friendliness] be implemented in any AI", because Friendliness is not a robot-building standard - refer to the rest of my comment. The statement about government rationality was more tangential, about governments doing anything at all concerning such a strange topic, and wasn't meant to imply that this particular decision would be rational.
1rhollerith_dot_com10yData point: the internet is almost completely a creation of government. Some say entrepreneurs and corporations played a large role, but except for corporations that specialize in doing contracts for the government, they did not begin to exert a significant effect till 1993, whereas government spending on research that led to the internet began in 1960, and the direct predecessor to the internet (the ARPAnet) became operational in 1969. Both RAND and the internet were created by the part of the government most involved in an enterprise (namely, the arms race during the Cold War) on which depended the long-term survival of the nation in the eyes of most decision makers (including voters and juries). EDIT: significant backpedalling in response to downvotes in my second paragraph.

Close. Actually, I had looked at the first part of the comment and then written my response under the delusion that wedrifid had been the OP.

I am now going to edit my comment to cleanly replace the mistaken "you" with "multi"