Book Review: How Minds Change

(Sorry, it doesn't look like the conservatives have caught on to this kind of approach yet.)

Actually, if you look at religious proselytization, you'll find that these techniques are all pretty well-known, albeit under different names and with different purposes. And while this isn't actually synonymous with political canvassing, it often has political spillover effects.

If you wanted, one could argue this the other way: left-oriented activism is more like proselytization than it is factual persuasion. And LessWrong, in particular, has a ton of quasi-religious elements, which means that its recruitment strategy necessarily looks a lot like evangelism.

[-]Jim Pivarski3y105

And even more deeply than door-to-door conversations, political and religious beliefs spread through long-term friend and romantic relationships, even unintentionally.

I can attest to this first-hand because I converted from atheism to Catholicism (25 years ago) by the unintended example of my girlfriend-then-wife, and then I saw the pattern repeat as a volunteer in RCIA, an education program for people who have decided to become Catholic (during the months before confirmation), and pre-Cana, another program for couples who plan to be married in the church (also months-long). The pattern in which a romantic relationship among different-religion (including no-religion) couples eventually ends up with one or the other converting is extremely common. I'd say that maybe 90% of the people in RCIA had a Catholic significant other, and maybe 40% of the couples in pre-Cana were mixed couples that became both-Catholic. What this vantage point didn't show me was the fraction in which the Catholic member of the couple converted away or maybe just got less involved and decided against being married Catholic (and therefore no pre-Cana). I assume that happens approximately as often. But it still shows that being friends or more than friends is an extremely strong motivator for changing one's views, whichever direction it goes.

Since it happened to me personally, the key thing in my case was that I didn't start with a clear idea of what Catholics (or some Catholics, anyway) actually believe. In reading this article and the ones linked from it, I came to Talking Snakes: A Cautionary Tale, which illustrates the point very well: scottalexander quoted Bill Maher as saying that Christians believe that sin was caused by a talking snake, and scottalexander himself got into a conversation with a Muslim in Cairo who thought he believed that monkeys turned into humans. Both are wild caricatures of what someone else believes, or at least a way of phrasing it that leads to the wrong mental image. In other words, miscommunication. What I found when I spent a lot of time with a Catholic—who wasn't trying to convert me—was that what some Catholics (can't attest for all of them) meant by the statements in their creed isn't at all the ridiculous things that written creed could be made to sound like.

In general, that point of view is the one Yudkowsky dismissed in Outside the Laboratory, which is to say that physical and religious statements are in different reality-boxes, but he dismissed it out of hand. Maybe there are large groups of people who interpret religious statements the same way they interpret the front page of the newspaper, but it would take a long-term relationship, with continuous communication, to even find out if that is true, for a specific individual. They might say that they're biblical literalists on the web or fill out surveys that way, but what someone means by their words can be very surprising. (Which is to say, philosophy is hard.) Incidentally, another group I was involved in, a Faith and Reason study group in which all of the members were grad students in the physical sciences, couldn't even find anyone who believed in religious claims that countered physical facts. Our social networks didn't include any.

Long-term, empathic communication trades the birds-eye view of surveys for narrow depth. Surely, the people I've come in contact with are not representative of the whole, but they're not crazy, either.

[-]Tim Freeman3y21

If you are Catholic, or remember being Catholic, and you're here, maybe you can explain something for me.

How do you reconcile God's benevolence and omnipotence with His communication patterns? Specifically: I assume you believe that the Good News was delivered at one specific place and time in the world, and then allowed to spread by natural means. God could have given everyone decent evidence that Jesus existed and was important, and God could have spread that information by some reliable means. I could imagine a trickster God playing games with an important message like that, but the Christian God is assumed to be good, not a trickster. How do you deal with this?

[-]Jim Pivarski3y41

I'm still Catholic. I was answering your question and it got long, so I moved it to a post: Answer to a question: what do I think about God's communication patterns?

[-]Charles M3y82

The whole technique of asking peoples' opinion and repeating it back to them is extraordinarily effective with respect to currently in-fashion gender ideology. "What is a Woman" did just that; let people explain themselves in their own words and calmly and politely repeated it back. They hung themselves with no counter argument whatsoever. Now, whether they ever actually changed their mind is another thing.

I think you could do the same in the climate change context, though it's not quite as easy.

[-]mruwnik3y32

Jehovah's Witnesses are what first came to mind when reading the OP. They're sort of synonymous with going door to door in order to have conversations with people, often saying that they're willing for their minds to be changed through respectful discussions. They also are one of few christian-adjacent sects (for lack of a more precise description) to actually show large growth (at least in the west).

[-]bc4026bd4aaa5b7fe3y20

This is absolutely a fair point that I did not think about. All of David's examples in the book are left-ish-leaning and I was mostly basing it on those. My goal with that sentence was to just lampshade that fact.

[-]Tim Freeman3y1713

In response to "The real problem is humanity's lack of rationalist skills. We have bad epistemology, bad meta-ethics, and we don't update our beliefs based on evidence.":

Another missing rationalist skill is having some sensible way to decide who to trust. This is necessary because there isn't time to be rational about all topics. At best you can dig at the truth of a few important issues and trust friends to give you accurate beliefs about the rest. This failure has many ramifications:

The SBF/FTX fiasco.
I quit LessWrong for some years in part because there were people there who were arguing in bad faith and the existing mechanisms to control my exposure to such people were ineffective.
Automata and professional trolls lie freely on social media with no effective means to stop them.
On a larger scale, bad decisions about who to trust lead to perpetuation of religion, bad decisionmaking around Covid, and many other beliefs held mostly by people who haven't taken the time to attempt to be rational about them.

[-]Ruby3y138

Curated. I like this post taking LessWrong back to its roots of trying to get us humans to reason better and believe truth things. I think we need that now as much as we did in 2009, and I fear that my own beliefs have become ossified through identity and social commitment, etc. LessWrong now talks a lot of about AI, and AI is increasingly a political topic (this post is a little political in a way I don't want to put front and center but I'll curate anyway), which means recalling the ways our minds get stuck and exploring ways to ask ourselves questions in ways where the answer could come back different.

[-]toothpaste3y34

Won't the goal of getting humans to reason better necessarily turn political at a certain point? After all, if there is one side of an issue that is decidedly better from some ethical perspective we have accepted, won't the rationalist have to advocate that side? Won't refraining from taking political action then be unethical? This line of reasoning might need a little bit of reinforcement to be properly convincing, but it's just to make the point that it seems to me that since political action is action, having a space cover rationality and ethics and not politics would be stifling a (very consequential) part of the discussion.

I'm not here very frequently, I just really like political theory and have seen around the site that you guys try to not discuss it too much. Not very common to find a good place to discuss it, as one would expect. But I'd love to find one!

[-]ryan_b3y60

Won't the goal of getting humans to reason better necessarily turn political at a certain point?

Trivially, yes. Among other things, we would like politicians to reason better, and for everyone to profit thereby.

I'm not here very frequently, I just really like political theory and have seen around the site that you guys try to not discuss it too much.

As it happens, this significantly predates the current political environment. Minimizing talking about politics, in the American political party horse-race sense, is one of our foundational taboos. It is not so strong anymore - once even a relevant keyword without appropriate caveats would pile on downvotes and excoriation in the comments - but for your historical interest the relevant essay is Politics Is The Mind-Killer. You can search that phrase, or similar ones like "mind-killed" or "arguments are soldiers" to get a sense of how it went. The basic idea was that while we are all new at this rationality business, we should try to avoid talking about things that are especially irrational.

Of course at the same time the website was big on atheism, which is an irony we eventually recognized and corrected. The anti-politics taboo softened enough to allow talking about theory, and mechanisms, and even non-flashpoint policy (see the AI regulation posts). We also added things like arguing about whether or not god exists to the taboo list. There was a bunch of other developments too, but that's the directional gist.

Happily for you and me both, political theory tackled well as theory finds a good reception here. As an example I submit A voting theory primer for rationalists and the follow-up posts by Jameson Quinn. All of these are on the subject of theories of voting, including discussing some real life examples of orgs and campaigns on the subject, and the whole thing is one of my favorite chunks of writing on the site.

[-]mruwnik3y20

It depends what you mean by political. If you mean something like "people should act on their convictions" then sure. But you don't have to actually go in to politics to do that, the assumption being that if everyone is sane, they will implement sane policies (with the obvious caveats of Moloch, Goodhart etc.).

If you mean something like "we should get together and actively work on methods to force (or at least strongly encourage) people to be better", then very much no. Or rather it gets complicated fast.

[-]bc4026bd4aaa5b7fe3y20

Thank you. I don't think it's possible to review this book without talking a bit about politics, given that so many of the techniques were forged and refined via political canvassing, but I also don't think that's the main takeaway, and I hope this introduced some good ideas to the community.

[-]simon3y*13-16

This post has a lot of great points.

But one thing that mars it for me to some extent is the discussion of OpenAI.

To me, the criticism of OpenAI feels like it's intended as a tribal signifier, like "hey look, I am of the tribe that is against OpenAI".

Now maybe that's unfair and you had no intention of anything like that and my vibe detection is off, but if I get that impression, I think it's reasonably likely that OpenAI decisionmakers would get the same impression, and I presume that's exactly what you don't want based on the rest of the post.

And even leaving aside practical considerations, I don't think OpenAI warrants being treated as the leading example of rationality failure.

First, I am not convinced that the alternative to OpenAI existing is the absence of a capabilities race. I think, in contrast, that a capabilities race was inevitable and that the fact that the leading AI lab has as decent a plan as it does is potentially a major win by the rationality community.

Also, while OpenAI's plans so far look inadequate, to me they look considerably more sane than MIRI's proposal to attempt a pivotal act with non-human-values-aligned AI. There's also potential for OpenAI's plans to be improved as more knowledge on mitigating AI risk is obtained, which is helped by their relatively serious attitude as compared to, for example, ~~Google after their recent reorganization.~~ Meta.

And while OpenAI is creating a race dynamic by getting ahead, IMO MIRI's pivotal act plan would be creating a far worse race dynamic if they were showing signs of being able to pull it off anytime soon.

I know many others don't disagree, but I think that there is enough of a case for OpenAI being less bad than potential alternatives to feel using it as if it were an uncontroversial bad thing detracts from the post.

[-]bc4026bd4aaa5b7fe3y61

I single it out because Yudkowsky singled it out and seems to see it as a major negative consequence to the goals he was trying to achieve with the community.

[-]zrezzed3y100

This isn't where the community is supposed to have ended up. If rationality is systematized winning, then the community has failed to be rational.

Great post, and timely, for me personally. I found myself having similar thoughts recently, and this was a large part of why I recently decided to start engaging with the community more (so apologies for coming on strong in my first comment, while likely lacking good norms).

Some questions I'm trying to answer, and this post certainly helps a bit:

Is there general consensus on the "goals" of the rationalist community? I feel like there implicitly is something like "learn and practice rationality as a human" and "debate and engage well to co-develop valuable ideas".
Would a goal more like "helping raise the overall sanity waterline" ultimately be a more useful, and successful "purpose" for this community? I potentially think so. Among other reasons, as bc4026bd4aaa5b7fe points out, there are a number of forces that trend this community towards being insular, and an explicit goal against that tendency would be useful.

[-]DirectedEvolution3y81

Just noting a point of confusion - if changing minds is a social endeavor having to do with personal connection, why is it necessary to get people to engage System 2/Central Route thinking? Isn’t the main thing to get them involved in a social group where the desired beliefs are normal and let System 1/Peripheral Route thinking continue to do its work?

[-]AnthonyC3y72

If I understand correctly I think it's more that system 1/peripheral route thinking can get someone to affectively endorse an idea without forming a deeper understanding of it, whereas system 2/central route thinking can produce deeper understanding, but many (most?) people need to feel sufficiently psychologically and socially safe/among friends to engage in that kind of thinking.

[-]Seth Herd3y10

I think you are absolutely correct that getting someone involved in a social group where everyone already has those ideas would be better at changing minds. But that's way harder than getting someone to have a ten-minute conversation. In fact, it's so hard that I don't think it's ever been studied experimentally. Hopefully I'm wrong and there are limited studies; but I've looked for them and not found them (~5 years ago).

I'd frame it this way: what you're doing in that interview is supplying the motivation to do System 2 thinking. The Socratic method is about asking people the same questions they'd ask themselves if they cared enough about that topic, and had the reasoning skills to reach the truth.

[-]Daniel Kokotajlo3y80

Great post, will buy the book and take a look!

I feel like I vaguely recall reading somewhere that some sort of california canvassing to promote gay rights experiment either didn't replicate or turned out to be outright fraud. Wish I could remember the details. It wasn't the experiment you are talking about though hopefully?

[-]Steven Byrnes3y*110

I just started the audiobook myself, and in the part I’m up to the author mentioned that there was a study of deep canvassing that was very bad and got retracted, but then later, there was a different group of scientists who studied deep canvassing, more on which later in the book. (I haven’t gotten to the “later in the book” yet.)

Wikipedia seems to support that story, saying that the first guy was just making up data (see more specifically "When contact changes minds" on wikipedia).

“If a fraudulent paper says the sky is blue, that doesn’t mean it’s green” :)

UPDATE: Yeah, my current impression is that the first study was just fabricated data. It wasn't that the data showed bad results so he massaged it, more like he never bothered to get data in the first place. The second study found impressive results (supposedly - I didn't scrutinize the methodology or anything) and I don't think the first study should cast doubt on the second study.

[-]bc4026bd4aaa5b7fe3y30

Thanks for the summary. Yes, David addresses this in the book. There was an unfortunately fraudulent paper published due to (IIRC) the actions of a grad student, but the professors involved retracted the original paper and later research reaffirmed the approach did work.

[-]Alan E Dunne3y51

https://statmodeling.stat.columbia.edu/2015/12/16/lacour-and-green-1-this-american-life-0/

and generally "beware the one of just one study"

[-]Seth Herd3y20

I read this. This is about the first study, which was retracted. However, a second, carefully monitored and reviewed study found most of the same results, including the remarkably high effect size of one in ten people appearing to completely drop their prejudice toward homosexuals after the ten-minute intervention.

Yes beware the one study. But in the absence of data, small amounts are worth a good deal, and careful reasoning from other evidence is worth even more.

My reasoning from indirect data and personal experience are in line with this one study. The 900 studies on how minds don't change are almost all about impersonal, data-and-argument based approaches.

Emotions affect how we make and change beliefs. You can't force someone to change their mind, but they can and do change their mind when they happen to think through an issue without being emotionally motivated to keep their current belief.

[-]bc4026bd4aaa5b7fe3y10

As mentioned above, David addresses this in the book. There was an unfortunately fraudulent paper published due to (IIRC) the actions of a grad student, but the professors involved retracted the original paper and later research reaffirmed the approach did work.

[-]PeterMcCluskey3y61

Many rationalists do follow something resembling the book's advice.

CFAR started out with too much emphasis on lecturing people, but quickly noticed that wasn't working, and pivoted to more emphasis on listening to people and making them feel comfortable. This is somewhat hard to see if you only know the rationalist movement via its online presence.

Eliezer is far from being the world's best listener, and that likely contributed to some failures in promoting rationality. But he did attract and encourage people who overcame his shortcomings for CFAR's in-person promotion of rationality.

I consider it pretty likely that CFAR's influence has caused OpenAI to act more reasonably than it otherwise would act, due to several OpenAI employees having attended CFAR workshops.

It seems premature to conclude that rationalists have failed, or that OpenAI's existence is bad.

Sorry, it doesn’t look like the conservatives have caught on to this kind of approach yet.

That's not consistent with my experiences interacting with conservatives. (If you're evaluating conservatives via broadcast online messages, I wouldn't expect you to see anything more than tribal signaling).

It may be uncommon for conservatives to use effective approaches at explicitly changing political beliefs. That's partly because politics are less central to conservative lives. You'd likely reach a more nuanced conclusion if you compare how Mormons persuade people to join their religion, which incidentally persuades people to become more conservative.

[-]mukashi3y20

Any source you would recommend to know more about the specific practices of Mormons you are referring to?

[-]PeterMcCluskey3y20

No. I found a claim of good results here. Beyond that I'm relying on vague impressions from very indirect sources, plus fictional evidence such as the movie Latter Days.

[-]bc4026bd4aaa5b7fe3y10

Fair enough, I haven't interacted with CFAR at all. And the "rationalists have failed" framing is admittedly partly bait to keep you reading, partly parroting/interpreting how Yudkowsky appears to see his efforts towards AI Safety, and partly me projecting my own AI anxieties out there.

The Overton window around AI has also been shifting so quickly that this article may already be kind of outdated. (Although I think the core message is still strong.)

Someone else in the comments pointed out the religious proselytization angle, and yeah, I hadn't thought about that, and apparently neither did David. That line was basically a throwaway joke lampshading how all the organizations discussed in the book are left-leaning, I don't endorse it very strongly.

[-]Thomas Kwa3y4-1

It's not just his fiction. Recently he went on what he thought was a low-stakes crypto podcast and was surprised that the hosts wanted to actually hear him out when he said we were all going to die soon:

I don't think we can take this as evidence that Yudkowsky or the average rationalist "underestimates more average people". In the Bankless podcast, Eliezer was not trying to do anything like trying to explore the beliefs of the podcast hosts, just explaining his views. And there have been attempts at outreach before. If Bankless was evidence towards "the world at large is interested in Eliezer's ideas and takes them seriously", The Alignment Problem and Human Compatible and rejection of FDT from academic decision theory journals is stronger evidence against. It seems to me that the lesson we should gather is that alignment's time in the public consciousness has come sometime in the last ~6 months.

I'm also not sure the techniques are asymmetric.

Have people with false beliefs tried e.g. Street Epistemology and found it to fail?
I think few of us in the alignment community are actually in a position to change our minds about whether alignment is worth working on. With a p(doom) of ~35% I think it's unlikely that arguments alone push me below the ~5% threshold where working on AI misuse, biosecurity, etc. become competitive with alignment. And there are people with p(doom) of >85%.

That said it seems likely that rationalists should be incredibly embarrassed for not realizing the potential asymmetric weapons in things like Street Epistemology. I'd make a Manifold market for it, but I can't think of a good operationalization.

[-]Thomas Kwa2y70

I think few of us in the alignment community are actually in a position to change our minds about whether alignment is worth working on. With a p(doom) of ~35% I think it's unlikely that arguments alone push me below the ~5% threshold where working on AI misuse, biosecurity, etc. become competitive with alignment. And there are people with p(doom) of >85%.

I have changed my mind and now think some of the core arguments for x-risk don't go through, so it's plausible that I go below 5% if there is continued success in alignment-related ML fields and could substantially change my mind from a single conversation.

[-]cata3y75

I think few of us in the alignment community are actually in a position to change our minds about whether alignment is worth working on. With a p(doom) of ~35% I think it's unlikely that arguments alone push me below the ~5% threshold where working on AI misuse, biosecurity, etc. become competitive with alignment. And there are people with p(doom) of >85%.

This makes little sense to me, since "what should I do" isn't a function of p(doom). It's a function of both p(doom) and your inclinations, opportunities, and comparative advantages. There should be many people for whom, rationally speaking, a difference between 35% and 34% should change their ideal behavior.

[-]Thomas Kwa3y20

Thanks, I agree. I would still make the weaker claim that more than half the people in alignment are very unlikely to change their career prioritization from Street Epistemology-style conversations, and that in general the person with more information / prior exposure to the arguments will be less likely to change their mind.

[-]romeostevensit3y30

How identification works, afaict: one of the ways alienation works is by directly invalidating people's experiences or encouraging preference falsification about how much they prefer being in the shape of a good, interchangeable worker, which is a form of indirectly invalidating your own experience, especially experiences of suffering. People feel that their own experience is not a valid source of sovereignty and so they are encouraged to invest their experience into larger, more socially legible and accepted constructs. This construct needs to be immutable and therefore more difficult to attack. But this creates a problem, the socially constructed categories are now under disputed ownership from many people each trying to define or control it in a way advantageous to themselves, and to avoid running into the same problem they had originally. Namely that if others control the category definition, then they will again feel like their experience is invalidated. So now you have activation of fighting/tribal circuitry in the people involved.

The next layer up is particularly sticky and difficult, something like "I am being attacked because I am X." Attempting to defuse this construct is perceived as an attempt to invalidate/erase their experience, rather than an attempt to show them how constructing things in this way both causes them suffering, and encourages them to spread this shape of suffering to others and fight hard for it, along with not providing them a good predictive model of the world that would allow them to update towards more useful actions and mental representations.

Attempting to defuse this structure directly is extraordinarily difficult, and I have mostly found success only with techniques that first encourage the person to exit the tight network of concepts involved and return to direct experience (the classic, you are racist against this group, but here is such a person in front of you, notice how your direct experience is that you can talk to them and get back reasonable human sounding things rather than the caricature you expect).

[-]romeostevensit3y40

A nice compression I hadn't thought of before is that moral categories and group categories are not type safe within the brain.

[-]testingthewaters1mo20

Great review and post, leaves me with a lot more hope for positive, non-coercive, and non-guilting/brow-beating change in beliefs. I read the book before reading your review and agree with your summary, and I would go so far as thanking you for raising/summarising points made in the book that I didn't get during my own read-through. At this point I have a pretty firm belief that (as they say in Inception) positive motivation is stronger than negative motivation, at least for the purposes of long-term, intentional activities like cultivating an open attitude to facts and reason in the self.

[-]ryan_b3y20

With AI chatbots behaving badly around the world

Welp, I guess it is time to take a look at how to make a good-faith actual-mind-changing chatbot now.

[-]ErisApprentice2y10

Old post, but wanted to comment because I just read this book and was thinking of writing a post on exactly the same topic! What are the chances? I wonder if I have anything to contribute - even though this article was written a year ago, I don't feel like there has been much of a shift in the rationality community overall.

[-]Review Bot2y*10

The LessWrong Review runs every year to select the posts that have most stood the test of time. This post is not yet eligible for review, but will be at the end of 2024. The top fifty or so posts are featured prominently on the site throughout the year.

Hopefully, the review is better than karma at judging enduring value. If we have accurate prediction markets on the review results, maybe we can have better incentives on LessWrong today. Will this post make the top fifty?

[-]Fleece Minutia2y10

Thanks for the review! It got me to buy and then devour the book. It was a great read; entertaining and providing a range of useful mental models. Applying McRaney's amalgam Street Epistemology is hard work, but makes for exceedingly interesting and very friendly conversations about touchy subjects - I am happy to have this tool in my life now.

[-]bc4026bd4aaa5b7fe3y10

The mods seem to have shortened my account name. For the record, it was previously bc4026bd4aaa5b7fe0bdcd47da7a22b453953f990d35286b9d315a619b23667a

[-]Collapse Kitty3y10

I am just the kind of person you describe. I've committed nearly all spare time over the last few years to understanding the core issues of alignment, and the deeper flaws of Moloch/collective action leading to the polycrisis.

What does the next step look like? Social media participation? I don't think that's a viable battleground.

There are dozens of AI experts now giving voice to existential dangers. What angle or strategy is not yet being employed by them that we might back?

What are the best causes to throw one's efforts behind right now? Are there trustworthy companies/non-profits that need volunteers? What is our call to action for those we do inform about the catastrophic risks humanity is facing?

Sorry about the rough formatting and rambling train of thought. I'm eager to commit more of my life to working toward whatever strategy seems most viable, but feel quite lost at the moment.

[-]bc4026bd4aaa5b7fe3y10

I don't have an answer for you, you'll have to chart your own path. I will say that I agree with your take on social media, it seems very peripheral-route-focused.

If you're looking to do something practical on AI consider looking into a career counseling organization like 80000 Hours. From what I've seen, they fall into some of the traps I mentioned here (seems like they mostly think that trying to change people's minds isn't very valuable unless you actively want to become a political lobbyist) but they're not bad overall for answering these kinds of questions.

Ultimately though, it's your own life and you alone have to decide your path through it.

[-]Raphael Royo-Reece3y10

I cannot agree with this post enough. One of the things that stops me despairing and keeps me trying to convince people of AI is that I think Eliezer is overly pessimistic when it comes to people taking threats seriously.

He has a lot of reasons to be - I mean he's been harping for 20 years and only now does there seem to be traction - but that is where I think it's likiest there is a way for us to avoid doom.

[-]Jeffs3y*10

I would love to see a video or transcript of this technique in action in a 1:1 conversation about ai x-risk.

Answer to my own question: https://www.youtube.com/watch?v=0VBowPUluPc

[-]Rheagan Zambelli3y-1-1

If there were no demons, perhaps there would be no need for the rituals found in religion to dispel them….

[-]CronoDAS3y20

What do you mean by demons? Literal supernatural creatures, a metaphor for psychological anxieties, or something else entirely?

[-]toothpaste3y-10

You claim that the point of the rationalist community was to stop an unfriendly AGI. One thing that confuses me is exactly how it intends to do so, because that certainly wasn't my impression of it. I can see the current strategy making sense if the goal is to develop some sort of Canon for AI Ethics that researchers and professionals in the field get exposed to, thus influencing their views and decreasing the probability of catastrophe. But is it really so?

If the goal is to do it by shifting public opinion in this particular issue, by making a majority of people rationalists, or by making political change and regulations, it isn't immediately obvious to me. And I would bet against it because institutions that for a long time have been following those strategies with success, from marketing firms to political parties to lobbying firms to scientology, seem to operate very differently, as this post also implies.

If the goal is to do so by converting most people to rationalism (in a strict sense), I'd say I very much disagree with that being likely or maybe even a desirable effort. I'd love to discuss this subject here in more detail and have my ideas go through the grinder, but I've found this place to be very hard to penetrate, so I'm rarely here.

[-]mruwnik3y52

The answer is to read the sequences (I'm not being facetious). They were written with the explicit goal of producing people with EY's rationality skills in order for them to go into producing Friendly AI (as it was called then). It provides a basis for people to realize why most approaches will by default lead to doom.

At the same time, it seems like a generally good thing for people to be as rational as possible, in order to avoid the myriad cognitive biases and problems that plague humanities thinking, and therefore actions. My impression is that the hope was to make the world more similar to Dath Ilan.

[-]toothpaste3y10

So the idea is that if you get as many people in the AI business/research and as possible to read the sequences, then that will change their ideas in a way that will make them work in AI in a safer way, and that will avoid doom?

I'm just trying to understand how exactly the mechanism that will lead to the desired change is supposed to work.

If that is the case, I would say the critique made by OP is really on point. I don't believe the current approach is convincing many people to read the sequences, and I also think reading the sequences won't necessarily make people change their actions when business/economic/social incentives work otherwise. The latter being unavoidably a regulatory problem, and the former a communications strategy problem.

Or are you telling me to read the sequences? I intend to sometime, I just have a bunch of stuff to read already and I'm not exactly good at reading a lot consistently. I don't deny having good material on the subject is not essential either.

[-]mruwnik3y10

More that you get as many people in general to read the sequences, which will change their thinking so they make fewer mistakes, which in turn will make more people aware both of the real risks underlying superintelligence, but also of the plausibility and utility of AI. I wasn't around then, so this is just my interpretation of what I read post-facto, but I get the impression that people were a lot less doomish then. There was a hope that alignment was totally solvable.

The focus didn't seem to be on getting people into alignment, as much as it generally being better for people to think better. AI isn't pushed as something everyone should do - rather as what EY knows - but something worth investigating. There are various places where it's said that everyone could use more rationality, that it's an instrumental goal like earning more money. There's an idea of creating Rationality Dojos, as places to learn rationality like people learn martial arts. I believe that's the source of CFAR.

It's not that the one and only goal of the rationalist community was to stop an unfriendly AGI. It's just that is the obvious result of it. It's a matter of taking the idea seriously, then shutting up and multiplying - assuming that AI risk is a real issue, it's pretty obvious that it's the most pressing problem facing humanity, which means that if you can actually help, you should step up.

Business/economic/social incentives can work, no doubt about that. The issue is that they only work as long as they're applied. Actually caring about an issue (as in really care, like oppressed christian level, not performance cultural christian level) is a lot more lasting, in that if the incentives disappear, they'll keep on doing what you want. Convincing is a lot harder, though, which I'm guessing is your point? I agree that convincing is less effective numerically speaking, but it seems a lot more good (in a moral sense), which also seems important. Though this is admittedly a lot more of an aesthetics thing...

I most certainly recommend reading the sequences, but by no means meant to imply that you must. Just that stopping an unfriendly AGI (or rather the desirability of creating an friendly AI) permeates the sequences. I don't recall if it's stated explicitly, but it's obvious that they're pushing you in that direction. I believe Scott Alexander described the sequences as being totally mind blowing the first time he read them, but totally obvious on rereading them - I don't know which would be your reaction. You can try the highlights rather than the whole thing, which should be a lot quicker.

[+]Don Salmon3y-50

LESSWRONG
is fundraising!
LW

LESSWRONG
is fundraising!
LW

327

Book Review: How Minds Change

327

327