I Was Not Almost Wrong But I Was Almost Right: Close-Call Counterfactuals and Bias

[-]fubarobfusco14y380

This reminds me of the notion of a premortem — an exercise for identifying weaknesses in a plan by asking you to imagine that you have implemented your plan and that it has failed. Why did it fail? By envisioning your future self conducting a postmortem on the failed plan, you might be able to identify weaknesses without going to all the expense of implementing it and failing.

[-]Kaj_Sotala14y250

So, feedback requested on the Dr. Zany thing. Made an otherwise dry post more interesting to read, or pointless and distracting?

[-]kilobug14y250

I liked it, it's always good to have an example, it makes reading more pleasant, and it helps to update with the info (not to understand what you are saying, but to propagate the new knowledge into your belief network). Or at least it does to me.

[-]atucker14y40

I entirely agree with this, but am writing a comment in addition to an upvote of the above just to make my appreciation towards Kaj for the Dr. Zany thing more salient to him.

[-]Kaj_Sotala14y10

It worked. Thanks. :-)

[-]Spurlock14y150

Liked it, but thought there was a bit too much of it (e.g. the blue-minimizing robot reference). Might be better to leave out details that don't help you illustrate your point, lest the reader get a sense that your example isn't going anywhere.

[-]Eliezer Yudkowsky14y110

Check. It was a good idea, but could've and should've been shortened. I skimmed it, and my guess is that it could've been set up in one or two paragraphs if only the minimum of required detail had been included.

[-]AspiringKnitter14y40

I disagree that there were too many extraneous details about Dr. Zany in this post. They didn't detract from the value of the post and, at least, the blue-minimizing robot reference was funny.

[-]woodchuck6414y140

Challenge the mutability of the antecedent. Since AS-01's counterfactual is of the form ”if A, then B”, Dr. Zany could question the plausibility of A.

brain balks at "mutability", stumbles over "antecedent", sprains ankle on "counterfactual"

”Baloney!” exclaims Dr. Zany. ”No TV reporter could ever have wandered past, let alone seen the robbery!”

Oh, I get it! Brain jumps up and down with glee.

I found it helpful and entertaining.

[-]MaoShan14y130

Well, it did make me more likely to accept your theory. After all...

"More fine-grained scenarios offer an opportunity to tell more detailed stories, and humans give disproportionate weight to detailed stories."

[-]Armok_GoB14y90

Would not have gotten through the post without it.

[-]Troshen14y80

Made the post more interesting to read.

[-]taw14y60

It was good, but mostly because it provided some nice examples.

[-]Normal_Anomaly14y60

I found it awkward and weird, especially the bits with the assistant. But it looks like you and some readers had fun, so I don't mind if you keep doing it.

[-]Ezekiel14y60

Made it more interesting, to me at least. I probably wouldn't have had the focus to get through the article otherwise.

[-]Solvent14y50

It was cute, particularly the conclusion.

[-]CarlShulman14y40

Annoying, at least for me.

[-]NancyLebovitz14y40

Somewhat more interesting. I'm not sure about the brain in a vat threat-- I'm pulled between "that's really creepy if it's read literally-- is she trapped there?" and the tone of "this is lightweight humor, you're supposed to read it with most of your empathy turned off".

[-]AspiringKnitter14y90

"this is lightweight humor, you're supposed to read it with most of your empathy turned off"

Ever since reading The Sword of Good, I've lost the ability to do that. Not that I was ever great at it. I wonder if that's happened to anyone else. /irrelevant tangent

[-]Will_Newsome14y40

Spoiler alert: People who've seen the movie Silent Hill might enjoy this comment. Vg jnf jrveq ubj n fznyy er-nffrffzrag bs gur cerzvfrf bs gur svyz znqr zr tb sebz "lrnuuuuu tb Fngna, xvyy nyy gubfr ovtbgrq Puevfgvna fgnaq-vaf!" gb "bu zl Tbq V jnf whfg purrevat nf gur qrivy znffnperq n ohapu bs cngurgvp fpnerq puhepu crbcyr va fbzr tbqsbefnxra yvzob jbeyq, gung'f nobhg nf Rivy nf vg trgf, jul nz V fb sevttva' vzcerffvbanoyr".

[-]Oligopsony14y60

the tone of "this is lightweight humor, you're supposed to read it with most of your empathy turned off".

This is hardly an original thought, but I wonder how much work this does in ethical thought experiments.

[-]David_Gerard14y30

Worked. Good character. Do please use him again.

[-]JoachimSchipper14y30

Slightly distracting, but worth it.

(On the other hand, the female assistant set off some gender-stereotypes-icky warning bells in me. Despite your obvious attempts at avoiding this. I'm probably just projecting some unfavourable impressions of the source material on your adaptation, but you may still want to be aware of this possibility.)

[-]Kaj_Sotala14y90

Oddly, I made the assistant female partially because having a mad scientist with a male assistant (Igor fetch brains, master...) felt too stereotypical.

I also considered making Dr. Zany himself female, but there the character felt so strongly male that my brain just wouldn't go along with it.

[-]JoshuaZ14y40

Strongly agreed. That aspect also seemed bad because the assistant being labeled like a robot while funny sounded almost like some form of symbolic objectification. And the fact that her main talent she's valued for is the ability to make sandwiches rather than say help tweak the ray guns or Tesla coils strongly didn't help matters.

[-]Kaj_Sotala14y60

But note that her being valued mostly for the sandwiches says more about Dr. Zany's attitude than about how things really are, and she's strongly implied to be the more competent of the two...

[-]GreenRoot14y20

Stories are a huge way we make sense of the world. Adding a narrative sequence to the post did helped me keep track of the ideas and how they fit together.

[-]duwease14y20

Worked great for me. I like to browse the articles during coffee breaks, and anything that helps me to easily grab on to an idea through example in "reality" rather than slow down and parse out the abstract concepts in my head makes the read go altogether easier :)

[-][anonymous]14y20

I like stories illustrating facts but I think that their usefulness is inversely proportional to the technical complexity (and maybe inferential distance) of the writing. So here it wasn't a problem but it probably wouldn't make much difference if you skipped it.

[-]Desrtopa14y20

Definitely felt it made the article more attention grabbing and easier to follow.

[-]Alexander Gietelink Oldenziel5y10

Datapoint: I skim-read the article today. I am interested in the overal thesis [need for closureness, counterfactual modelling etc]. I skipped the Dr. Zany story.

[-]Kaj_Sotala14y90

Tetlock (1998) also provided me with the two funniest-sounding sentences that I've read in a while (though that doesn't make them incorrect). Commenting on the "concede the counterfactual, but insist that it does not matter for the overall theory" defense:

This defense, which is the most popular of the three, is designated a second-order counterfactual inasmuch as it undoes the undoing of the original close-call counterfactual. Second-order counterfactuals allow for deviations from reality but minimize the significance of the deviations by invoking additional causal forces that soon bring events in the simulated counterfactual world back toward the observed historical path.

[-]Ben_Welchner14y60

He also notes that the experts who'd made failed predictions and employed strong defenses tended to update their confidence, while the experts who'd made failed predictions but didn't employ strong defenses did update.

I assume there's a 'not' missing in one of those.

[-]Kaj_Sotala14y30

Fixed, thanks.

[-]Jonathan_Graehl14y50

Good abstract. Feels obvious, But then, there are some nice details that didn't come out in the summary, like "imagine different outcomes" being risky due to story-thinking. It's worth reading the whole thing for the excellent speculations on how and why.

[-][anonymous]14y30

Feels obvious until it gets to using counterfactuals for possible debiasing and the dangers in the technique — this was quite interesting for me.

Also interesting are the "five logically defensible strategies".

Quibble:

If two people draw different conclusions from the same information, then at least one of them is wrong.

But different conclusions can be compatible?

[-]Tasky14y20

You used the terms "high-need-for-closure" and "low-need-for-closure" quite a lot in you essay. Would you mind explaining what they mean and/or linking to somewhere I can look up the definition, since I am not familiar with them?

Could you maybe also explain what those tests are and how they work (the ones to measure need for closure)?

[-]Kaj_Sotala14y20

I quoted this excerpt from Tetlock (1998) in the post, did you not find it helpful?

Theoretically, high need-for-closure individuals are characterized by two tendencies: urgency which inclines them to 'seize' quickly on readily available explanations and to dismiss alternatives and permanence which inclines them to 'freeze' on these explanations and persist with them even in the face of formidable counterevidence. In the current context, high need-for-closure individuals were hypothesized to prefer simple explanations that portray the past as inevitable, to defend these explanations tenaciously when confronted by dissonant close-call counterfactuals that imply events could have unfolded otherwise, to express confidence in conditional forecasts that extend these explanations into the future, and to defend disconfirmed forecasts from refutation by invoking second-order counterfactuals that imply that the predicted events almost happened.

The papers I referenced (see the end of the post for links) briefly discuss how this was measured. For instance, Tetlock 1998:

The Need for Closure Scale was adapted from a longer scale developed by Kruglanski and Webster (1996) and included the following eight items: "I think that having clear rules and order at work is essential for success"; "Even after I have made up my mind about something, I am always eager to consider a different opinion' l; " I dislike questions that can be answered in many different ways''; "I usually make important decisions quickly and confidently"; "When considering most conflict situations, I can usually see how both sides could be right"; "It is annoying to listen to someone who cannot seem to make up his or her mind"; "I prefer interacting with people whose opinions are very different from my own"; and "When trying to solve a problem I often see so many possible options that it is confusing."1 Experts rated their agreement with each item on 9-point disagree-agree scales.

Incidentally, Tetlock 1998 also used another measure that's theoretically different from need-for-closure, namely integrative complexity.

Integrative complexity should be negatively correlated with need for closure. It implies not only a willingness to entertain contradictory ideas but also an interest in generating, testing, and revising integrative cognitions that specify flexible boundary conditions for contradictory hypotheses. The two constructs— need for closure and integrative complexity—are, however, measured in very different ways: a traditional selfreport personality scale in the case of need for closure and an open-ended thought-sampling procedure requiring content analysis in the case of integrative complexity. Given the severe problems of method variance that have bedeviled cognitivestyle research over the past 50 years (Streufert, 1997), a major advantage of the present study is the inclusion of methodologically dissimilar but conceptually overlapping procedures for assessing cognitive style. [...]

The integrative complexity measure was derived from open-ended responses to a request to reflect on 20th-century history. The following question was used: "Did the 20th century have to be as violent as it has been?" We assured respondents that we understood that many books had been written on this subject and that many more undoubtedly would be written. Our goal was just to get a quick sense for the factors that they deemed most decisive in shaping the general course of events (the sort of shorthand answer they might give a respected colleague in a different discipline at a social occasion). Integrative complexity was coded on a 7-point scale in which scores of 1 were given to statements that identified only causal forces that increased or decreased the likelihood of the specified outcomes (e.g., "Nationalism and mass production of weapons guaranteed disaster"), scores of 3 were assigned to statements that identified causal forces with contradictory effects (e.g., ' "Iwentieth-century history will be remembered not only for the destructive forces unleashed—totalitarianism and weapons of mass destruction— but also for the initial steps toward global governance"), scores of 5 were assigned to statements that tried to integrate two contradictory causal forces (e.g., "Wars can be caused by being too tough or too soft and it is really hard to strike the right balance—that's the big lesson of 20th-century diplomacy"), and scores of 7 placed the problem of integrating causal forces into a broader systemic frame of reference (e.g., "YJU could argue that we got off lucky and escaped nuclear war or that we were incredibly unlucky and wound up with a holocaust that was the product of one man's obsession. How you look at it is a matter of personal temperament and philosophy. My guess is that we are running about par for the course"). Intercoder agreement was .85 between two raters who were blind to both the hypotheses being tested and to the sources of the material.

Tetlock 1998 found the results between the two measures of need-for-closure and integrative complexity to be highly similar, in that individuals with a high need-for-closure scored low on integrative complexity, and vice versa. He combined the results of the two measures to a single variable in analyzing the results of that study. IIRC, in later studies only need-for-closure was used.

[-]Armok_GoB14y10

Fleshed out and increased the weight of heuristics to counter this kind of thing in response to this.

[-]Rhwawn14y00

Upvoted; just enough math.

LESSWRONG
is fundraising!
LW

LESSWRONG
is fundraising!
LW

86

I Was Not Almost Wrong But I Was Almost Right: Close-Call Counterfactuals and Bias

86

86