The Error of Crowds

[-]Doug_S.219y20

If this were anything like my high school math class, everyone else in the class would decide to copy my answer. In some cases, I have darn good reasons to believe I am significantly better than the average of the group I find myself in. For example, take one of my freshman chemistry midterms. The test was multiple choice, with five possible answers for each question. My score was an 85 out of 100, among the highest in the class. The average was something like 42. On the final exam in that class, I had such confidence in my own answer that I declared that, for one of the questions, the correct answer was not among the responses offered - and I was right; one of the values in the problem was not what the professor intended it to be. I was also the only one in the class who had enough confidence to raise an objection to the question.

On the other hand, there are situations in which I would reasonably expect my estimate to be worse than average. If I wandered into the wrong classroom and had no idea what the professor was talking about, I'd definitely defer to the other students. If you ask me to predict the final score of a game between two well-known sports teams, I probably wouldn't have heard of either of them and would just choose something at random. (The average American can name the two teams playing in the Super Bowl when it occurs. I rarely can, and I don't know whether to be proud or ashamed of this.) I also suspect that I routinely overestimate my chances of winning any given game of Magic. ;)

I'm not a random member of any group; I'm me, and I have a reasonable (if probably biased, given the current state of knowledge in psychology) grasp of my own relative standing within many groups.

Also, when you're told that there is a hidden gotcha, sometimes you can find it if you start looking; this is also new information. Of course, you can often pick apart any given hypothetical situation used to illustrate a point, but I don't know if that matters.

[-]HalFinney19y00

This is a good point. I think squared errors are often used because they are always positive and also analytic - you can take derivatives and get smooth functions. But for many problems they are not especially appropriate.

Informally problems are often posed with an absolute-value error function. Like the square root, this has a cusp at zero and so will "hold water". If some people miss too high and others miss too low, then in this case it also makes sense to switch to the average. If everyone misses on the same side, then it doesn't help but doesn't hurt to switch to the average. So in general it is a good strategy.

I mentioned the other day one example of the good performance of the average in "guessing beans in a jar" type problems. In this case the average came out 3rd best compared to guesses from a class of 73 students. This implicitly uses an absolute-value error function and the problem was such that people missed on both sides. Jensen's Inequality shows why averages work well in such problems.
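A minimal sketch of the absolute-value case described above, using made-up guesses for a hypothetical jar of 1000 beans (the numbers and function names are illustrative assumptions, not data from the class of 73). It compares the error of the average guess with the average of the individual errors.

```python
from statistics import mean


def abs_error(guess, truth):
    """Absolute-value error of a single guess."""
    return abs(guess - truth)


def compare_average_to_crowd(guesses, truth):
    """Return (error of the mean guess, mean of the individual errors)."""
    error_of_mean = abs_error(mean(guesses), truth)
    mean_of_errors = mean(abs_error(g, truth) for g in guesses)
    return error_of_mean, mean_of_errors


# Hypothetical data: the true bean count and two invented sets of guesses.
truth = 1000
both_sides = [700, 900, 1100, 1500]   # some guess low, some guess high
same_side = [1100, 1200, 1300, 1400]  # everyone guesses too high

print(compare_average_to_crowd(both_sides, truth))
print(compare_average_to_crowd(same_side, truth))
```

When the guesses straddle the truth, the first number is smaller than the second, so switching to the average helps. When everyone misses on the same side, the two numbers are equal, so switching neither helps nor hurts.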

[-]NoSignalNoNoise13y20

Informally problems are often posed with an absolute-value error function. Like the square root, this has a cusp at zero

abs(x) has a corner, not a cusp at zero. For a cusp, the derivative approaches +infinity from one side and -infinity from the other; for a corner, it is undefined, but approaches a finite value from at least one of the sides.

[-]Robin_Hanson219y20

Eliezer, given opinions on some variable X, majoritarianism is not committed to the view that your optimal choice facing any cost function is E[X]. The claim should instead be that the best choice is some average appropriate to the problem. Since you haven't analyzed what is the optimal choice in the situation you offer, how can we tell that majoritarianism in fact gives the wrong answer here?

[-]Eliezer Yudkowsky19y60

Hal, the surprising part of the beans-in-a-jar problem is that the guessers must collectively act as an unbiased estimator - their errors must nearly all cancel out, so that variance accounts for nearly all of the error, and systematic bias for none of it. Jensen's Inequality does not account for this surprising fact, it only takes advantage of it.
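As a rough illustration of the split between systematic bias and variance under squared error, the sketch below decomposes the mean squared error of a set of guesses. The guesses and the true value are invented for the example; only the identity itself (mean squared error = squared bias + variance of the guesses) is standard.

```python
from statistics import mean, pvariance


def decompose(guesses, truth):
    """Return (mean squared error, squared bias, variance) for the guesses.

    The identity checked here is:
        mean squared error == squared bias + population variance
    """
    mean_sq_error = mean((g - truth) ** 2 for g in guesses)
    bias_sq = (mean(guesses) - truth) ** 2
    variance = pvariance(guesses)
    return mean_sq_error, bias_sq, variance


# Hypothetical guesses for a jar that actually holds 1000 beans.
guesses = [700, 900, 1100, 1500]
mse, bias_sq, variance = decompose(guesses, truth=1000)
print(mse, bias_sq, variance)
```

In this invented example the variance term dominates and the squared bias is small, which is the pattern the beans-in-a-jar result would require. The decomposition itself says nothing about why the bias should be small.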

Robin, I don't claim to frame any general rule for compromising, except for the immodest first-order solution that I actually use: treat other people's verbal behavior as Bayesian evidence whose meaning is determined by your causal model of how their minds work - even if this means disagreeing with the majority. In the situation I framed, I'd listen to the other math students talking, offer my own suggestions, and see if we could find the hidden gotcha. If there is a principle higher than this, I have not seen it.

[-]Eliezer Yudkowsky19y10

Robin, on second thought, there's a better answer to your question. Namely, the reason that BVD (bias variance decomposition) is offered as support for majoritarianism is the assumption that adopting the average of the group estimates is in fact what majoritarianism advises; otherwise BVD would contradict majoritarianism by suggesting a superior alternative, namely, adopting the average of group estimates, instead of whatever it is majoritarianism does advise. And if majoritarianism does advise adopting the group average, then I can offer a superior alternative to it in the scenario given; namely, use an estimate from a randomly selected student. And if majoritarianism is said, after the fact, to give whatever advice we painstakingly deduced to be best - so that someone suggests that majoritarianism doesn't command averaging the estimates in this case, only after we worked out from nonmajoritarian reasons that averaging was a bad idea - then I'd like to know what the use is of a philosophy whose recommendations no one can figure out in advance. And also, what happened to the idea that the average opinion was likely to be true, not just useful?

[-]Robin_Hanson219y00

Eliezer, my best reading of majoritarianism is that it advises averaging the most recent individual probability distributions, and then having each person use expected utility with that combined distribution to make his choice.

In your example, you have students pick "estimates," average them, give them new info and a new cost function, and then complain that the average of the old estimates, ignoring the new info, does not optimize the new cost function.

[-]Robin_Hanson219y20

One would have a severe framing problem if one adopted a rule that one's estimate of X should be the average across people of their estimates, E[X]. This is because a translation of variables to F(X) might be just as natural a way to describe one's estimates, but as Eliezer points out, E[F(X)] usually differs from F(E[X]). So I think it makes more sense to average probabilities, rather than point estimates.
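A small sketch of the framing problem, assuming three invented estimates of X and taking F to be the square root purely for illustration. Averaging the estimates and then transforming gives a different answer from transforming each estimate and then averaging.

```python
from math import sqrt
from statistics import mean


def average_then_transform(estimates, f):
    """F(E[X]): average the raw estimates, then apply f."""
    return f(mean(estimates))


def transform_then_average(estimates, f):
    """E[F(X)]: apply f to each estimate, then average."""
    return mean(f(x) for x in estimates)


# Hypothetical point estimates of X from three people.
estimates = [1.0, 4.0, 9.0]

print(average_then_transform(estimates, sqrt))  # F(E[X])
print(transform_then_average(estimates, sqrt))  # E[F(X)]
```

The two results differ, so a rule of the form "adopt the group's average estimate" gives different advice depending on whether people happen to report X or F(X).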

[-]Eliezer Yudkowsky19y00

Robin, that's a fair reply for saving majoritarianism. But it doesn't save the argument from bias-variance decomposition, except in the special case where the loss function is equal to the squared difference for environmental or moral reasons - that is, we are genuinely using point scalar estimates and squared error for some reason or other. The natural loss function for probabilities is the log score, to which the bias-variance decomposition does not apply, although Jensen's Inequality does. (As I acknowledged in my earlier post on The Modesty Argument.)
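A minimal sketch of how Jensen's Inequality applies to the log score, using invented probabilities that three hypothetical agents assigned to the outcome that actually occurred. Because the logarithm is concave, the score of the averaged probability is at least the average of the individual scores.

```python
from math import log
from statistics import mean


def log_score(p):
    """Log score for the probability p assigned to the outcome that occurred."""
    return log(p)


def pooled_vs_individual(probs):
    """Return (log score of the averaged probability, average of the log scores)."""
    score_of_average = log_score(mean(probs))
    average_of_scores = mean(log_score(p) for p in probs)
    return score_of_average, average_of_scores


# Hypothetical probabilities assigned to the true outcome by three agents.
probs = [0.9, 0.6, 0.3]
print(pooled_vs_individual(probs))
```

This shows only that the pooled distribution scores at least as well as a randomly selected agent in expectation; it does not show that pooling beats any particular agent, such as the one who assigned 0.9.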

This leaves us with the core question as "Can you legitimately believe yourself to be above-average?" or "Is keeping your own opinion like being a randomly selected agent?" which I think was always the key issue to begin with.

[-]Paul Crowley16y20

The graph for this post has vanished.

[-]Nevin15y30

A similar graph is here.

[-][anonymous]14y00

I think one could still have the BVD and still believe it did involve some type of modesty. The idea is just that when I weight the input of the other agents, I'm not just looking at the raw number that they output as an estimate, but also models for how they arrived at that estimate, etc. Under some independence assumptions, you would re-write BVD in an expanded form that involved multiplying many probabilities for each agent. Thus, if a creationist advised you to down-weight the theory of natural selection, you wouldn't just consider that alone when re-forming your beliefs. You'd also consider the likelihood that that agent suffers from some biases, or has motivated skepticism, etc. And this whole string of probabilities would lead you to update your belief in the ten trillionth decimal place or something; something well below the machine epsilon of a human mind. But in cases where the other agents can't be modeled as deficient reasoners, you would give more credence to their differing estimates and update accordingly. The modesty argument, to me, represents credence for trustworthiness of other agents. In cases where that trustworthiness is probably low, not much updating happens. When it is high, more updating happens.

[-]LawrenceC10y20

Results from the Good Judgment Project suggest that putting people into teams lets them significantly outperform (have lower Brier scores than) predictions from both (unweighted) averaging of probabilities and the (admittedly also unweighted) averaging of probability estimates from the better portion of predictors. This seems to offer weak evidence that what goes on in a group is not simple averaging.
