Intransitive Trust

[-]papetoast2y*100

EigenKarma is a (basically abandoned) attempt to apply a semi-transitive trust algorithm in real situations. EigenTrust (the original paper, which is the basis of EigenKarma) had some brief discussions and computational experiments on how adversaries affect trust.

I had been wanting to spend some time working on improving EigenKarma for the last year but haven't got around to do it.

[-]Said Achmiz2y74

He decides the sensitivity of the test—that aliens actually abduct people, given he experienced aliens abducting him—is 5% since he knows he doesn’t have any history of drug use, mental illness, or prankish friends with a lot of spare time and weird senses of humour. (That’s P(B|A).)

Er, what? P(“aliens actually abduct people, given [Bob] experienced aliens abducting him”) is P(A|B), not P(B|A). This is what you’re trying to find out, so it can’t be one of the inputs.

P(B|A), which is indeed the input you want, would be P(“Bob experiences aliens abducting him, given that aliens actually abduct people”). However, this has nothing whatsoever to do with whether Bob has “any history of drug use, mental illness, or prankish friends with a lot of spare time and weird senses of humour”. (It would presumably depend on things like “what is the selection procedure that the aliens use to choose abduction victims”, plus various figures like “how many people are there”, “what is the probability that Bob would be in a certain location at a certain time”, etc., etc.) And I can’t see where you might get a figure of 5% for this.

So, as far as I can tell, the math you provide doesn’t work.

[-]Said Achmiz2y63

This part is obviously more speculative, but let’s suppose that Bob reasons thus: given that aliens abduct people, there is perhaps an equal chance that they would abduct any human as any other. (Is this a reasonable assumption? Who knows?) And let’s say that they abduct 100 people per year. So the chance of having been abducted at least once in (let’s say) 40 years of life would be 1 − (7,999,999,900 / 8,000,000,000)^40 = ~0.0000005.

Ah, but there’s a catch! The chance of having an abduction experience if there are aliens isn’t just the chance of being abducted, it’s the chance of being abducted plus the chance of falsely coming to believe you’ve been abducted when in fact you have not. (As surely we do not think that the existence of aliens would prevent humans from having schizophrenic episodes or LSD trips etc.?) Thus we must add, to P(B|A), that 0.001 chance of having a false abduction experience, for a total of 0.0010005. (If we didn’t account for this, we’d end up concluding that Bob’s experience should lead him to revise P(A) drastically down!)

So, the revised calculation:

P(A|B) = (0.0010005 * 0.01) / ((0.0010005 * 0.01) + (0.001 * 0.99)) = 0.000010005 / (0.000010005 + 0.00099) = 0.000010005 / 0.001000005 = ~0.01000495 = 1.000495%.

1.000495%. Not 33%. (The prior, we must recall, was 1%.) In other words, this is an update so tiny as to be insignificant.

Of course we can tweak that by modifying our assumptions about the aliens’ behavior—how often do they abduct people, how do they select abductees—but you’d have to start with some truly implausible assumptions to get the numbers anywhere near a large update.

Bob should notice that it is overwhelmingly more likely that his experience was false than that it was real, and it should have essentially no effect whatsoever on his estimate of the probability of the existence of aliens who sometimes abduct people.

[-]Screwtape2y20

Thank you for checking my math and setup! This is my first time trying Bayes in front of an audience.

Yeah, I think I described P(A|B) when trying to describe the sensitivity, you are right that whether aliens actually abduct people given Bob experienced aliens abducting him is P(A|B). It's possible I need to retract the whole section and example.

Your description of P(B|A) confuses me though. If I think through the standard Bayes mammogram problem, I don't set P(B|A) as P("A specific woman gets a positive test result, given some people get a positive test") and have to figure out what the selection procedure is that the doctor uses to choose people to test. We're looking for P("A specific woman gets a positive test result, given she actually has cancer.") I think Bob gets to start knowing he experienced getting abducted, the same way the woman in the mammogram problem gets to start knowing she got a positive test. He then tries to figure out whether the abduction was aliens or some kind of hallucination, the same way the woman (or her doctor) in the mammogram problem tries to figure out whether the test result is a true positive or a false positive.

Hrm. So, in the mammogram problem, if the sometimes the machine malfunctions in a way that gives a positive result whether or not the woman actually had cancer, then some of the time the woman will coincidentally happen to have cancer when the machine malfunctioned. I think that's just supposed to be counted as part of the probability the woman with cancer gets a positive test, i.e. the sensitivity? Translating back to Bob's circumstances, aliens are real, but Bob hallucinated?

Intuitively it makes sense to me that if someone thinks they got abducted by aliens, it's more likely they're hallucinating than that they actually got abducted by aliens. It's true that aliens actually abducting people wouldn't mean people stop having hallucinations. But adding P(B|¬A) - the rate of false positives - to P(B|A) - the rate of true positives - seems like some kind of weird double counting. What am I misunderstanding here?

[-]Said Achmiz2y20

Intuitively it makes sense to me that if someone thinks they got abducted by aliens, it’s more likely they’re hallucinating than that they actually got abducted by aliens. It’s true that aliens actually abducting people wouldn’t mean people stop having hallucinations. But adding P(B|¬A) - the rate of false positives—to P(B|A) - the rate of true positives—seems like some kind of weird double counting. What am I misunderstanding here?

Well, to start with, if you don’t include the false positive rate in P(B|A), and work through the numbers, then, as I said, you’ll find that having the abduction experience will drastically lower your probability estimate of aliens. You would have:

P(A|B) = (0.000005 · 0.01) / ((0.000005 · 0.01) + (0.001 · 0.99)) = 0.00000005 / (0.00000005 + 0.00099) = 0.00000005 / 0.00099005 = ~0.0000505025 = 0.00505025%.

So that’s clearly very wrong—even if it doesn’t tell you what the right answer should be.

But, intuitively… well, I explained it already: if aliens exist and abduct people, some people will still take drugs or go crazy or whatever, and hallucinate being abducted. Bob could be one of those people. It’s not double-counting because the A is “aliens exist and abduct people”, not “Bob was abducted by aliens”. (Otherwise P(A) could not possibly have started as high as 0.01—that would’ve been wrong by many orders of magnitude as a prior!) (This is essentially @clone of saturn’s explanation, so see his sibling comment for more on this point.)

[-]Said Achmiz2y*20

Yeah, I think I described P(A|B) when trying to describe the sensitivity, you are right that whether aliens actually abduct people given Bob experienced aliens abducting him is P(A|B). It’s possible I need to retract the whole section and example.

I agree. But I don’t think that you should discard the text entirely, because it seems to me that there is actually a lesson here.

I have had this experience many times: someone (sometimes on this very website) will say something like, “I know for a fact that X; my experience proves it to me beyond any doubt; I accept that my account of it won’t convince you of X, but I at least am certain of it”.

And what I often think in such cases (but perhaps too rarely say) is:

“But you shouldn’t be certain of it. It’s not just that I don’t believe X, merely based on your experience. It’s that you shouldn’t believe X, merely based on your experience. You, yourself, have not seen nearly enough evidence to convince you of X—if you were being a proper Bayesian about it. Not just my, but your conclusion, should be that, actually, X is probably false. Your experience is insufficient to convince me, but it should not have convinced you, either!”

(EDIT: For example.)

(This is related to something that Robyn Dawes talks about in Rational Choice in an Uncertain World, when he says that people are often too eager to learn from experience.)

This is also related to what E. T. Jaynes calls “resurrection of dead hypotheses”. If you have an alien abduction experience, then this should indeed raise your probability estimate of aliens existing and abducting people. But it should also raise your probability estimate of you being crazy and having hallucinations (to take one example). And since the latter was much more probable than the former to begin with, and the evidence was compatible with both possibilities, observing the evidence cannot result in our coming to believe the former rather than the latter. As Jaynes says (in reference to his example of whether evidence of psychic powers should make one believe in psychic powers):

…Indeed, the very evidence which the ESPers throw at us to convince us, has the opposite effect on our state of belief; issuing reports of sensational data defeats its own purpose. For if the prior probability of deception is greater than that of ESP, then the more improbable the alleged data are on the null hypothesis of no deception and no ESP, the more strongly we are led to believe, not in ESP, but in deception. For this reason, the advocates of ESP (or any other marvel) will never succeed in persuading scientists that their phenomenon is real, until they learn how to eliminate the possibility of deception in the mind of the reader.

[-]clone of saturn2y20

The mammogram problem is different because you're only trying to determine whether a specific woman has cancer, not whether cancer exists at all as a phenomenon. If Bob was abducted by aliens, it implies that alien abduction is real, but the converse isn't true. You either need to do two separate Bayesian updates (what's the probability that Bob was abducted given his experience, and then what's the probability of aliens given the new probability that Bob was abducted), or you need a joint distribution covering all possibilities (Bob not abducted, aliens not real; Bob not abducted, aliens real; Bob abducted, aliens real).

[-]Screwtape2y20

Hrm. Maybe the slip is accidentally switching whether I'm looking for "do aliens abduct people, given Bob experienced being abducted" vs "was Bob's abduction real, given Bob experienced being abducted."

But if Bob's abduction was real, then aliens do abduct people. It would still count even if his was the only actual abduction in the history of the human race. Seems like this isn't the source of the math not working?

[-]Seth Herd2y40

Semi-transitive trust is also how scientists update their beliefs, and therefore how we form scientific conclusions.

So the dilemma about how to update in your examples also govern our understanding of cutting edge scientific issues. I'm pretty sure of that after spending a couple of decades as a research scientist. I suspect this is also true of our understandings of politics, business, and everything else that makes the modern world turn. Including our understanding of the interlocking questiosn underlying the alignment problem.

In the case of scientific inquiry, the other scientists have tried to state more of why they've formed their beliefs; they cite statistics, methods, and other studies. But you can't understand all of their methods; they're never literally all stated, and you know you're misunderstanding even some of the ones that are stated. And you can't possibly read and fully understand all of the citations they give, unless this is exactly your research are, and you're addressing exactly the same question. Even then, you probably don't have time to read every cited study in detail, let alone ask the authors in person to clarify.

Add to these problems the blurry line between lying and biased reporting.

I think understanding this, as well as its place in our epistemics (as addressed in the excellent Truthseeking is the ground in which other principles grow is critical for making progress on complex questionsn.

[-]Said Achmiz2y30

Another error: you have P(ban) = (0.3 · 0.005) + (0.001 · 0.995) but it should be P(ban) = (0.3 · 0.005) + (0.01 · 0.995) (0.001 would be 0.1% false positive rate, not 1% as you stipulate). ~~This results in P(problem|ban) = 14.85% rather than 13.1%.~~

(~~Also, I am actually not sure how you got 13.1%.~~ With the aforementioned error, the result would be 60.1%…)

[-]Screwtape2y20

You are correct I added an extra 0, writing (0.3 · 0.005) + (0.001 · 0.995) when I meant (0.3 · 0.005) + (0.01 · 0.995). That's a transcription error, thank you for catching it.

I'm not sure how you're getting 14.85% or 60.1% though? I just checked, and I think those numbers do wind up at ~13.1%, not 14.85%.

[-]Said Achmiz2y20

Yep, my mistake. I’m not sure either! Math is hard, it seems…

EDIT: That is, I’m not sure how I got 14.85%—that was, like, I pressed the wrong calculator keys, or something. But 60.1% is like this:

(0.3 · 0.005) / ((0.3 · 0.005) + (0.01 · 0.995)) = ~0.60120 = 60.12%

[-]Dagon2y30

Adversarial action makes this at least an order of magnitude worse. If Carla has to include the chance that Bob could be lying (for attention, for humor, or just pathologically) about his experiences or his history of drug use or hallucinations, she makes an even smaller update. This is especially difficult in group-membership or banning discussions, because LOTS of people lie (or just focus incorrectly on different evidence) for status and for irrelevant-beef reasons.

I don't think there is a solution, other than to acknowledge that such decisions will always be a balance of false-positive and false-negative, and it's REQUIRED to ban some innocent (-ish) people in order to protect against the likely-but-not-provably-harmful.

[-]Screwtape2y20

I agree adversarial action makes this much worse.

I think Bob and Carla's problem isn't really whether Bob is lying or not. If they knew for an absolute fact Bob wasn't speaking things he knew to be factually untrue, Carla still has to sort through misunderstanding (maybe Bob's talking about a LARP?) and drug use (maybe Bob forgot whether he took LSD the way I forget whether I've had coffee sometimes?) and psychotic breaks. I wouldn't usually count any of those as "lying" in the relevant sense; Bob's wrong, but he's accurately reporting his experiences as best he can.

I don't have a solution for the group membership case, which I think of as a special case of the reputation problem. I'm trying to point out a couple failure modes; one where you don't realize a bunch of your information actually has a single source and should be counted once, and one where you don't actually get or incorporate reputation information at all.

[-]Menotim2y20

This problem doesn't seem to be about trust at all, it seems to be about incomplete sharing of information. It seems weird to me to say Carla doesn't completely trust Bob's account if she is 100% sure he isn't lying.

The sensitivity of the test - that aliens actually abduct people, given someone is telling her aliens abducted him - is 2.5% since she doesn't really know his drug habits and hasn't ruled out there's a LARP she's missing the context for.

I would describe this not as Carla not trusting Bob, but as her not having all of Bob's information - Bob could just tell her that he doesn't use drugs, or that he isn't referring to a LARP, or any other things he knows about himself that Carla doesn't that are causing her sensitivity to be lower, until their probabilities are the same. And, of course, if this process ends with Carla having the same probabilities as Bob, and Carla does the same with Dean, he will have the same probabilities as Bob as well.

I think this satisfies Aumann's Agreement Theorem.

Well, if it does then Bob and Carla definitely have the same probabilities; that's what the problem says, after all.

^{^}

If the problem is that someone is a known serial rapist, then excessive reputation damage might be acceptable. (Though you should hopefully be fairly confident about it!) If the problem is that someone is rude and annoying to work with, then a big public statement can feel like overkill.

^{^}

Though now that I've said that, I'm going to have to worry about adversarial action. Expect a less organized post at some point about this, but any time you explain how you make decisions, if anyone whose interests are not aligned with yours would want those decisions to come out differently, you need to be at least a little aware you might be getting set up.

LESSWRONG
is fundraising!
LW

LESSWRONG
is fundraising!
LW

45

45

45

I.

II.

III.

IV.

V.