Mistakes with Conservation of Expected Evidence

[-]Ruby7y290

I really like this. It feels very important and I feel like I'm going to have to read through it a could more times to absorb and process.

This stuff just feels so fundamental. I'd love to see more discussion like this.

[-]abramdemski7y200

Thank you! Appreciative comments really help me to be less risk-averse about posting.

[-]Vladimir_Nesov7y*220

To see why it is wrong in general, consider an extreme case: a universal law, which you mostly already believe to be true.

I feel that this example might create another misconception, that certainty usually begets greater certainty. So here's an opposing example, where high confidence in a belief coexists with high probability that it's going to get less and less certain. You are at a bus stop, and you believe that you'll get to your destination on time, as busses here are usually reliable, though they only arrive once an hour and can deviate from schedule by several minutes. If you see a bus, you'll probably arrive in time, but every minute that you see no bus, it's evidence of something having gone wrong, so that the bus won't arrive at all in the next hour. You expect a highly certain belief (that you'll arrive on time) to decrease (a little bit), which is balanced by an unlikely alternative (for each given minute of waiting) of it going in the direction of greater certainty (if the bus does arrive within that minute).

[-]jimrandomh7y200

Promoted to curated. One of LessWrong's main goals is to advance the art of rationality, and spotting patterns in the ways we process and misprocess evidence is a central piece of that. I also appreciated the Bayesian grounding, the epistemic statuses and the recapping and links to older work. I'm pretty sure most have made these errors before, and I expect that fitting them into a pattern will make them easier to recognize in the future.

[-]Wei Dai7y190

I think all of this is fairly important—if you’re like me, you’ve likely made some mistakes along these lines.

Can you give some examples of specific mistakes you made or you observed others make? Have you observed me making any such mistakes? I ask because there's a lot of rather abstract arguments in this post and I'm not sure if it's worth the effort to think carefully about them to try to figure out if they apply to me. It seems like giving more concrete examples would help others who might have this reaction to your post too.

[-]abramdemski7y*180

Thinking up actual historical examples is hard for me. The following is mostly true, partly made up.

(#4) I don't necessarily have trouble talking about my emotions, but when there are any clear incentives for me to make particular claims, I tend to shut down. It feels viscerally dishonest (at least sometimes) to say things, particularly positive things, which I have an incentive to say. For example, responding "it's good to see you too" in response to "it's good to see you" sometimes (not always) feels dishonest even when true.
(#4) Talking about money with an employer feels very difficult, in a way that's related to intuitively discarding any motivated arguments and expecting others to do the same.
(#6) I'm not sure if I was at the party, but I am generally in the crowd Grognor was talking about, and very likely engaged in similar behavior to what he describes.
(#5) I have tripped up when trying to explain something because I noticed myself reaching for examples to prove my point, and the "cherry-picking" alarm went off.
(#5, #4) I have noticed that a friend was selecting arguments that I should go to the movies with him in a biased way which ignored arguments to the contrary, and 'shut down' in the conversation (become noncommittal / slightly unresponsive).
(#3) I have thought in mistaken ways which would have accepted modest-epistemology arguments, when thinking about decision theory.

[-]gjm7y192

On "If you can't provide me with a reason ...", I think the correct position is: when someone says X (and apparently means it, is someone whose opinions you expect to have some correlation with reality, etc.) you update towards X, and if they then can't give good reasons why X you then update towards not-X. Your overall update could end up being in either direction; if the person in question is particularly wise but not great at articulating reasons, or if X is the sort of thing whose supporting evidence you expect to be hard to articulate, the overall update is probably towards X.

[-]abramdemski7y50

That seems about right.

A concern I didn't mention in the post -- it isn't obvious how to respond to game-theoretic concerns. Carefully estimating the size of the update you should make when someone fails to provide good reason can be difficult, since you have to model other agents, and you might make exploitable errors.

An extreme way of addressing this is to ignore all evidence short of mathematical proof if you have any non-negligible suspicion about manipulation, similar to the mistake I describe myself making in the post. This seems too extreme, but it isn't clear what the right thing to do overall is. The fully-Bayesian approach to estimating the amount of evidence should act similarly to a good game-theoretic solution, I think, but there might be reason to use a simpler strategy with less chance of exploitable patterns.

[-]TurnTrout5y170Nomination for 2019 Review

I would almost nominate this post for this quote alone:

The right thing to do is closer to this: figure out how convincing you expect evidence to look given the extent of selection bias. Then, update on the difference between what you see and what's expected. If a clever arguer makes a case which is much better than what you would have expected they could make, you can update up. If it is worse that you'd expect, even if the evidence would otherwise look favorable, you update down.

I've used this heuristic several times over the last year, and it was better than whatever I would have done otherwise.

Zooming out, I personally found points 1, 4, 5, and 6 to be insightful one year on.

[-]JenniferRM3y*130

This is a great post in a humanistic sense, that I only just read today via curatorial cleverness giving me appreciation for author and curators both :-)

I will now proceed to ignore the many redeeming qualities and geek out on math quibbling <3

Something jumped out at me related to the presumed equivalence between predicate logic and bayesian probability and subjective observations.

Consider how this is presented:

Then also this:

$P (H) = P (H | o b s (E)) P (o b s (E)) + P (H | o b s (\neg E)) P (o b s (\neg E))$

I instantly and intuitively "get what this is pointing to" within the essay as a way of phenomenologically directing my thoughts towards certain experiences of looking at things (or not), and trying experiments (or not), or even bringing things up in a discussion (or not)...

...and yet also some part of me feels that this is a sort of "an abuse of notation" that might have massive cascading consequences?

Like... it is nearly axiomatic that:

$1.0 = P (o b s (E)) + P (\neg o b s (E))$

But it does NOT seem obviously "nearly axiomatic" that:

$1.0 = P (o b s (E)) + P (o b s (\neg E))$

Also, I feel like "the predictable failure of anticipated observations to logically sum to unity" is close to the core of many confusions?

Until it is formalized, I find that I just do not know for sure how that "obs" operator is supposed to work mechanically and syntactically and in terms of whether adding that operator to the rest of the bayesian tools would maintain any particular properties like soundness or completeness!

Like maybe it is possible to "obs(E & not-E)" by two different methods, during a single observational session, and maybe that's just... in the range of formally allowed (sometimes empirically faulty?) "observations"?

Is this intended as syntactic sugar somehow for... sequential updating? (Surely the main point of Bayesian notation is to turn "pre-OBServational prior probabilities" into "post-OBServational posterior probabilities"?)

Or maybe you are trying to evoke SQL's "three valued logic" where NULL sort of "contaminates" the output of boolean logical operators applied to it? Like in SQL you say "(True AND NULL) is NULL" while "(True OR NULL) is True" and "(False OR NULL) is NULL".

Or is this intended to be evocative of something like the Pearlian DO() operator?

((

Maybe the OBS() operator is literally syntactic sugar for DO() in a special case?

Maybe you could build a larger belief network that has a boolean "observed_the_thing" as a "belief" node with a causal arrow to a different node that represents "observation_of_the_thing" and then P(observation_of_the_thing=SQLNULL|observed_the_thing=False) > 99% as an epistemic belief in a larger table of conditional probabilities that spell out exactly how "observing CAUSES observations"?

Then maybe OBS() is literally just DO() in the restricted case of DO(observed_the_thing=True)?

))

My hope, given that you authored this in 2019 and I'm commenting in 2023, is that you already noticed something proximate to these possibilities, and can just point me to some other essay or textbook chapter or something <3

Have you explored much of this in other ways since writing it?

It feels like it could be an evocative call to "invent new notation to formalize something not-yet-formalized" <3

[-]abramdemski3y30

Until it is formalized, I find that I just do not know for sure how that "obs" operator is supposed to work mechanically and syntactically and in terms of whether adding that operator to the rest of the bayesian tools would maintain any particular properties like soundness or completeness!

I entirely agree here. Since December, I have been writing a megapost about interpreting "obs" as "box" in modal logic, so that it's like a concept of necessity or proof, but mostly in a critical way where I question every assumption that anyone has made in the literature. Hopefully that post will se the light of day some time this year.

Currently I think of Obs(X) as the proposition that you have observed X. The idea that you should update on Obs(X) when you observe X is related to the idea that Obs(X) -> Obs(Obs(X)); ie, the idea that you always observe that you osberve X, if you observe X at all. I think this proposition is faulty, so we can't necessarily update on Obs(X), as most Bayesians recommend, even though we would be better off doing so.^[1]

So I don't particularly think the "do" operator is involved here.

Like maybe it is possible to "obs(E & not-E)" by two different methods, during a single observational session, and maybe that's just... in the range of formally allowed (sometimes empirically faulty?) "observations"?

An example which appears in the philosophical literature: you observe that it's 6pm, and later you observe that it's 7pm, which contradicts the 6pm observation.

(The contradiction seems pretty dumb, and should be easy to take care of, but the important question is how exactly we take care of it.)

My hope, given that you authored this in 2019 and I'm commenting in 2023, is that you already noticed something proximate to these possibilities, and can just point me to some other essay or textbook chapter or something <3

I would point you to Novelty, Information, and Surprise with the caveat that I don't entirely buy the approach therein, since it firmly assumes Obs(X) -> Obs(Obs(X)). However, it still yields an interesting generalization of information theory, by rejecting the critical 'partition assumption', an assumption about how Obs() can work (due to Aumann, afaict) which I briefly argued against in my recent post on common knowledge. I think re-reading Aumann's classic paper 'agreeing to disagree' and thinking carefully about what's going on is a good place to start. Also, Probabilistic Reasoning in Intelligent Systems by Judea Pearl has a careful, thoughtful defense of conditioning on Obs(X) instead of X somewhere in the early chapters.

^{^}
I'll construct a counterexample.
Define the "evidence relation" R to mean that when I'm in w1, I think I might be in w2. Define Obs(X) to mean that I think I'm within the set of worlds X. (That is to say, X is equal to, or a superset of, the worlds I think I might be in.) The "information set at a world w" is the set of worlds I think I might be in at w. To "know" a proposition X is to Obs(X); that is, to rule out worlds where X is not the case.
Obs(X) -> Obs(Obs(X)) implies that R is transitive: if I think I might be in a world w2, then I must think I might be in any world w2 would think it might be in. Otherwise, I think I might be in a world w2 where I thought I might be in some world w3, which I don't currently think I might be in. In other words, the information set at w2 contains something which the information set at w1 does not contain. Setting X to be something that's false in w3 but true in w1 and w2, we see that Obs(X) but not Obs(Obs(X)), contradicting our initial assumption.
Indeed, Obs(X) -> Obs(Obs(X)) is equivalent to transitivity of R.
But transitivity of R is implausible for an agent embedded in the physical world. For example, if I observe a coffee cup on a table, I can only measure its location to within some measurement error. For every possible location of the coffee cup, the information set includes a small region around that precise location. Transitivity says that if we can always slide the coffee cup by 1mm to make an indistinguishable observation, then we must be able to slide it by Xmm for any X. So, if we accept both transitivity and realistic imprecision of measurement, we are forced to conclude that the coffee cup could be anywhere.

[-]romeostevensit7y90

Posts like this are deceptively hard to write, so I really appreciate how well done this is.

Providing reasons feels fractal, or ship of theseus like to me. The metaphor that comes to mind is something like

Imagine two martial artists sparring, you are listening to a commentator describe the match over a radio. Two commentators would describe the match differently. In principle, a fight between two novices and a fight between two masters might sound very similar if the commentary captures a low enough resolution of events. When trying to communicate, we're something like the commentator looking directly at the mashing together of felt senses and using various mental moves to carve up the high dimensional space differently. Groups of people will fall into commentator norms to improve bandwidth, but these choices carry (usually unacknowledged) trade offs. Reification at one particular abstraction level forces a lot of structure on things that is a result of the choice of level as much as a result of the territory.

This is one of the reasons for Chapman's 'if a problem seems hard, the representation is probably wrong.' Different initial basis choices tend to push the complexity around to different parts of the model. And this process isn't even always perverse. Often the whole point is that you really can shove the uncertainty somewhere where it doesn't matter for your current purposes.

[-]magfrump5y70Review for 2019 Review

I like that this responds to a conflict between two of Eliezer's posts that are far apart in time. That seems like a strong indicator that it's actually building on something.

Either "just say the truth", or "just say whatever you feel you're expected to say" are both likely better strategies.

I find this believable but not obvious. For example, if the pressure on you is you'll be executed for saying the truth, saying nothing is probably better that saying the truth. If the pressure on you is remembering being bullied on tumblr, and you're being asked if you disagree with the common wisdom at a LW meetup, saying nothing is better than saying what you feel expected to say.

I find it pretty plausible that those are rare circumstances where the triggering uncertainty state doesn't arise, but then there are some bounds on when the advice applies that haven't been discussed at all.

a little cherry-picking is OK

I think the claim being made here is that in most cases, it isn't practical to review all existing evidence, and if you attempt to draw out a representative sub-sample of existing evidence, it will necessarily line up with your opinion.

In cases where you can have an extended discussion you can mention contradicting evidence and at least mention that it is not persuasive, and possibly why. But in short conversations there might only be time for one substantial reference. I think that's distinct from what I would call "cherry-picking." (it does seem like it would create some weird dynamics where your estimate of the explainer's bias rises as you depart from uncertainty, but I think that's extrapolating too far for a review)

I think the comment of examples is helpful here.

I wonder about the impact of including something like this, especially with social examples, in a curated text that is at least partly intended for reading outside the community.

[-]Gurkenglas7y70

If you, while awake, decide to doubt whether you're awake (no matter how compelling the evidence that you're awake seems to be), then you're not really improving your overall correctness.

It builds a habit that makes you also doubt while dreaming.

[-]Yoav Ravid5y30

It seems to me that the dream example doesn't actually violate the principle "Yes require the possibility of no", but just a very tricky case.

If i understand correctly, what you're saying is basically that the observation itself is evidence. now, it's only evidence if it happens only in one case, when you're awake. so wouldn't saying "observation as evidence requires the possibility of no observation" be correct and consistent with the principle?

[-]abramdemski5y60

Yeah. But I fear that a more common reading of "yes requires the possibility of no" takes it to mean "yes requires the possibility of an explicit no", when in fact it's just "yes requires the possibility of not-yes". I would rather explicitly highlight this by adding "yes requires the possibility of no, or at least, silence", rather than just lumping this under "tricky cases" of yes-requires-the-possibility-of-no.

[-]Yoav Ravid5y30

Fair enough, thanks for the clarification . "not-yes" actually makes things clearer to me than "silence", but unfortunately it doesn't sound elegant. anyway I'm happy my intuition of the principle was right, and it was more a matter of labeling.

[-]Ben Pace5y30Nomination for 2019 Review

Lots more of the basics. Practical heuristics explained with simple theory.

[-]Pattern7y30

"If you can't provide me with a reason, I have to assume you're wrong."

One: I make a mathematical claim, while talking to a (smart) mathematician. They say "That doesn't * hold. **"

Two: I explain the proof/the conditions for the proof. The mathematician says, "Right, it holds under those conditions."

The only problem is, when I can't generate a proof. Then "One" can happen, but not "Two".

Unspoken:

*necessarily

**in all cases.

[-]David Joshua Sartor2y20

I no longer think it makes sense to clam up when you can't figure out how you originally came around to the view which you now hold

Either you can say "I came to this conclusion at some point, and I trust myself", or you should abandon the belief.

You don't need to know how or why your brain happened to contain the belief; you just need to know your own justification for believing it now. If you can't sufficiently justify your belief to yourself (even through things like "My-memory-of-myself-from-a-few-minutes-ago thinks it's likely" or "First-order intuition thinks it's likely"), you should abandon it (unless you're bad at this, which is probably not the case for most people who might try it).

From my perspective, I just had an original thought. If there's any writing about something related, or if someone else has something to add or subtract, I would probably very much like to read it.

[-]niplav5y20

Part 2 (and the dream algorithm) remind me of semi-decidability.

[-]Esteemed Estimator9mo10

One idea is to "call out" the pressure you feel. "I'm having trouble saying anything because I'm worried what you will think of me." This isn't always a good idea, but it can often work fairly well. Someone who is caving to incentives isn't very likely to say something like that, so it provides some evidence that you're being genuine.

If it provides evidence that I'm being genuine, someone who isn't being genuine would use this strategy and say the same thing. The other person knows this, so "calling out" shouldn't provide evidence that I'm being genuine. If a person who isn't being genuine wouldn't use this strategy in the first place because it has a net effect of undermining their credibility, then I also shouldn't be using this strategy since it has a net effect of undermining my credibility.

In practice this isn't always correct (maybe people who are being genuine are a bit more likely to worry about incentives) but this consideration has a similar shutting down effect on me because we are just pushing the incentive problem to a different layer.

[-]abramdemski9mo30

Fair. I think the analysis I was giving could be steel-manned as: pretenders are only boundedly sophisticated; they can't model the genuine mindset perfectly. So, saying what is actually on your mind (eg calling out the incentive issues which are making honesty difficult) can be a good strategy.

However, the "call out" strategy is not one I recall using very often; I think I wrote about it because other people have mentioned it, not because I've had sucess with it myself.

Thinking about it now, my main concerns are:
1. If the other person is being genuine, and I "call out" the perverse incentives that theoretically make genuine dialogue difficult in this circumstance, then the other person might stop being genuine due to perceiving me as not trusting them.

2. If the other person is not being genuine, then the "call out" strategy can backfire. For example, let's say some travel plans are dependent on me (maybe I am the friend who owns a car) and someone is trying to confirm that I am happy to do this. Instead of just confirming, which is what they want, I "call out" that I feel like I'd be disappointing everyone if I said no. If they're not genuinely concerned for my enthusiasm and instead disingenuously wanted me to make enthusiastic noises so that others didn't feel I was being taken advantage of, then they could manipulatively take advantage of my revealed fear of letting the group down, somehow.

[-]ProgramCrafter1y10

it would imply that you should ignore mathematical proofs if the person who came up with the proof only searched for positive proofs and wouldn't have spend time trying to prove the opposite. (This ties in with the very first section -- failing to find a proof is like remaining silent.)

I think this strategy becomes coherent if you update on the claim "fact X is true, here's its proof" being made? After all, there's lower probability that person publishes such claim if they fail to find the proof.

(Generalization: it doesn't matter much on what arguments you update, it matters more what you end up believing.)

[-]abramdemski1y23

The strategy "ignore the arguments" still goes wrong if they've published an incorrect mathematical proof, with a flaw you could have spotted. So it's still clearly wrong in general, even with this adjustment.

[-]Eigil Rischel6y10

This is great. A point which helped me understand number 6: If you ask someone "why do you believe X", since you're presumably going to update your probability of X upwards if they give a reason, you should update downwards if they don't give a reason. But you probably already updated upwards as soon as they said "I believe X", and there is no theorem which says this update has to be smaller than the latter update. So you can still end up with a higher or equal probability of X compared to where you were at the beginning of the conversation.

[-]Chris_Leong7y10

I view the issue of intellectual modesty much like the issue of anthropics. The only people who matter are those whose decisions are subjunctively linked to yours (it only starts getting complicated when you start asking whether you should be intellectually modest about your reasoning about intellectual modesty)

One issue with the clever arguer is that the persuasiveness of their arguments might have very little to do with how persuasive they should be, so attempting to work off expectations might fail.

[-]abramdemski7y30

I view the issue of intellectual modesty much like the issue of anthropics. The only people who matter are those whose decisions are subjunctively linked to yours (it only starts getting complicated when you start asking whether you should be intellectually modest about your reasoning about intellectual modesty)

I agree fairly strongly, but this seems far from the final word on the subject, to me.

One issue with the clever arguer is that the persuasiveness of their arguments might have very little to do with how persuasive they should be, so attempting to work off expectations might fail.

Ah. I take you to be saying that the quality of the clever arguer's argument can be high variance, since there is a good deal of chance in the quality of evidence cherry-picking is able to find. A good point. But, is it 'too high'? Do we want to do something (beyond the strategy I sketched in the post) to reduce variance?

[-]Chris_Leong7y10

I agree fairly strongly, but this seems far from the final word on the subject, to me.

Hmm, actually I think you're right and that it may be more complex than this.

Ah. I take you to be saying that the quality of the clever arguer's argument can be high variance, since there is a good deal of chance in the quality of evidence cherry-picking is able to find. A good point.

Exactly. There may only be a weak correlation between evidence and truth. And maybe you can do something with it or maybe it's better to focus on stronger signals instead.

[+][comment deleted]1y10

LESSWRONG
is fundraising!
LW

LESSWRONG
is fundraising!
LW

233

Mistakes with Conservation of Expected Evidence

233

233

1. "You can't predict that you'll update in a particular direction."

2. "Yes requires the possibility of no."

3. "But then what do you say to the Republican?"

4. "I can't credibly claim anything if there are incentives on my words."

5. "Your true reason screens off any other evidence your argument might include."

6. "If you can't provide me with a reason, I have to assume you're wrong."

Conclusion