Draft: Reasons to Use Informal Probabilities

This post should review the arguments in When (Not) To Use Probabilities.

Sometimes the conclusion you can derive from the made up numbers is worse than a directly intuited conclusion, since the latter is one step closer to native form of the request for answers from the brain.

[-]komponisto15y20

This post should review the arguments in When (Not) To Use Probabilities.

The lesson of that post is basically "don't let yourself be deceived into thinking your calibration is better than it is". But if you're poorly calibrated, better to know this, and giving explicit probability estimates may help you find this out.

Hiding your judgements doesn't make them better.

[-]Vladimir_Nesov15y40

Probabilities are easier to remember than informal notions of confidence.

You don't usually remember informal notions of confidence either, you regenerate them on request from the global model of the world in your mind.

[-]Vladimir_M15y10

Here's one challenge for your position. Take, for example, your first question. I don't think it makes any sense to talk about any probabilities there, since the question is incomplete to the point of meaninglessness. What sample of cars are we talking about, and under what exact circumstances? To which, I assume, you would answer that for everything unspecified, you should somehow make assumptions that are true with some probabilities and then use that to calculate the final probability of your answer, or estimate it just by feeling in some such way.

But how far would you take this principle? Suppose you receive this question in a bad handwriting, with one word totally smudged, so that it reads like "a [...] is white," or "a car is [...]." Would you be willing to assign a probability nevertheless, based on probabilistic guesses about the missing word? If yes, what about the case where two words are smudged, so the claim is "a [...] is [...]"? What about the ultimate case where the text is completely unreadable, so you have to guess what the question is?

(Note that we can arrive at your original question by starting with a well-defined problem with a computable exact answer, and then smudging parts of it so that we're left with "a car is white.")

[-]jimrandomh15y00

The way to deal with underspecified questions is to note the ambiguity, seek clarification if possible, and then if you still need an answer and can't get clarification, assume a probability distribution for each missing detail. Producing an answer is always possible, but the more ambiguities you had to do this for, the less useful the answer will be.

a [...] is white: 0.1 a car is [...]: 0.1 a [...] is [...]: 0.05

I wouldn't be willing to actually use those probabilities for much of anything, because as soon as I had a use for the answer, I'd surely also have found out what the actual question was, and be able to produce a much better answer.

[-]RobinZ15y10

A car is white. Sampling from the domains of cars I have seen on the road: 3%.
A car is a white, ten year old Ford with a dent on the rear right door Ditto: 10^-9.
A ten-mile car trip will involve a collision. 10^-7 or thereabouts.
A building is residential. Off the cuff, close to even odds.
A person is below the age of 20. 5%.
A word in a book contaains a typo. Any given word in a published book: 10^-8. Any given book: 10%.
Your arm will spontaneously transform into a blue tentacle today. Negligible, dominated by fundamental errors in understanding a la you-are-in-the-Matrix scenarios - 10^-20 is almost certainly too high, 1/10^^100 might be too low.
A purse contains exactly 71 coins. 0.1%.
76297 is a prime number. 10%.

Unfortunately, memories of degrees of confidence tend to come back badly distorted, unless they're crystallized somehow. Worse, they tend to come back consistently biased towards whatever would be judged correct now, which makes them useless or worse. Numbers crystallize those memories, making them usable and enabling you to retrace steps

Really? Mythbusters fans might disagree. :P

[-]rwallace15y00

Aside from the other problems that have been pointed out, I will also take exception to calling an order of magnitude a rough estimate. An order of magnitude would be a rough estimate where you have actual numeric data to work with. In cases where you have to just make up the numbers, an order of magnitude is high precision -- in some of these cases, extraordinarily high precision, far greater than you have any reason for claiming.

[-]khafra15y00

I'd say "an order of magnitude is a rough estimate" is a rough estimate. Remember, this is epistemic probability, so whether you

just think 76297 looks prime-ish and guess 9/10
mentally estimate the natural logarithm, quickly check whether 76297 is divisible by 2 or 3, and call it a 1/2 chance
can actually compute the Sieve of Eratosthenes with five nines of accuracy for it in ten seconds and call it a 1/10000 chance

You're correct, as long as you're not mis-reading your own degree of belief. To get into confidence about your degree of belief, I think we'd have to get into something like informal Dempster-Schafer theory--which, incidentally, I'd love to do.

[-]sixes_and_sevens15y00

My answers and the logic and assumptions behind them. I assumed an implicit "in the UK" after every question, because this is where all my knowledge comes from.

1) 1/22

Assuming roadworthy cars. Anyone who's played Motorway Snooker will know how tricky it is to get started. I estimate I'd need to see 22 cars before a white one came up.

2) 1/1*10^7

Also assuming roadworthy cars. If we included cars sitting in scrapheaps, it becomes significantly more probable.

3) 1/1000

I would assume most collisions happen within the first or last few miles of a journey, so this estimate is effectively the same as "a car trip of at least ten miles involves a collision", which is easier to work with.

4) 9/14

I'd guess nine out of every fourteen buildings is residential. Commercial and industrial buildings tend to be larger and less numerous than residential dwellings.

5) 1/7

Based on my knowledge of population distribution by age, which I'll admit isn't that great.

6) 1/90000

Assuming an average of 250,000 words a book, and two to three typos a book.

7) 1/1*10^56

A silly number for a silly event.

8) 1/27000000

71 is a very strange number of coins to have in any object you might call a purse. This was a bit of a (number of purses) (probability of having a weird-ass number of coins) (spread of weird-ass coins) job.

9) 25%.

It fails divisibility tests for 2, 3, 5 and 11. Divisibility by 7 isn't something I can reliably test in under ten seconds, but it doesn't look divisible by 7. That still leaves a lot of other potential prime factors, but not nearly as many.

[-]orangecat15y00

0.2 (I recall reading that white is the most common color, and I do see a bunch).
0.2 (p(10 year old Ford)=~0.001) (p(dent on rear right|10 year old Ford)=~0.01) =~ 2e-6, or 1 in 500,000.
Average person averages one 10-mile trip per day and gets into an accident once every 10-20 years. ~1 in 5000.
2/3, heavily dependent on definition of building
0.2
Average 1 typo per 10 books, 100k words/book, so 1 in a million.
Probability that I'll perceive it, 10^-20. Probability of it actually happening, around 10^-(10^100)
Seems like several standard deviations above average, maybe 1 in 1,000.
Not divisible by 2 or 3, if I had written this post I'd flip a coin to decide whether to use a prime or plausible imposter, so 0.5.

[-]CarlShulman15y60

Re #7, its past use as a discussion tool makes it more likely that people will create/simulate such situations as a joke in the future. The probability of "actually happening" thus seems far too low.

[-]sixes_and_sevens15y10

6.Average 1 typo per 10 books, 100k words/book, so 1 in a million.

You have a very high opinion of proof readers :-)

[-]NancyLebovitz15y00

10%
.001 %
.0001%
60%
30%
.5%
epsilon
1%
.01%

LESSWRONG
LW

LESSWRONG
LW

15

Draft: Reasons to Use Informal Probabilities

15

15