Learning the basics of probability & beliefs

by tomme1 min read31st Mar 201211 comments

6

Personal Blog

Let's say that I believe that the sky is green.

1) How can I know whether this belief is true?

2) How can I assign a probability to it to test its degree of truthfulness?

3) How can I update this belief?

Thank you.

11 comments, sorted by Highlighting new comments since Today at 2:41 PM
New Comment

1) How can I know whether this belief is true?

Expose it to tests. For example, you might stick your head out a window and look up. The theory that the sky is green strongly predicts that you should see green and only very weakly allows for you to see anything else (your eyes may occasionally play tricks on you, perhaps you are looking close to a sunset, etc.).

2) How can I assign a probability to it to test its degree of truthfulness? 3) How can I update this belief?

If you knew absolutely nothing about skies other than that they were some colour, you would start with a symmetrical prior distributed across a division of colour space. You would then update your belief every time you come into contact with entangled information. For example, every time you look up at the sky and see a colour, your posterior probability that the sky is that colour goes up at the expense of alternative colours. Observing other people describe what they see when they look at the sky and learning about how vision works and the chemical composition of the sky are also good examples of evidence you could use.

In practise, manually updating every belief you have all the time is far to arduous, and most people collect large amounts of beliefs and information prior to learning much about statistics anyway. Because of this, the prior probability you assign to your beliefs will often have to be a quick approximation.

True beliefs reliably control anticipation. If you build a sky-detector that bases its decisions on color discrimination, you should anticipate that the detector's decisions will be appropriate to the extent that your theory of the sky's color is correct.

Let's imagine that the sky-detector's function is to orient a water spray toward the sky, so as to maximize the area sprayed. (Because "sky" is what's up when outdoors.)

Place the device on various surfaces. The color of sky is the color of whatever makes the device most confused, spraying in random directions.

While you make an interesting case for testing my belief, I do not know how to build a "sky-detector". So I still am oblivious as to whether my belief is true or not.

I do not know how to build a "sky-detector"

Get a camera, or use the two you were born with.

I still am oblivious as to whether my belief is true or not.

I guess the first step is that you have to want to test your belief. :)

To beg the question (which I think I know the answer to), in this context, how do I trust a camera, or my eyes, to give me a true result?

"Green" or other color words are already in your vocabulary of trusted primitives, otherwise the question couldn't even be spoken. "Green" is (to a good enough approximation) a particular pattern of activation of cone cells in your retina, or (an approximation of an approximation, but still good enough) a point in the color space computed by a camera's CCD photoreceptors.

The query at hand is whether the same cone cells are firing (or whether the camera's CCDs return the same value) when looking straight up when out of doors on a cloudless day, as they do when looking at whatever type of object results in the color judgement "green" (for instance, grass or a tree).

That's the content of the belief. The original questions are answered from that basis.

Question 2 (of which question 1 is a special case) consists of evaluating the fit between the above and the observations we can collect. So, we point cameras (or eyes) at trees, then at the sky, and compare the results.

Question 3 entails pointing cameras at objects of different colors and picking to describe "sky" the same color word that we use to describe objects that have roughly the same values as the sky.

(Though question 3 is more complicated in its most general case - for instance to describe what happens when we start out with no color words whatsoever and learn them from experience. We're still updating on the results of a procedure that's very similar, but we do it without having yet formed the category "color word".)

Thanks for writing this explanation.

I have tested my belief using the built-in "sky-detector"(my eyes) and I can tell that it's false, for the sky is clearly blue with a tint of white. Still, there are some instances where my "sky-detector" could be faulty(e.g., eye disorders, neurological conditions), but since other people's "sky-detectors" and machines have confirmed my belief, I guess it's true.

But how true? Is there, say, an algorithm I can use to assign numerical values to the probability of my belief? Assuming that there is such an 'algorithm', how can I use it to compare my initial belief to the belief I now have(i.e., the sky is blue with a tint of white)?

It's kind of simple: Does it seem that the sky is blue? Then accept it as a temporary truth and move on. Learn about physics, colors, human brain, et cetera. Every Friday review your beliefs about the color of the sky, and update them based on your current knowledge.

I suppose that there are infinitely many such algorithms!

Again, imagine building some sort of robot to keep your lawn watered. You could program it with explicit hard-coded values for "blue", or you could equip it with some subprogram to "learn" the color of the sky. So the robot makes a note of the average color in the direction its nozzle its pointed at and (let's suppose) it receives feedback in the form of how well the lawn has been watered on each such occasion.

The numerical value of the "belief" that the sky is a certain color, then, is simply the value of the "rightness" of certain colors to spray water at (it's a probability distribution of "sky-ness" over the color space). The robot updates this distribution each time it receives some feedback, and there are any number of ways you could program that.

The laws of probability dictate certain constraints on this algorithm, for instance that you can't associate 0% or 100% probability to the proposition "the sky is blue", on pain of becoming unable to ever update away from these positions through a Bayesian update, if the robot finds itself in circumstances where the sky is a different color. (Though in the situation of the linked article, the lawn-watering robot wouldn't be much use at all.)

There isn't a unique, exact algorithm for such calculations, because there isn't a unique, exact meaning for the words "sky" and for our color words, independent of what we use the concepts of sky and color for. There are as many different algorithms as there are purposes for the programs that embody them; you yourself embody an unknown algorithm resulting from your genetic and personal history.

But, it seems, there are some constraints on any such algorithm, unique in the sense that they arise from the mathematical structure of the universe.

This question is better suited to an open thread.

I don't think it was too small for a discussion post, though it feels close to a minimum. Going over the basics is a good thing.