An Attempt at Logical Uncertainty

Hi Ben,

If you have not already seen them, you should probably read this and this, which are two ways to define a coherent probability measure on sentences. The first link is my construction, and the second is Abram Demski's. Abram and I are working on this problem in the MIRIxLosAngeles group.

You seem to be doing something different from what we are doing, in that our probability functions assign probability 1 to all provable statements, and try to answer the question of what to do with statements which are neither provable nor disprovable. (If you want your probability function to represent a probability measure on models of your logical system, then you can't just give them all probability 1/2.)

For both of our constructions, the probability assignments are approximable in the sense that we have a computation which updates probabilities and converges to the probability in our construction (but never equals it exactly) and both of our computations work by updating as you observe proofs of sentences.

In particular I have a procedure which (I only conjecture right now) converges to my probability assignment by noticing over and over again "I can prove that exactly one of A, B, and C is true, but their probabilities do not sum to one" and then shifting the probabilities of these sentences accordingly. (This is not written up in the post I linked to, but I can explain it more if you like.)
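A minimal sketch of one such shifting step, purely for illustration. The procedure itself is not written up, so the proportional rescaling used here, the function name `rebalance`, and the starting values are all assumptions rather than the actual construction.

```python
def rebalance(probs, exactly_one):
    """Rescale the probabilities of sentences that are provably
    'exactly one of these is true' so that they sum to 1.

    Hypothetical update rule: proportional rescaling. The real
    procedure may shift the probabilities differently.
    """
    total = sum(probs[s] for s in exactly_one)
    updated = dict(probs)
    if total == 0:
        # No mass to rescale, so spread it evenly (another assumption).
        for s in exactly_one:
            updated[s] = 1.0 / len(exactly_one)
    else:
        for s in exactly_one:
            updated[s] = probs[s] / total
    return updated


# Made-up starting assignment: every sentence begins at 1/2.
probs = {"A": 0.5, "B": 0.5, "C": 0.5, "D": 0.5}

# Suppose a proof of "exactly one of A, B, C" has just been observed.
probs = rebalance(probs, ["A", "B", "C"])
print(probs)
```

Repeating this step for every such proof as it is observed is the "over and over again" part; whether the repeated updates converge is exactly what is only conjectured above.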

If you do look at them, feel free to ask me questions about either construction, or, if you would like, I can talk to you about what we've done in a Skype call or something. I have a few conjectures related to my construction that I would like to prove.

Also, neither one of us pays any attention to proof length, so it would seem reasonable for you to ignore what we did and go your own direction. However, if you find yourself thinking mostly about how to assign probabilities to unprovable things, then it is worth looking at.

I do have one suggestion on notation. I suggest you reserve the term "probability measure" for something more strict than just assignments of probabilities to sentences. Say "probability function" or "probability assignment." I would say a "probability measure on sentences" should represent a way to choose a single sentence at random, and a "probability measure on models" should be a way to choose a single model of your axioms at random.

I am excited to see what you guys do!

[-]BenjaminFox11y30

Thanks for those links! They both look very interesting, and I'll read them in depth.

As you mention, you are doing something slightly different. You are assigning probability 1 to all the provable sentences, and then trying to investigate the unprovable ones. I, on the other hand, am taking the unprovable ones as just that, unprovable, and focusing on assigning probability mass to the provable ones.

I think the question of how to assign probability mass to provable, yet not yet proven, statements is the really important part of logical uncertainty. That's the part that is handwaved away in discussions of, say, UDT, and so is the part that I want to focus on.

About your suggestion on notation: yes, I was being slightly casual with notation there. By construction, it is a measure, I think, as it always gives probabilities in the range [0,1], and it obeys the law of the excluded middle. I didn't actually prove that the measure of multiple independent sentences is equal to the sum of the measures, but I think it follows... More work is needed on this. At the moment, this only gives probabilities to individual sentences, and not to groups of sentences, so technically that wouldn't work at all. The obvious next step is to try to extend it in order to be able to do this. But until that is done, you are correct that it is an abuse of notation to call it a measure.

[-]Scott Garrabrant11y50

I do not think it is a measure. If A, B, and C are all unprovable, undisprovable, but provably disjoint sentences, then your system cannot assign (A or B or C) a probability equal to P(A)+P(B)+P(C), because that sum must be 3/2.
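Written out, assuming the rule that every unprovable sentence receives probability 1/2 and that additivity is required for provably disjoint sentences:

```latex
\[
P(A \lor B \lor C) \;=\; P(A) + P(B) + P(C)
\;=\; \tfrac12 + \tfrac12 + \tfrac12 \;=\; \tfrac32 \;>\; 1 ,
\]
```

which no probability can equal, so additivity has to fail somewhere.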

I think that the thing that makes logical uncertainty hard is the fact that you can't just talk about probability measures (on models), because by definition a probability measure on models must assign probability 1 to all provable sentences.

[-]BenjaminFox11y30

That's a good point, and I concede that you are right. At the moment, it's more of a "probability assignment", as you said, rather than a probability measure. More work needs to be done on the subject, and hopefully we will progress along these lines at the MIRIx workshop.

[-]AlexMennen11y80

Lastly, for statements that are unprovable, we have to assign a probability of ½.

You can't do that. Let φ and ψ be sentences such that φ, ψ, and (φ ^ ψ) are all neither provable nor disprovable. If all of these sentences are given probability 1/2, then since φ and (φ ^ ψ) have the same probability, (φ ^ ~ψ) must have probability 0. That is, φ implies ψ with probability 1. By symmetry, ψ implies φ with probability 1, so φ and ψ are equivalent with probability 1. But there exist such pairs of sentences that are not equivalent.
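Spelled out step by step, assuming only finite additivity and the value 1/2 for each of the three sentences:

```latex
\begin{align*}
P(\varphi) &= P(\varphi \land \psi) + P(\varphi \land \lnot\psi)
  &&\Rightarrow\quad P(\varphi \land \lnot\psi) = \tfrac12 - \tfrac12 = 0,\\
P(\psi) &= P(\psi \land \varphi) + P(\psi \land \lnot\varphi)
  &&\Rightarrow\quad P(\psi \land \lnot\varphi) = \tfrac12 - \tfrac12 = 0,\\
P(\varphi \leftrightarrow \psi) &= 1 - P(\varphi \land \lnot\psi) - P(\psi \land \lnot\varphi) = 1.
\end{align*}
```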

Edit: Actually it looks like my argument doesn't apply in your system because it does not satisfy the axioms of a probability measure. For instance, if φ has a short proof that does not mention ψ anywhere, and ψ does not have a short proof, then (φ v ψ) will be provable. Its shortest proof will be the proof of φ with the rule of inference φ |- (φ v ψ) appended to the end, which is a longer proof, and thus P(φ v ψ) < P(φ), which is ridiculous. Another reason I don't like the idea of assigning probabilities based on proof length is that in order to compute the probability, you have to find a proof, and by that time, you may as well give probability 1 to the statement. The only reason I would want to assign a probability other than 1 to a provable statement is if I didn't already know that it was provable.
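A small sketch of the monotonicity failure described in the edit above. The function `prob_from_length` and the proof lengths are hypothetical stand-ins: the argument only needs the probability to be some strictly decreasing function of shortest proof length, and `2 ** -n` is chosen here purely for illustration, not taken from the post.

```python
def prob_from_length(n):
    """Hypothetical stand-in: any strictly decreasing function of the
    shortest proof length n would show the same problem."""
    return 2.0 ** -n


# Made-up lengths: phi has a shortest proof of 10 steps, and the
# shortest proof of (phi v psi) is that proof plus one inference step.
len_phi = 10
len_phi_or_psi = len_phi + 1

p_phi = prob_from_length(len_phi)
p_phi_or_psi = prob_from_length(len_phi_or_psi)

# A probability measure needs P(phi v psi) >= P(phi); here it is smaller.
print(p_phi_or_psi < p_phi)
```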

[-]Skeptityke11y50

To add to the hail of links, you might want to inspect the big official MIRI progress report on the problem here.

Also, though I know quite a bit less about this topic than the other people here (somebody correct me if I'm wrong), I'm a little suspicious of this distribution because I don't see any way to approximate the length of the shortest proof. Given an unproven mathematical statement for which you aren't sure whether it is true or false, how could you establish even a rough estimate of how hard it is to prove in the absence of actually trying to prove it?

[-]Manfred11y50

Hi Benjamin! Have some links you might already have read.

For an introduction to the topic, you can't go wrong with the scholarly literature: Gaifman's "Reasoning with limited resources and assigning probabilities to arithmetical statements."

I will also happily recommend my own posts on the subject, beginning at the very beginning, here.

For breadth, you might also check out Abram Demski's description, which is good to think about though unsatisfying to me. There's also some discussion on LessWrong, and a MIRI document uses a slightly modified version, somewhere.

Anyhow, I won't cotton to any method of assigning a logical probability that takes longer than just brute-forcing the right answer. For this particular problem I think a bottom-up approach is what you want to use.

[-]BenjaminFox11y40

I appreciate the links. I haven't read Gaifman's paper before, so I'll go ahead and read that.

Anyhow, I won't cotton to any method of assigning a logical probability that takes longer than just brute-forcing the right answer. For this particular problem I think a bottom-up approach is what you want to use.

I see the sentiment there, and that too is a valid approach. That said, after trying to use the bottom-up approach many times and failing, and after seeing others fail using bottom-up approaches, I think that if we can at least build a nonconstructive top-down theory, that would be a starting point. After all, Solomonoff Induction is completely top down, yet it's a very powerful theoretical tool.

[-]janos11y30

One nonconstructive (and wildly uncomputable) approach to the problem is this one: http://www.hutter1.net/publ/problogics.pdf

[-]Manfred11y20

after seeing others fail using bottom-up approaches

<.<

>.>

Well, care to explain what I did wrong?

[-]cousin_it11y30

Seconding Coscott's and Manfred's comments. I also had a post on this topic, which seems to solve one use case of logical uncertainty, namely, choosing between several logical counterfactuals that are all provably true.

[-]lukeprog11y10

In addition to the links provided by others, I also recommend looking at Paul's new report: "Non-omniscience, probabilistic inference, and metamathematics."

Edit: Oops, this was already linked from this comment.
