Logical thermodynamics: towards a theory of self-trusting uncertain reasoning

[-]Manfred12y20

A general problem with big integrals in your definition of the abstract function P: Suppose that C is a statement whose shortest proof takes longer to write out than your agent has time. When asked for P(C), your abstract function P should not assign a number that your agent is physically incapable of assigning.

More concrete problem: since your energies are all greater than 0, proofs (evidence) with long length get weighted greater (are lower energy) than short proofs.

[-]Squark12y20

Evidence with long length is weighted less since it contributes less to the energy.

When computing probabilities at finite temperature to finite precision, it should be possible to ignore long evidence (at least evidence which is long wrt the given sentence). If the shortest evidence for a sentence is very long, it means its probability is "marginal" in some sense (since the total energy depends weakly on the value of the probability field at this statement). For sentences which cannot be decomposed into simpler sentences using propositional calculus operations, "marginal" probably means close to 1/2.

[-]Manfred12y00

Oh, whoops. Yeah, my bad.

[-]Luke_A_Somers12y20

Related but simpler concept - I remember thinking about assigning a temperature on a prior distribution, but it didn't normalize if it wasn't penalizing longer hypotheses even harder than usual.

Construction

Define a probability field to be a function p : {sentences} -> [0, 1] satisfying the following conditions:

If φ is a tautology in propositional calculus (e.g. φ = ψ or not-ψ) then p(φ) = 1

For all φ: p(not-φ) = 1 - p(φ)

For all φ, ψ: P(φ) = P(φ and ψ) + P(φ and not-ψ)

Probability fields are a convex set: a convex linear combination of probability fields is a probability field. Essentially, probability fields are probability measures in the space of truth assignments consistent w.r.t. propositional calculus.

We define the energy of a probability field p to be E(p) := Σ_φ Σ_v 2^-l(v)E_φ,v(p(φ)). Here v are pieces of evidence as defined in the previous post, E_φ,v are their associated energy functions and l(v) is the length of (the encoding of) v. We assume that the encoding of v contains the encoding of the sentence φ for which it is evidence and E_φ,v(p(φ)) := 0 for all φ except the relevant one. Note that the associated energy functions are constructed in the same way as in the previous post, however they are not the same because of the self-referential nature of the construction: it refers to final probability assignment.

The final probability assignment is defined to be

P(φ) = Integral_p [e^-E(p)/Tp(φ)] / Integral_p e^-E(p)/T

Here T >= 0 is a parameter representing the magnitude of logical uncertainty. The integral is infinite-dimensional so it's not obviously well-defined. However, I suspect it can be defined by truncating to a finite set of statements and taking a limit wrt this set. In the limit T -> 0, the expression should correspond to computing the centroid of the set of minima of E (which is convex because E is convex).

Remarks

Obviously this construction is merely a sketch and work is required to show that

The infinite-dimensional integrals are well-defined
The resulting probability assignment is coherent for consistent theories and T = 0
The system overcomes the Loebian obstacle for tiling agents in some formal sense

For practical application to AI we'd like an efficient way to evaluate these probabilities. Since the form of the probabilities is analogous to statistical physics, it is suggestive to use similarly inspired Monte Carlo algorithms.

LESSWRONG
is fundraising!
LW

LESSWRONG
is fundraising!
LW

10

Logical thermodynamics: towards a theory of self-trusting uncertain reasoning

10

10

Construction

Remarks