The Coding Theorem — A Link between Complexity and Probability

by Leon Lang
10th Aug 2025
10 min read
Comments
Matt Dellago:

Excellent! Great to have a cleanly formulated article to point people to!

jacob_drori:

In case anyone else didn't know what it meant for a set of binary strings to be "prefix-free", here's Claude's explanation, which I found helpful:
 

A set of binary strings is prefix-free if no string in the set is a prefix of any other string in the set.

Example:

  • ✅ Prefix-free: {0, 10, 110, 111}
  • ❌ Not prefix-free: {0, 01, 10} (because "0" is a prefix of "01")

Why does this matter for Turing machines?

The key is in how universal Turing machines work. A universal machine U simulates any other machine T by receiving input of the form i′q, where:

  • i′ = prefix-free encoding of machine T's description
  • q = the actual input to feed to machine T

U must parse this concatenated input to figure out: "Where does the machine description end and the actual input begin?"

Without prefix-free encoding: Given input "00101", U can't tell if the machine description is "0", "00", "001", etc. - there's ambiguity.

With prefix-free encoding: Once U reads a complete machine description, it knows exactly where that description ends and the input begins. No ambiguity, no delimiters needed.

This unique parseability is essential for universal machines to correctly simulate other machines, and it's crucial for Kolmogorov complexity theory where we need to measure program lengths precisely without parsing ambiguity.
 

Leon Lang:

Thanks for adding!

Not sure if you were aware, but in the glossary at the top-right of the post, there is also an explanation (albeit shorter) of "prefix-free code". I'm just mentioning this in case you weren't aware of the glossary functionality.

jacob_drori:

Ah I hadn't noticed that, very nice. Great post!


In this post, I give a self-contained treatment, including a proof, of the coding theorem.[1] Let x be a finite binary string and U a universal prefix-free Turing machine. The universal probability P(x) of x is defined as the probability that a program p sampled uniformly at random computes x when p is run via U. The coding theorem says that P(x) ≈ 2^{−K(x)} up to a multiplicative constant, where K(x) is the prefix-free Kolmogorov complexity of x.[2] Thus, more complex binary strings x become exponentially less likely and vice versa. This can be interpreted as a form of Occam's razor.

This statement has found some prior interest on LessWrong. Alexander Gietelink Oldenziel called the argument behind one inequality the padding argument, after it was independently rediscovered by Matthias Dellago and Lucius Bushnaq.[3] Nate Soares wrote a whole post about his rediscovery of some of these ideas, for the case in which the strings x themselves encode computable functions. A similar result was discussed by Joar Skalse in the context of the complexity of functions learned by neural networks. Note that my usage of the term padding is unrelated to a different type of padding argument.

The motivation of this post is to provide a didactic and elementary proof for readers, avoiding the typical machinery of "lower semicomputable discrete semimeasures", and to have a reference for future posts. In particular, Alexander and I are currently writing a post on the complexity of functions learned by neural networks, in which we would like to refer to this result.

Audience: This post assumes basic knowledge of Turing machines, the Church-Turing thesis, and Kolmogorov complexity for prefix-free Turing machines (often called prefix Turing machines; I find it clearer to append "free"). We briefly recall some basic definitions along the way.

Contributions: It is possible that all parts of the proofs can already be found in identical form in the literature, but I did not try to track them down. OpenAI's o3 found the idea for the dovetailing procedure. The proof of the efficient algorithmic Kraft coding in the appendix is mine. The entire post was written by me, except for the last paragraph of the following section, which was first drafted by GPT-5.

Acknowledgments: I thank Alexander Gietelink Oldenziel and Matthias Dellago for engaging discussions about the coding theorem and the padding argument. 

Formulating the Coding Theorem

Let {0,1} be our alphabet. Denote by {0,1}^* the set of binary strings of arbitrary finite length. A prefix-free Turing machine is a Turing machine that halts on a prefix-free set of programs p ∈ {0,1}^*. Let U be a universal prefix-free Turing machine, meaning that for any prefix-free Turing machine T, there exists i ∈ {0,1}^* such that for all q ∈ {0,1}^* we have T(q) = U(i′q),[4] where i′ is a chosen prefix-free encoding of i.

We construct P(x) = P_U(x), the universal a priori probability of x, as follows: First, by abuse of notation, consider the uniform distribution P on infinite binary sequences s ∈ {0,1}^∞, which are sampled by sampling each bit in sequence with probability 1/2. Now for a particular sampled sequence s, we observe whether it starts with a finite sequence p on which U halts, and if it does, then we write U′(s) = U(p), where U′ is an adapted version of U that operates on infinite sequences. There can only be one such p since U is prefix-free. Then, define the universal a priori probability of x ∈ {0,1}^* to be

P(x) := P_U(x) := P(U′^{−1}(x)) = P({s ∈ {0,1}^∞ ∣ U′(s) = x}),

i.e., the probability that U′ maps a randomly sampled sequence s ∈ {0,1}^∞ to x, i.e., that s starts with a program on which U halts with output x. As an aside, note that this is only a semiprobability mass function, since some infinite sequences s do not contain any prefix on which U halts, meaning that U′(s) is undefined. Thus, we have

∑_{x∈{0,1}^*} P(x) < 1.

This total probability is also called the halting probability; it is the probability that a randomly selected program p produces an output and halts. 

For a program p ∈ {0,1}^*, denote by l(p) its length. Then the distribution P also has an alternative and more common formulation:

Lemma 1 (Deadcode/padding argument). We have

P(x) = ∑_{p∈{0,1}^* : U(p)=x} 2^{−l(p)}.

Proof. We first claim

U′^{−1}(x) = {s ∈ {0,1}^∞ ∣ U′(s) = x} = ⋃_{p∈{0,1}^* : U(p)=x} {p} × {0,1}^∞.

The inclusion from left to right is clear: Every infinite s with U′(s) = x by definition starts with a finite program p such that U(p) = x. The other direction is the padding or deadcode argument: If a program p computes x (i.e., U(p) = x), then every extension of p to an infinite sequence s will compute x via U′: The extensions/paddings are dead code that is ignored by U′.

Thus, we obtain

P(x) = P(U′^{−1}(x)) = ∑_{p∈{0,1}^* : U(p)=x} P({p} × {0,1}^∞) = ∑_{p∈{0,1}^* : U(p)=x} 2^{−l(p)}.

In the last step, we used that the likelihood of sampling an infinite sequence that starts with p is exactly 2^{−l(p)}, since each specific bit is sampled with probability 1/2 under the uniform distribution. □

Now, let K(x) be the Kolmogorov complexity of a binary string x ∈ {0,1}^*, computed via the universal prefix-free Turing machine U, i.e., the length of the shortest program that computes x:

K(x) := min{ l(p) ∣ p ∈ {0,1}^*, U(p) = x }.

We can now formulate the coding theorem:

Theorem 2 (Coding Theorem). There is a constant C > 0 such that for all x ∈ {0,1}^*, we have

2^{−K(x)} ≤ P(x) ≤ C · 2^{−K(x)}.

In other words, up to at most a multiplicative constant, P(x) coincides with 2^{−K(x)}.

Proof (first inequality). Choose a shortest program p ∈ {0,1}^* with U(p) = x. Then we have l(p) = K(x), and we obtain 2^{−K(x)} = 2^{−l(p)} ≤ P(x) by Lemma 1. □
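
To see the bookkeeping concretely, here is a toy Python sanity check of Lemma 1 and the first inequality. It replaces the universal machine U by a hypothetical finite lookup table with a prefix-free domain, so that P(x) becomes a finite sum and K(x) a minimum over a finite set (for a genuine universal machine, P(x) is an infinite sum and K(x) is uncomputable):

```python
# Toy stand-in for U: a finite map from programs to outputs whose
# domain {0, 10, 110} is prefix-free. Purely illustrative.
TOY_MACHINE = {
    "0": "x",
    "10": "x",
    "110": "y",
}

def universal_probability(x):
    """P(x) = sum of 2^{-l(p)} over all programs p with U(p) = x (Lemma 1)."""
    return sum(2.0 ** -len(p) for p, out in TOY_MACHINE.items() if out == x)

def kolmogorov_complexity(x):
    """K(x) = length of a shortest program computing x."""
    return min(len(p) for p, out in TOY_MACHINE.items() if out == x)

for x in ["x", "y"]:
    P, K = universal_probability(x), kolmogorov_complexity(x)
    assert 2.0 ** -K <= P       # first inequality of the coding theorem
    print(x, P, 2.0 ** -K)      # x: P = 0.75 vs 0.5; y: P = 0.125 vs 0.125
```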

The rest of the post will be about proving the second inequality, which is much harder than the first. It roughly states that if x is likely under the universal probability, then its Kolmogorov complexity is small, i.e., x can be compressed. To prove this claim, we first reformulate it slightly. Up to a relabeling of C, and by taking the binary logarithm, the inequality is equivalent to

K(x) ≤ −log P(x) + C.

The rough outline is as follows: Our proof of this inequality constructs, for each string x, a short prefix-free program whose length is close to −log P(x), using an online version of Kraft's inequality: We simulate all programs in parallel (dovetailing) and maintain running lower bounds Q(x) of P(x). Whenever Q(x) first crosses a threshold 2^{−r}, we assign x a fresh codeword of length r + 1 via an algorithmic Kraft coding procedure that guarantees prefix-freeness without knowing future codelengths. That this is accomplished by an online algorithm ensures that the decoding procedure is computable by a fixed prefix-free Turing machine, adding only a constant C to the code lengths. Since r is eventually within a distance of 1 of −log P(x), the resulting shortest program length found by the procedure is at most −log P(x) + C + 2, establishing the desired bound with constant C + 2.

Algorithmic Kraft Coding

We need an algorithmic version of the Kraft inequality as a tool. If you are not interested in the details, feel free to only read the statement of Theorem 4 and then move on to the next section. 

For a binary string x ∈ {0,1}^*, we denote by 0.x the corresponding real number in base 2, with 0 ≤ 0.x < 1. Furthermore, D(x) := [0.x, 0.x + 2^{−l(x)}) denotes the corresponding half-open dyadic interval. We say that y is a prefix of x if x starts with the bits of y.

Lemma 3. Let x, y ∈ {0,1}^* be binary strings. Then D(x) ⊆ D(y) if and only if y is a prefix of x.

Proof. Assume y is a prefix of x. Then 

0.y ≤ 0.x < 0.x + 2^{−l(x)} ≤ 0.y + 2^{−l(y)}.

All inequalities except the last are trivial. The last inequality follows since adding 2^{−l(x)} to 0.x increments only the last bit of 0.x, which cannot get larger than incrementing an earlier bit — which is what happens by forming 0.y + 2^{−l(y)}, given that y is a prefix of x. Overall, this shows D(x) ⊆ D(y).

For the other direction, assume we have D(x) ⊆ D(y), i.e., the inequalities in the first equation of the proof all hold. If 0.y = 0.x, then the third inequality shows 2^{−l(x)} ≤ 2^{−l(y)}, which shows l(y) ≤ l(x), and 0.y = 0.x then implies x = y0^{l(x)−l(y)}. Thus, y is a prefix of x and we are done. Hence, we can assume 0.y < 0.x. For any natural number k, define x(k) and y(k) as the k-th bit in the binary expansions of 0.x and 0.y, respectively. Let k be the earliest index where x(k) > y(k) (which exists since 0.x > 0.y). We are done if we can show k ≥ l(y) + 1. And indeed, if this were not the case, then 0.x ≥ 0.y + 2^{−l(y)}, a contradiction to the assumed inequalities. Thus, k ≥ l(y) + 1, i.e., y is a prefix of x. □
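
Lemma 3 is easy to check exhaustively for short strings. Here is a small Python sketch that represents D(x) with exact rational endpoints and confirms that D(x) ⊆ D(y) holds exactly when y is a prefix of x:

```python
from fractions import Fraction
from itertools import product

def dyadic_interval(x):
    """D(x) = [0.x, 0.x + 2^{-l(x)}) as a pair of exact endpoints."""
    low = Fraction(int(x, 2), 2 ** len(x)) if x else Fraction(0)
    return low, low + Fraction(1, 2 ** len(x))

def contained(x, y):
    """Is D(x) a subset of D(y)?"""
    (a, b), (c, d) = dyadic_interval(x), dyadic_interval(y)
    return c <= a and b <= d

# Check Lemma 3 on all binary strings of length <= 6 (including the empty string):
strings = ["".join(bits) for n in range(7) for bits in product("01", repeat=n)]
for x in strings:
    for y in strings:
        assert contained(x, y) == x.startswith(y)
```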

Theorem 4 (Algorithmic Kraft Coding). Let l_1, l_2, … ∈ {1, 2, …} be a sequence of natural numbers. Assume they satisfy Kraft's inequality:

∑_{i=1}^∞ 2^{−l_i} ≤ 1.

Then there exist codewords C(k) ∈ {0,1}^* with the following two properties:

  1. C forms a prefix-free code, i.e., no C(k) is a prefix of any other C(l).
  2. The codelengths are given by l(C(k)) = l_k.

Furthermore, there is an algorithm which successively maps the lengths l_k to codewords C(k) without looking at future codelengths and without revising earlier codewords along the way.

Proof. Here, we present a simple proof that only achieves the slightly less efficient codelengths l(C(k)) = l_k + 1. In the appendix, we present a more complicated algorithm that achieves precisely the stated result.

The algorithm proceeds as follows: Upon receiving the sequence (l_1, …, l_k), form the number s_k = ∑_{i=1}^{k−1} 2^{−l_i}, and analogously s_{k+1}. Consider the interval I_k = [s_k, s_{k+1}). To construct the codeword C(k), go through all binary codewords x of length l_k + 1, lexicographically, until you find the very first one for which 0.x ≥ s_k. This is C(k).

Now we prove the first property (the second one holds by construction, though with lengths l(C(k)) = l_k + 1). Due to the minimality of C(k), we have 0.C(k) < s_k + 2^{−(l_k+1)}. Consequently, we have

0.C(k) + 2^{−l(C(k))} = 0.C(k) + 2^{−(l_k+1)} < s_k + 2^{−l_k} = s_{k+1}.

Consequently, the dyadic interval satisfies D(C(k)) ⊆ I_k. Since all I_k are pairwise disjoint, all D(C(k)) are pairwise disjoint, too. By Lemma 3, no C(k) is a prefix of any other. That proves property 1. Finally, notice that C(k) was chosen algorithmically, without knowledge of future codelengths and without revising any earlier codewords, finishing the proof. □
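
The procedure from this proof is simple enough to implement directly. Below is a Python sketch that assigns codewords online: next_codeword(l) returns the first string of length l + 1 (in lexicographic order) with 0.x ≥ s_k, where s_k is tracked exactly as a rational number. The class name and interface are my own choices:

```python
import math
from fractions import Fraction

class OnlineKraftCoder:
    """Online coding as in the simple proof of Theorem 4: the k-th requested
    length l_k receives a codeword of length l_k + 1, and earlier codewords
    are never revised."""

    def __init__(self):
        self.s = Fraction(0)  # s_k = sum of 2^{-l_i} over lengths seen so far

    def next_codeword(self, l):
        # The first codeword x of length l + 1 with 0.x >= s_k corresponds to
        # the smallest multiple of 2^{-(l+1)} that is >= s_k.
        numerator = math.ceil(self.s * 2 ** (l + 1))
        self.s += Fraction(1, 2 ** l)       # advance s_k to s_{k+1}
        return format(numerator, f"0{l + 1}b")

coder = OnlineKraftCoder()
codes = [coder.next_codeword(l) for l in (1, 2, 3, 3)]  # lengths satisfy Kraft
print(codes)  # ['00', '100', '1100', '1110'] -- lengths l_k + 1
assert all(not a.startswith(b) for a in codes for b in codes if a != b)
```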

Remark: The statement of the theorem is stronger than its typical formulation since I require there to be an algorithm that selects one codeword at a time. This is what makes my proof of the efficient codewords in the appendix so complicated. Other proofs I found in the literature assume that the codelengths can be ordered in a non-decreasing way, which isn't possible when receiving one codelength at a time. 

The converse of this theorem is also true, i.e., the codelengths of any prefix-free code satisfy Kraft's inequality; see the Wikipedia page.

A Proof of the Coding Theorem

We now prove the second inequality of Theorem 2. Recall that we want to prove

K(x) ≤ −log P(x) + C.

We now explain how to algorithmically code binary strings x with codes of length roughly −log P(x), which achieves this bound. Crucially, we cannot compute −log P(x) exactly and can only approximate it from below, so the target codelength for x keeps moving; consequently, we have to code each x ∈ {0,1}^* in several ways, since we can never be sure whether we have already found the correct codelength.

We proceed as follows: For all x ∈ {0,1}^*, initialize Q(x) := 0 as an approximation to P(x), and initialize an empty sequence () of codelengths. We approximate P(x) from below by dovetailing: Run all inputs p in parallel through the universal Turing machine U, one algorithmic step at a time (for example, in stage t, run each of the first t programs for one further step). Assume our list of codelengths already contains (l_1, …, l_{k−1}) at some algorithmic step. Whenever any program halts and computes U(p) = x for some binary string x, update the estimate Q(x):

Q(x) ← Q(x) + 2^{−l(p)}.

When this happens, check whether Q(x) just crossed a threshold 2^{−r} for some natural number r for the first time. If it did, then for the smallest such r (in case it crossed several at once), append l_k := r + 1 to the sequence (l_1, …, l_{k−1}) of codelengths to obtain (l_1, …, l_k), and assign to x a codeword of length l_k via the algorithm from Theorem 4.

For this to be well-defined, we need to check that the codelengths l_1, l_2, … together satisfy Kraft's inequality. Notice that each l_i is of the form r + 1 for an r with 2^{−r} ≤ Q(x) ≤ P(x) for some binary string x, and that no r appears twice for a given x, since we only update the list when a threshold is crossed for the first time. For a given x, let r_x be the minimal r that eventually appears in the algorithm and satisfies 2^{−r} ≤ Q(x). Thus, we have

∑_{i=1}^∞ 2^{−l_i} ≤ ∑_{x∈{0,1}^*} ∑_{k=1}^∞ 2^{−(r_x+k)} = ∑_{x∈{0,1}^*} 2^{−r_x} ∑_{k=1}^∞ 2^{−k} = ∑_{x∈{0,1}^*} 2^{−r_x} ≤ ∑_{x∈{0,1}^*} P(x) ≤ 1.

Thus, algorithmic Kraft coding can indeed be applied.
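
To make the procedure concrete, here is a heavily simplified Python sketch. Instead of actually dovetailing a universal machine, it uses a hypothetical finite table that records when each toy program halts and with which output; the online coder from the simple proof of Theorem 4 is inlined, so codewords come out one bit longer than l_k = r + 1, which only changes the constant:

```python
import math
from fractions import Fraction

# Hypothetical toy programs: program -> (stage at which it halts, output).
# A real implementation would instead step U on all programs in parallel.
TOY_PROGRAMS = {"0": (3, "x"), "10": (1, "x"), "110": (5, "y")}

s = Fraction(0)  # running Kraft sum of the inlined online coder

def next_codeword(l):
    """First codeword of length l + 1 with 0.x >= s (simple proof of Theorem 4)."""
    global s
    code = format(math.ceil(s * 2 ** (l + 1)), f"0{l + 1}b")
    s += Fraction(1, 2 ** l)
    return code

Q = {}          # running lower bounds Q(x) <= P(x)
assigned = {}   # per x: thresholds r that already received a codeword
codebook = {}   # x -> list of codewords; later entries are shorter

for stage in range(1, 11):                      # dovetailing stages
    for p, (halts_at, x) in TOY_PROGRAMS.items():
        if halts_at != stage:
            continue
        Q[x] = Q.get(x, Fraction(0)) + Fraction(1, 2 ** len(p))
        r = 0                                   # smallest r with 2^{-r} <= Q(x)
        while Fraction(1, 2 ** r) > Q[x]:
            r += 1
        if r not in assigned.setdefault(x, set()):
            assigned[x].add(r)
            codebook.setdefault(x, []).append(next_codeword(r + 1))

print(codebook)  # {'x': ['0000', '001'], 'y': ['01100']}
```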

Now, if you have a codeword p that arose from this procedure, you can decode p to x algorithmically as follows: Just run the same procedure again and observe which binary string x will be coded as p. Then, halt and output x. Since this procedure halts only on outputs of the above encoding, and these outputs form a prefix-free set by Theorem 4, the decoding procedure described here halts on a prefix-free set, and so it is given by a prefix-free Turing machine (where we make use of the Church-Turing thesis). Let T be this Turing machine. Thus, there exists a Turing machine index i ∈ {0,1}^* such that U(i′q) = T(q) for all q ∈ {0,1}^*. Consequently, we have U(i′p) = T(p) = x, and thus

K(x) ≤ l(i′p) = l(p) + l(i′) = l(p) + C,

where C = l(i′) is a constant independent of x and p.

Now, note that for a given x, the encoding procedure eventually reaches the minimal r_x, which then satisfies 2^{−r_x} ≤ P(x) ≤ 2^{−(r_x−1)}: otherwise, Q(x) would eventually cross the threshold 2^{−(r_x−1)}, contradicting the minimality of r_x. Thus, the shortest codeword p that the procedure assigns to x satisfies

l(p) = r_x + 1 = (r_x − 1) + 2 ≤ −log P(x) + 2.

Combining this with the previous inequality, we obtain

K(x) ≤ −log P(x) + C + 2,

finishing the proof of Theorem 2. □

Appendix: Proof of Efficient Algorithmic Kraft Coding

Here, we present a more complicated algorithm that achieves the efficient codelengths l(C(k)) = l_k for Theorem 4. Start with the set R_0 = {0, 10, 110, 1110, …} ⊆ {0,1}^* of codewords.[5] Intuitively, we think of them as a "resource" from which we choose the codewords of length l_k. Whenever this resource does not have a codeword of the correct size available, we fractally "break up" the longest available shorter string into longer strings in a prefix-free way, assign one of them, and add the new remaining "pieces" back into the resource in a way that does not create any "waste".

We now explain the details of how to do this. Initialize the set of "found codewords" F_0 as the empty set, F_0 = ∅. Note that R_k and F_k have the following properties for k = 0:

  1. R_k is prefix-free.
  2. F_k = {C(1), …, C(k)} ⊆ R_k, and the codelengths are given by l(C(i)) = l_i for i = 1, …, k.
  3. No codelength is represented more than once in R_k ∖ F_k.
  4. ∑_{r∈R_k} 2^{−l(r)} = 1.

In the course of this proof, we will progressively replace elements of R_k with extensions and build up the desired codewords F_k algorithmically, while preserving all four properties. The fact that R_k is prefix-free (property 1) and that F_k ⊆ R_k (property 2) then shows that F_k is prefix-free as well, which implies that the code C is prefix-free, establishing the result.

Let k = 1. As C(1), choose the unique r ∈ R_0 with l(r) = l_1. This exists since R_0 contains exactly one codeword of each codelength. Set R_1 := R_0 and F_1 := {C(1)}. Note that all four properties still hold for k = 1.

Now, assume by induction that R_{k−1} and F_{k−1} = {C(1), …, C(k−1)} are already constructed and that all four properties hold. If there is an r ∈ R_{k−1} ∖ F_{k−1} with l(r) = l_k, then simply choose C(k) = r, F_k = F_{k−1} ∪ {C(k)}, and R_k = R_{k−1}. The choice of r is unique by property 3. Then, trivially, all four properties are preserved.

Finally, consider the case that there is no r ∈ R_{k−1} ∖ F_{k−1} with l(r) = l_k. Define r ∈ R_{k−1} ∖ F_{k−1} as the longest element with l(r) < l_k. It is unique by property 3. It exists since otherwise R_{k−1} ∖ F_{k−1} does not represent any of the codelengths 1, 2, …, l_k, which by induction and properties 2, 4, and 3 (in that order) would imply

∑_{i=1}^{k−1} 2^{−l_i} = ∑_{r∈F_{k−1}} 2^{−l(r)} = 1 − ∑_{r∈R_{k−1}∖F_{k−1}} 2^{−l(r)} ≥ 1 − ∑_{i=l_k+1}^∞ 2^{−i} = 1 − 2^{−l_k}.

This would result in

∑_{i=1}^∞ 2^{−l_i} > ∑_{i=1}^{k} 2^{−l_i} = ∑_{i=1}^{k−1} 2^{−l_i} + 2^{−l_k} ≥ 1 − 2^{−l_k} + 2^{−l_k} = 1,

in contradiction to the assumed Kraft's inequality. Thus, r ∈ R_{k−1} ∖ F_{k−1} with l(r) < l_k (and maximally long with these properties) exists.

Now, set n := l_k − l(r) > 0. Define

R_k := (R_{k−1} ∖ {r}) ∪ {r0, r10, …, r1^{n−2}0, r1^{n−1}0, r1^n}.   (∗)

Define C(k) := r1^n and F_k := F_{k−1} ∪ {C(k)}.

We now show that all four properties remain true with these constructions. The newly added elements in R_k are clearly not prefixes of each other. Since they are all extensions of r ∈ R_{k−1} and R_{k−1} was prefix-free, the prefix-free property then follows for all of R_k, proving property 1. For property 2, note that F_{k−1} ⊆ R_{k−1}, and since r ∉ F_{k−1} by definition, we have F_{k−1} ⊆ R_k, too. C(k) ∈ R_k by construction, and so F_k ⊆ R_k. We have l(C(k)) = l(r) + l(1^n) = l(r) + n = l_k, proving property 2.

For property 3, note that

R_k ∖ F_k = ((R_{k−1} ∖ F_{k−1}) ∖ {r}) ∪ {r0, r10, …, r1^{n−2}0, r1^{n−1}0}.

Thus, codelength l(r) is removed from R_{k−1} ∖ F_{k−1}, and codelengths l(r) + 1, …, l(r) + n = l_k are added. By the maximality of r, and since the case assumption rules out any element of length l_k, none of the newly added codelengths were present in R_{k−1} ∖ F_{k−1}, and so no codelength is represented more than once in R_k ∖ F_k, proving property 3.

Property 4 follows from Equation (∗) and the induction assumption for k − 1:

∑_{r′∈R_k} 2^{−l(r′)} = ∑_{r′∈R_{k−1}} 2^{−l(r′)} − 2^{−l(r)} + 2^{−l(r)−n} + ∑_{i=1}^{n} 2^{−l(r)−i} = 1 + 2^{−l(r)} · (−1 + 2^{−n} + ∑_{i=1}^{n} 2^{−i}) = 1.

As we explained before, this concludes the proof. □
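
For completeness, here is a Python sketch of this appendix algorithm. It keeps a truncated copy of the resource R_0 = {0, 10, 110, …}, serves each requested length from the resource when possible, and otherwise splits the longest available shorter element as in Equation (∗). The truncation of R_0 and the example lengths are my own choices; as for the infinite sequences in Theorem 4, the running Kraft sums must leave room for future codelengths:

```python
def efficient_kraft_coder(lengths):
    """Assign prefix-free codewords with l(C(k)) exactly l_k, online,
    following the appendix construction. `lengths` must satisfy Kraft's
    inequality with strict running sums, as guaranteed for the infinite
    sequences in Theorem 4."""
    # Finite chunk of R_0 = {0, 10, 110, ...}; elements longer than
    # max(lengths) are never touched by the algorithm.
    R = {"1" * i + "0" for i in range(max(lengths))}
    F = []
    for l in lengths:
        free = R - set(F)
        exact = [r for r in free if len(r) == l]
        if exact:
            F.append(exact[0])      # unique by property 3
            continue
        # Split the longest free element r with l(r) < l_k, cf. Equation (*):
        r = max((s for s in free if len(s) < l), key=len)
        n = l - len(r)
        R = (R - {r}) | {r + "1" * i + "0" for i in range(n)} | {r + "1" * n}
        F.append(r + "1" * n)       # C(k) = r 1^n
    return F

codes = efficient_kraft_coder([2, 2, 3, 3])   # 1/4 + 1/4 + 1/8 + 1/8 < 1
print(codes)  # ['10', '01', '110', '001']
assert [len(c) for c in codes] == [2, 2, 3, 3]
assert all(not a.startswith(b) for a in codes for b in codes if a != b)
```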

  1. See Theorem 4.3.3 in Li and Vitányi's "An Introduction to Kolmogorov Complexity and Its Applications". This was first proved by L.A. Levin in 1974.

  2. Importantly, this result is only correct up to a logarithmic error for plain (also called descriptional) Kolmogorov complexity. Thus, the assumption that we work with prefix-free Turing machines is crucial. Thanks to Daniel Filan for pointing that out.

  3. The term "padding" is also used in this context in the book "An Introduction to Universal Artificial Intelligence" by Hutter, Quarel, and Catt.

  4. In particular, T halts on q whenever U halts on i′q.

  5. My guess is that one could also start with the resource R_0 = {ϵ}, consisting of only the empty binary string, which would possibly be somewhat cleaner than the choice I made.
