This is a linkpost for https://www.arxiv.org/pdf/2508.16245
The grain of truth problem asks how multiple agents having consistent mental models can reason and learn about each other - recursively.
With Marcus Hutter, Jan Leike (@janleike), and Jessica Taylor (@jessicata) , I have revisited Leike et al.'s paper "A Formal Solution to the Grain of Truth Problem" (AFSGOTP) which studies games between reflective AIXI agents and... further formalized it.
The result is "Limit-Computable Grains of Truth for Arbitrary Computable Extensive-Form (Un)Known Games" (LCGOTACEFUG) which perhaps could have been called "A Formal Formal Solution to the Grain of Truth Problem." Our new paper has some new results, including:
...but mostly we just expand on various definitions and algorithms that were previously left implicit in AFSGOTP. Basically, our new paper LCGOTACEFUG is a "journal version" of AFSGOTP. Except it is not published in a journal yet, because peer review is slow.
Who should read this?
Errata: The proof of Theorem 45 should not say "with O-access."