The Generative Crash: Why RLHF Fails the Free Energy Principle
LessWrong-Specific Foreword Epistemic Status Highly confident in the neurobiological model and the UX framework. Confident in the application to AI alignment. Context and Structural Formatting Note I am a Human Factors psychologist, engineer, and systems architect. This piece formalizes the mechanical failure state of the human Theory of Mind when processing generative media, bridging active inference with Cooperative Inverse Reinforcement Learning. The essay deploys a functional UX protocol called The Ghost Scale. It relies on precise CSS opacity limits to signal intent density, structurally altering the metabolic expenditure of the reader. Because the native editor here strips custom CSS, reading the plaintext below physically breaks the biological friction-reduction framework the interface is designed to interact with. To allow you to properly evaluate the cognitive load reduction and view the visual affordances as intended, the interactive version is hosted here: abrahamhaskins.org/art. The raw text is provided below to facilitate direct critique of the IRL convergence theorem, the psychophysics, and the alignment implications. Moderation Note on Generated Content: Approximately 15% of this post is AI-synthesized text, almost entirely confined to the formal appendix. I am aware this violates standard community guidelines regarding generative output. This inclusion is a functional requirement, not a shortcut. The appendix is explicitly flagged, logically led into, and serves as a live demonstration of the UX protocol defined in the essay. The preceding 85% of the theoretical framework is entirely human-authored, with the exception of clearly visible, explicitly declared, and clearly marked short segments used primarily for demonstration of the framework. If possible, please read the interactive version instead of this reduced-fidelity local copy to allow yourself to engage with and test the AI affordances proposed. Art: A Unifying Model Table of Content