Context:I (Daniel C) have been working with Aram Ebtekar on various directions in his work on algorithmic thermodynamics and the causal arrow of time. This post explores some implications of algorithmic thermodynamics on the concept of optimization. All mistakes are my (Daniel's) own. A typical picture of optimization is when...
This post is a comment on Natural Latents: Latent Variables Stable Across Ontologies by John Wentworth and David Lorell. It assumes some familiarity with that work and does not attempt to explain it. Instead, I present an alternative proof that was developed as an exercise to aid my own understanding....
This is a link post for two papers that came out today: * Inoculation Prompting: Eliciting traits from LLMs during training can suppress them at test-time (Tan et al.) * Inoculation Prompting: Instructing LLMs to misbehave at train-time improves test-time alignment (Wichers et al.) These papers both study the following...
Debates on which decision theory (EDT/CDT/UDT/FDT/etc.) is "rational" seem to revolve around how one should model "free will". Do we optimize individual actions or entire policies? Do we model our choice as an evidential update or a causal intervention? Physics tells us that the Universe evolves deterministically and reversibly, so...
While I'm intrigued by the idea of acausal trading, I confess that so far I fail to see how they make sense in practice. Here I share my (unpolished) musings, in the hopes that someone can point me to a stronger (mathematically rigorous?) defense of the idea. Specifically, I've heard...
Direct PDF link for non-subscribers > Information theory must precede probability theory, and not be based on it. By the > very essence of this discipline, the foundations of information theory have a finite combinatorial character. > > - Andrey Kolmogorov Many alignment researchers borrow intuitions from thermodynamics: entropy relates...