Aram Ebtekar — LessWrong

Third Symposium on AIT & ML: AI Safety Applications

We are organizing a symposium on the intersection of algorithmic information theory and machine learning July 27-29th at Oxford! See the announcement here for details: https://sites.google.com/site/boumedienehamzi/third-symposium-on-machine-learning-and-algorithmic-information-theory The third iteration of the symposium is particularly focused on applications of AIT to the theory and practice of AI safety. AIXI has long...

Apr 2522

Algorithmic thermodynamics and three types of optimization

Context:I (Daniel C) have been working with Aram Ebtekar on various directions in his work on algorithmic thermodynamics and the causal arrow of time. This post explores some implications of algorithmic thermodynamics on the concept of optimization. All mistakes are my (Daniel's) own. A typical picture of optimization is when...

Dec 8, 202511

Stability of natural latents in information theoretic terms

This post is a comment on Natural Latents: Latent Variables Stable Across Ontologies by John Wentworth and David Lorell. It assumes some familiarity with that work and does not attempt to explain it. Instead, I present an alternative proof that was developed as an exercise to aid my own understanding....

Oct 26, 202536

Inoculation prompting: Instructing models to misbehave at train-time can improve run-time behavior

This is a link post for two papers that came out today: * Inoculation Prompting: Eliciting traits from LLMs during training can suppress them at test-time (Tan et al.) * Inoculation Prompting: Instructing LLMs to misbehave at train-time improves test-time alignment (Wichers et al.) These papers both study the following...

Oct 8, 2025176

Time's arrow => decision theory

Debates on which decision theory (EDT/CDT/UDT/FDT/etc.) is "rational" seem to revolve around how one should model "free will". Do we optimize individual actions or entire policies? Do we model our choice as an evidential update or a causal intervention? Physics tells us that the Universe evolves deterministically and reversibly, so...

Sep 2, 202533

Help me understand: how do multiverse acausal trades work?

While I'm intrigued by the idea of acausal trading, I confess that so far I fail to see how they make sense in practice. Here I share my (unpolished) musings, in the hopes that someone can point me to a stronger (mathematically rigorous?) defense of the idea. Specifically, I've heard...

Sep 1, 202546

Thermodynamic entropy = Kolmogorov complexity

Direct PDF link for non-subscribers > Information theory must precede probability theory, and not be based on it. By the > very essence of this discipline, the foundations of information theory have a finite combinatorial character. > > - Andrey Kolmogorov Many alignment researchers borrow intuitions from thermodynamics: entropy relates...

Feb 17, 202577