A very non-technical explanation of the basics of infra-Bayesianism

Some quick comments:

There is a reason why we shouldn't expect agents to always guess correctly the bits divisible by 4: it might be that guessing wrong causes some other bits to become more easily guessable. Moreover, the "causality" might be of any nature (e.g. physical or pure "logical").
For a summary of the results of IB, see this.
The claim that infra-Bayesian physicalism hasn't produced any results so far is not fair. It hasn't produced major results, but the original article does contain some theorems.

Okay, maybe I was somewhat unfair in saying there are no results. Sill, I think it's good to distinguish "internal results" and "external results". Take the example of complex analysis: we have many beautiful results about complex holomorphic functions, like Cauchy's integral formula. I call these internal results. But what made complex analysis so widely studied is that it could be used to produce some external results, like calculating the integral under the bell curve or proving the prime number theorem. These are questions that interested people even before holomorphic functions were invented, so proving them gave a legitimacy to the new complex analysis toolkit. Obviously, Cauchy's integral formula and the like are very useful too, as we couldn't reach the external results without understanding the toolkit itself better with the internal results. But my impression is that John was asking for an explanation of the external results, as they are more of an interest in an introductory post.

I count the work on Newcomb as an external result: "What learning process can lead to successfully learning to one-box in Newcomb's game?" is a natural question someone might ask without hearing about infra-Bayesianism, and I think IB gives a relatively natural framework for that (although I haven't looked into this deeply, and I don't know exactly how natural or hacky it is). On the other hand, from the linked results, I think the 1st, 4th and 5th are definitely internal results, I don't understand so can't comment of the 3rd, and the 2nd is Newcomb which I acknowledge. Similarly, I think IBP itself tries to answer an external question (formalizing naturalized induction), but I'm not convinced it succeeds in that, and I think the theorems are mostly internal results, and not something I would count as an external evidence. (I know less about this, so maybe I'm missing something).

In general, I don't deny IB has many internal results, which I acknowledge to be a necessary first step. But I think that John was looking for external results here, and in general my impression is that people seem to believe that there are more external results than there really are (did I mention the time I got a message from a group of young researchers asking if I thought "if it is currently feasible integrating multiple competing scientific theories into a single infra-Bayesian model"?) So I think it' useful to be more clear about that we don't have that many external results.

[-]Vanessa Kosoy3y*96

I partially agree, but the distinction between "internal" and "external" results is more fuzzy and complicated than you imply. Ultimately, it depends on the original problem you started with. For example, if you only care about prime numbers, then most results of complex analysis are "internal", with the exception of results that imply something about the distribution of prime numbers. However, if complex functions are a natural way to formalize the original problem, then the same results become "external".

In our case, the original problem is "creating a mathematical theory of intelligent agents". (Or rather, the problem is "solving AI alignment", or "preventing existential risk from AI", or "creating a flourishing future for human civilization", but let's suppose that the path from there to "creating a mathematical theory of intelligent agents" is already clear; in any case that's not related specifically to IB.) Infra-Bayesianism is supposed to be an actual ingredient in this theory of agents, not just some tool brought from the outside. In this sense, it already starts out as somewhat "external".

To give a concrete example, you said that results about IB multi-armed bandits are "internal". While I agree that these results are only useful as very simplistic toy models, they are potentially necessary steps towards stronger regret bounds in the future. At what point does it become "external"? Taking it to the extreme, I can imagine regret bounds so powerful, that they would serve as substantial evidence that an algorithm satisfying them is AGI or close to AGI. Would such a result still be "internal"?! Arguably not, because AGI algorithms are very pertinent to what we're interested in!

You can also take the position that any result without direct applications to existing, practical, economically competitive AI systems is "internal". In such case, I am comfortable with a research programme that only has "internal" results for a long time (although not everyone would agree). But this also doesn't seem to be your position, since you view results about Newcombian problems as "external".

[-]Chris_Leong3y50

How is it less of a hack if we are using measures instead of probability distributions? Also, how is "all losses are wiped out" less contrived than infinite utility?

[-]Søren Elverlin3y20

We discussed this post in the AISafety.com Reading Group, and have a few questions about it and infra-bayesianism:

The image on top of the sequence on Infra-Bayesianism shows a tree, which we interpret as a game-tree, with Murphy and an agent alternating in taking actions. Can we say anything about such a tree? E.g. Complexity, Pruning, etc?
There was some discussion about if an infra-bayesian agent could be Dutch-booked. Is this possible?
Your introduction makes no attempt to explain "convexity", which seems like a central part of Infra-Bayesianism. If it is central, what would be a good one-paragraph summary?
Will any sufficiently smart agent be infra-bayesian? To be precise, can you replace "Bayesian" with "Infra-Bayesian" in this article: https://arbital.com/p/optimized_agent_appears_coherent/ ?

[-]David Matolcsi3y60

No idea. I don't think it's computationally very tractable. If I understand correctly, l Vanessa hopes there will be computationally feasible approximations, but there wasn't much research into computational complexity yet, because there are more basic unsolved questions.
I'm pretty sure that no. An IB agent (with enough compute) plans for the long run and doesn't go into a chain of deals that leaves it worse of than not doing anything. In general, IB solves the "not exactly Bayesian expected utility maximizer but still can't be Dutch booked problem" by potentially refusing to take either side of a bet: if it has Knightian uncertainty about whether a probability is lower or higher than 50%, it will refuse to bet at even odds either for or against. This is something that humans actually often do, and I agree with Vanessa that a decision theory can be allowed to do that.
I had a paragraph about it:
"Here is where convex sets come in: The law constrains Murphy to choose the probability distribution of outcomes from a certain set in the space of probability distributions. Whatever the loss function is, the worst probability distribution Murphy can choose from the set is the same as if he could choose from the convex hull of the set. So we might as well start by saying that the law must be constraining Murphy to a convex set of probability distributions."
As far as I can tell, this is the reason behind considering convex sets. This makes convexity pretty central: laws are very central, and now we are assuming that every law is a convex set in the space of probability distributions.
Vanessa said that her guess is yes. In the terms of the linked Arbital article, IB is intended to be an example of "There could be some superior alternative to probability theory and decision theory that is Bayesian-incoherent". Personally, I don't know, I think that the article's "A cognitively powerful agent might not be sufficiently optimized" possibility feels more likely in the current paradigm, I can absolutely imagine the first AIs to become a world-ending threat not being very coherent. Also, IB is just an ideal, real-world smart agents will be at best approximations of infra-Bayesian agents (same holds for Bayesianism). Vanessa's guess is that understanding IB better will still give us useful insights into these real-world models if we view them as IB approximations, I'm pretty doubtful, but maybe. Also, I feel that the problem I write about in my post on the monotonicity principle points at some deeper problem in IB which makes me doubtful whether sufficiently optimized agents will actually use (approximations of) the minimax thinking prescribed by IB.

[-]Lao Mein3y20

Regarding 4: given that infra-Bayesianism is maximally paranoid, shouldn't it have lower performance relative to decision-making theories like regular Bayes under many non-adversarial conditions? If the training set does not contain many instances of adversarial information, then shouldn't we expect agents to adopt Bayes instead of infra-Bayes?

[-]David Matolcsi3y40

I think Vanessa would argue that "Bayesianism" is not really an option. The non-realizability problem in Bayesianism is not just some weird special case, but the normal state of things: Bayesianism assumes that we have hypotheses fully describing the world, which we very definitely don't have in real life. IB tries to be less demanding, and the laws in the agent's hypothesis class don't necessarily need to be that detailed. I am relatively skeptical of this, and I believe that for an IB agent to work well, the laws in its hypothesis class probably also need to be unfeasibly detailed. So both "adopting Bayes" and "adopting infra-Bayes" fully is impossible. We probably won't have such a nice mathematical model for the messy decision process a superintelligence actually adopts, the question is whether thinking about it as an approximation of Bayes or infra-Bayes gives us a more clear picture. It's a hard question, and IB has an advantage in that the laws need to be less detailed, and a disadvantage that I think you are right about it being unnecessarily paranoid. My personal guess is that nothing besides the basic insight of Bayesianism ("the agent seems to update on evidence, sort of following Bayes-rule") will be actually useful in understanding the way an AI will think.

[-]martinkunev2y10

"the agent guesses the next bits randomly. It observes that it sometimes succeeds, something that wouldn't happen if Murphy was totally unconstrained"

Do we assume Murphy knows how the random numbers are generated? What justifies this?

LESSWRONG
LW

LESSWRONG
LW

62

A very non-technical explanation of the basics of infra-Bayesianism

62

62

Introduction

Classical learning theory

The non-realizability problem

Infra-Bayesianism

The results of infra-Bayesianism

Newcomb's problem

Other results