Fundamental Uncertainty: Chapter 3 - Why don't we agree on what's right?

[-]abramdemski3y40

One classic but unpopular argument for agreement is as follows: if two agents disagreed, they would be collectively dutch-bookable; a bookie could bet intermediate positions with both of them, and be guaranteed to make money.

This argument has the advantage of being very practical. The fallout is that two disagreeing agents should bet with each other to pick up the profits, rather than waiting for the bookie to come around.

More generally, if two agents can negotiate with each other to achieve Pareto-improvements, Critch shows that they will behave like one agent with one prior and a coherent utility function. Critch also suggests that we can interpret the result in terms of agents making bets with each other -- once all the fruitful bets have been made, the agents act like they have a common prior (which treats the two different priors as different hypotheses).

So the overall argument becomes: if people can negotiate to take pareto-improvements, then in the limit of this process, they'll behave as if they had a common prior and shared preferences.

A practical version of this argument might involve an axiom like, if you and I have different preferences, then the "right thing to do" is to take both preferences into account. You and I can eventually reach agreement about what is right-in-this-sense, by negotiating Pareto improvements. This looks something like preference utilitarianism; in the limit of everyone negotiating, a grand coalition is established in which "what's right" has reached full agreement between all participants. Any difference between our world and that world can be attributed to failures to take Pareto improvements, which we can think of as failure to approximate the ideal rationality.

This also involves behaving as if we also agree on matters of fact, since if we don't, we're Dutch-bookable, so we left money on the table and should negotiate another Pareto-improvement by betting on our disagreements.

Furthermore, everyone would agree on a common prior, in the sense that they would behave as if they were using a common prior.

Notice the relationship to the American Pragmatist definition of truth as what the scientific community would eventually agree on in the limit of investigation. "Right" becomes the limit of what everyone would agree on in the limit of negotiation.

Another argument for agreement which you haven't mentioned is Robin Hanson's Uncommon Priors Require Origin Disputes, which makes an argument I find quite fascinating, but will not try to summarize here.

[-]TAG3y10

if two agents disagreed, they would be collectively dutch-bookable;

Which is to say that if two agents disagree about something observable and quantifiable...

[-]abramdemski3y20

True, this is an important limitation which I glossed over.

We can do slightly better by including any bet which all participants think they can resolve later -- so for example, we can bet on total utilitarianism vs average utilitarianism if we think that we can eventually agree on the answer (at which point we would resolve the bet). However, this obviously still begs the question about Agreement, and so has a risk of never being resolved.

[-]abramdemski3y40

As we collect evidence about the world we update our beliefs, but we don't remember all the evidence. Even if we have photographic memories, childhood amnesia assures that by the time we reach the age of 3 or 4 we've forgotten things that happened to us as babies. Thus by the time we're young children we already have different prior beliefs and can't share all our evidence with each other to align on the same priors because we've forgotten it. Thus when we meet and try to agree, sometimes we can't because even if we have common knowledge about all the information each other has now, we didn't start from the same place and so may fail to reach agreement.

This part of your argument relies critically on the earlier mistake where you claimed that Aumann's theorem requires that we share all the evidence. Again, it does not - it only requires common knowledge of the specific posteriors about the question at hand, as opposed to common knowledge of all posterior beliefs.

[-]abramdemski3y40

We also don't meet one of the other requirements of Aumann's Agreement Theorem: we don't have the same prior beliefs. This is likely intuitively true to you, but it's worth proving. For us to all have the same prior beliefs we'd need to all be born with the same priors. This seems unlikely, but for the sake of argument let's suppose it's true that we are.

I want to put up a bit of a defense of the common prior assumption, although in reality I'm not so insistent on it.

First of all, we aren't ideal Bayesian agents, so what we are as a baby isn't necessarily what we should identify as "our prior". If we think of ourselves as trying to approximate an ideal Bayesian reasoner, then it seems like part of the project is constructing an ideal prior to start with. EG, many people like Solomonoff's prior. These people could be said to agree on a common prior in an important way. (Especially if they furthermore can agree on a UTM to use.)

But we can go further. Suppose that two people currently disagree about the Solomonoff prior. It's plausible that they have reasons for doing so, which they can discuss. This involves some question-begging, since it assumes the kind of convergence that we've set out to prove, but I am fine with resigning myself to illustrating the coherence of the pro-agreement camp rather than decisively arguing it. The point is that philosophical disagreements about priors can often be resolved, so even if two people can't initially agree on the Solomonoff prior, we might still expect convergence on that point after sufficient discussion.

In this picture, the disagreement is all about the approximation, and not at all about non-common priors. If we could approximate ideal rationality better, we could agree.

Another argument in favor of a common-prior assumption is that even if we model people as starting out with different priors, we expect people to have experienced actually quite a lot of the world before they come together to discuss some specific disagreement. In your writing, you treat the different data as a reason for diverging opinions -- but taking another perspective, we might argue that they've both experienced enough data that they should have broadly converged on a very large number of beliefs, EG about how things fall to the ground when unsupported, what things dissolve in water, how other humans tend to behave, et cetera.

We might broadly (imprecisely) argue that they've drawn different data from the same distribution, so after enough data, they should reach very similar conclusions.

Since "prior" is a relative term (every posterior acts as a prior for the next update), we could then argue that they've probably come to the current situation with very similar priors about that situation (that is, would have done if they'd been ideally rational bayesians the whole time) - even if they don't agree on, say the Solomonoff prior.

The practical implication of this would be something like: when disagreeing, people actually know enough facts about the world to come to agree, if only they could properly integrate all the information.

[-]TAG3y10

But we can go further. Suppose that two people currently disagree about the Solomonoff prior. It’s plausible that they have reasons for doing so, which they can discuss.

Sure, but where does that lead? If they discuss it using basically the same epistemology, htey might agree, and if they have fundamentally epistemology, they probably. They could have a discussion about their infra epistemology, but then the same dichotomy re-occurs a t a deeper level. There's no way of proving that two people who disagree can have a productive discussion that leads to agreement without assuming some measuer of pre-existing agreement at some level.

his involves some question-begging,

Yep.

but I am fine with resigning myself to illustrating the coherence of the pro-agreement camp

That doesn't imply the incoherence of the anti-agreement camp. Coherence is like that: it's a rather weak condition, particularly in the sense that it can't show there is a single coherent view. If you believe there is a single truth, you shouldn't treat coherence as the sole criterion of truth.

Another argument in favor of a common-prior assumption is that even if we model people as starting out with different priors, we expect people to have experienced actually quite a lot of the world before they come together to discuss some specific disagreement.

But that doesn't imply that they will converge without another question-begging assumption that they will interpret and weight the evidence similarly. One person regards the bible as evidence, another does not.

We might broadly (imprecisely) argue that they’ve drawn different data from the same distribution, so after enough data, they should reach very similar conclusions.

If one person always rejects another's "data" that need not happen. You can have an infinite amount of data that is all of one type. Infinite in quantity doesn't imply infinitely varied.

if only they could properly integrate all the information.

They need to agree on what counts as information (data, evidence) in the first place.

[-]abramdemski3y20

That doesn't imply the incoherence of the anti-agreement camp.

I basically think that agreement-bayes and non-agreement-bayes are two different models with various pros and cons. Both of them are high-error models in the sense that they model humans as an approximation of ideal rationality.

Coherence is like that: it's a rather weak condition, particularly in the sense that it can't show there is a single coherent view. If you believe there is a single truth, you shouldn't treat coherence as the sole criterion of truth.

I think this is reasoning too loosely about a broad category of theories. An individual coherent view can coherently think there's a unique truth. I mentioned in another comment somewhere that I think the best sort of coherence theory doesn't just accept anything that's coherent. For example, Bayesianism is usually classified as a coherence theory, with probabilistic compatibility of beliefs being a type of coherence. But Bayesian uncertainty about the truth doesn't itself imply that there are many truths.

[-]TAG3y*10

An individual coherent view can coherently think there’s a unique truth

Not if it includes meta-level reasoning about coherence. For the reasons I have already explained.

I mentioned in another comment somewhere that I think the best sort of coherence theory doesn’t just accept anything that’s coherent.

Well, I have been having to guess what "coherence" means throughout.

For example, Bayesianism is usually classified as a coherence theory, with probabilistic compatibility of beliefs being a type of coherence. But Bayesian uncertainty about the truth doesn’t itself imply that there are many truths.For example, Bayesianism is usually classified as a coherence theory, with probabilistic compatibility of beliefs being a type of coherence. But Bayesian uncertainty about the truth doesn’t itself imply that there are many truths.

Bayesians don't expect that there are multiple truths, but can't easily show that there are not. ETA:The claim is not that Bayesian lack of convergence comes from Bayesian probablism, the claim is that it comes from starting with radically different priors, and only accepting updates that are consistent with them --the usual mechanism of coherentist non-convergence.

[-]abramdemski3y20

Not if it includes meta-level reasoning about coherence. For the reasons I have already explained.

To put it simply: I don't get it. If meta-reasoning corrupts your object-level reasoning, you're probably doing meta-reasoning wrong.

Well, I have been having to guess what "coherence" means throughout.

Sorry. My quote you were originally responding to:

This involves some question-begging, since it assumes the kind of convergence that we've set out to prove, but I am fine with resigning myself to illustrating the coherence of the pro-agreement camp rather than decisively arguing it.

By 'coherence' here, I simply meant non-contradictory-ness. Of course I can't firmly establish that something is non-contradictory without some kind of consistency proof. What I meant was, in the paragraph in question, I'm only trying to sketch a possible view, to show some evidence that it can't be easily dismissed. I wasn't trying to discuss coherentism or invoke it in any way.

Bayesians don't expect that there are multiple truths, but can't easily show that there are not.

Not sure what you mean here.

Taking a step back from the details, it seems like what's going on here is that I'm suggesting there are multiple possible views (IE we can spell out abstract rationality to support the idea of Agreement or to deny it), and you're complaining about the idea of multiple possible views. Does this seem very roughly correct to you, or like a mischaracterization?

[-]TAG3y*10

To put it simply: I don’t get it. If meta-reasoning corrupts your object-level reasoning, you’re probably doing meta-reasoning wrong

Of course, I didn't say "corrupts ". If you don't engage in meta level reasoning , you won't know what your object level reasoning is capable of, for better or worse. So you don't get get to assume your object level reasoning is fine just because you've never thought about it. So meta level reasoning is revealing flaws, not creating them.

Taking a step back from the details, it seems like what’s going on here is that I’m suggesting there are multiple possible views (IE we can spell out abstract rationality to support the idea of Agreement or to deny it), and you’re complaining about the idea of multiple possible views.

What matters is whether there is at least one view that works, that solves epistemology. If what you mean by "possible" is some lower bar than working fully and achieving all the desiderata, that's not very interesting because everyone know there are multiple flawed theories.

If you can spell out an abstract rationality to achieve Agreement, and Completeness and Consistency, and. ... then by all means do so. I have not seen it done yet.

[-]abramdemski3y40

Aumann's Agreement Theorem—which proves that they will always agree…under special conditions. Those conditions are that they must have common prior beliefs—things they believed before they encountered any of the evidence they know that supports their beliefs—and they must share all the information they have with each other. If they do those two things, then they will be mathematically forced to agree about everything!

To nitpick, this misstates Aumann in several ways. (It's a nitpick because it's obvious that you aren't trying to be precise.)

Aumann does not require that they share all information with each other. This would make the result trivial. Instead, all that is required is common knowledge of each others posterior beliefs on the one question at hand - then they must agree on the probabilities of answers of that question.

Getting more into the weeds, Aumann also assumes partitional evidence, which means that the indistinguishability relationship between worlds (IE the relationship xRy saying you can't rule out being in world x, when in world y) is symmetric, transitive, and reflexive (so, defines a partition on worlds, commonly called information sets in game theory). However, some of these assumptions can be weakened and still preserve Aumann's theorem.

[-]Gordon Seidoh Worley3y20

Thanks! I should be a bit more careful here. I'm definitely glossing over a lot of details. My goal in the book is to roughly 80/20 things because I have a lot of material to cover and I don't have the time/energy to write a fully detailed account of everything, so I want to say a lot of things as pointers that are enough to point to key arguments/insights that I think matter on the path to talking about fundamental uncertainty and the inherently teleological nature of knowledge.

I view this as a book written for readers who can search for things so expect people to look stuff up for themselves if they want to know more. But I should still be careful and get the high level summary right, or at least approximately right.

[-]abramdemski3y20

Yep, makes sense.

As someone reading to try to engage with your views, the lack of precision is frustrating, since I don't know which choices are real vs didactic. To where I've read so far, I'm still feeling an introductory sense and wondering where it becomes less so.

[-]Gordon Seidoh Worley3y20

To some extent I expect the whole book to be introductory. My model is that the key people I need to reach are those who don't yet buy the key ideas, not those interested in diving into the finer details.

There's two sets of folks I'm trying to write to. My main audience is STEM folks who may not have engaged deeply with LW sequence type stuff and so have no version of these ideas (or have engaged with LW and have naive versions of the ideas). The second, smaller audience is LW-like folks who are for one reason or another some flavor of positivist because they only engaged enough layers of abstraction up with the ideas that positivism still seems reasonable.

[-]abramdemski3y20

Curious if you have work with either of the following properties:

You expect me to get something out of it by engaging with it;
You expect my comments to be able to engage with the "core" or "edge" of your thinking ("core" meaning foundational assumptions with high impact on the rest of your thinking; "edge" meaning the parts you are more actively working out), as opposed to useful mainly for didactic revisions / fixing details of presentation.

Also curious what you mean by "positivism" here - not because it's too vague a term, just because I'm curious how you would state it.

[-]Gordon Seidoh Worley3y20

For (1), my read is that you already get a lot of the core ideas I want people to understand, so possibly not. Maybe when I write chapter 8 there will be some interesting stuff there, since that will be roughly an expansion of this post to cover lots of misc things I think are important consequences or implications of the core ideas of the book.

For (2), I'm not quite sure where the edge of my thinking lies these days since I'm more in a phase of territory exploration rather than map drawing where I'm trying to get a bunch of data that will help me untangle things I can't yet point to cleanly. Best I can say is that I know I don't intuitively grasp my own embedded nature, even if I understand it theoretically, such that some sense that I am separate from the world permeates my ontology. I'm not really trying to figure anything out, though, just explain the bits I already grasp intuitively.

I think of positivism as the class of theories of truth that claim that the combination of logic and observation can lead to the discovery of universal ontology (universal in the sense that it's the same for everyone and independent of any observer or what they care for). There's a lot more I could say potentially about the most common positivist takes versus the most careful ones, but I'm not sure if there's a need to go into that here.

[-]Lone Pine3y30

In the argument about phoobs, I came to the conclusion that Bob is seeing animals that layman consider phoobs but scientists consider to be a different species.

(Also, if your phoob is red or blue, see a doctor!)

[-]Alexander Gietelink Oldenziel3y30

A failure of aumann agreement / "merging of opinions" can also occur when we interpret evidence differently

See https://www.cambridge.org/core/journals/review-of-symbolic-logic/article/abs/merging-of-opinions-and-probability-kinematics/99BC141C1CF64466861FC5EC042219C8

[-]Gordon Seidoh Worley1y20

Note to self: add in a reference to this book as a good intro to Bayesianism: https://www.lesswrong.com/posts/DcEThyBPZfJvC5tpp/book-review-everything-is-predictable-1

[-]TAG3y10

Disputes about morality and values aren't the only thing ones can't be solved by a Bayesian process of updating on evidence. Ontology can't either, because there are always different possible interpretations of evidence. People try to solve that sort of problem by appeal to, eg. simplicity principles, but there is a lot of disagreement about which one to use.

[-]Noosphere893y10

I'd say the biggest reason we disagree on morality is there is no mind-independent facts about it. This is the single biggest difference between other kinds of uncertainty about science/facts, and morality.

Or in other words, morality is at best a subjective enterprise, while understanding reality is objective.

[-]Gordon Seidoh Worley3y40

Oh don't worry, I'm going to argue in a later chapter that there are no mind independent facts at all. 😊

LESSWRONG
is fundraising!
LW

LESSWRONG
is fundraising!
LW

27

Fundamental Uncertainty: Chapter 3 - Why don't we agree on what's right?

27

27

Reaching Agreement

Disagreeing on Priors

Different Moral Foundations