The Geometric Importance of Side Payments

I don't actually think 'Alice gets half the money' is the fair allocation in your example.

Imagine Alice and Bob splitting a pile of 100 tokens, which either of them can exchange for $10M each. It seems obvious that the fair split here involves each of them ending up with $500M.

To say that the fair split in your example is for each player to end up with $500M is to place literally zero value on 'token-exchange rate', which seems unlikely to be the right resolution.

[-]StrivingForLegibility1y10

This might be a framing thing!

The background details I’d been imagining are that Alive and Bob were in essentially identical situations before their interaction, and it was just luck that Alice and Bob got the capabilities they did.

Alice and Bob have two ways to convert tokens into money, and I’d claim that any rational joint strategy involves only using Bob’s way. Alice's ability to convert tokens into pennies is a red herring that any rational group should ignore.

At that point, it's just a bargaining game over how to split the $1,000,000,000. And I claim that game is symmetric, since they’re both equally necessary for that surplus to come into existence.

If Bob had instead paid huge costs to create the ability to turn tokens into tens of millions of dollars, I totally think his costs should be repaid before splitting the remaining surplus fairly.

[-]Dagon1y20

I'm generally a fan of "maximize economic surplus and then split the benefits fairly".

I kind of agree, from an outside controller's perspective. Unfortunately, in the real universe, there is no outside controller, and there is no authority to make agents agree on either "maximum total surplus" or "fair". In the embedded agency model (agents are independent and imperfect, and there is no outside view), the best you can do is for each agent to maximize their own utility, by sharing information that usually improves overall surplus. Limiting it to economic/comparable values is convenient, but also very inaccurate for all known agents - utility is private and incomparable.

That said, side-payments are CRITICAL in finding solutions that do increase the total. It turns a lot of zero-sum games into positive-sum games where cooperation is the obvious equilibrium.

Note that your example shares some aspects of the ultimatum game - a purely rational Alice should not expect/demand more than $1.00, which is the maximum she could get from the best possible (for her) split without side payments. Only in a world where she knows and cares about Bob's situation and the cultural feelings of "fairness" indicate she "should" enjoy some of his situational benefits does she demand more.

[-]StrivingForLegibility1y*10

Limiting it to economic/comparable values is convenient, but also very inaccurate for all known agents - utility is private and incomparable.

I think modeling utility functions as private information makes a lot of sense! One of the claims I’m making in this post is that utility valuations can be elicited and therefore compared.

My go-to example of an honest mechanism is a second-price auction, which we know we can implement from within the universe. The bids serve as a credible signal of valuation, and if everyone follows their incentives they’ll bid honestly. The person that values the item the most is declared the winner, and economic surplus is maximized.

(Assuming some background facts, which aren't always true in practice, like everyone having enough money to express their preferences through bids. I used tokens in this example so that “willingness to pay” and “ability to pay” can always line up.)

We use the same technique when we talk about the gains from trade, which I think the Ultimatum game is intended to model. If a merchant values a shirt at $5, and I value it at $15, then there's $10 of surplus to be split if we can agree on a price in that range.

Bob values the tokens more than Alice does. We can tell because he can buy them from her at a price she's willing to accept. Side payments let us interpersonally compare valuations.

As I understand it, economic surplus isn't a subjective quantity. It's a measure of how much people would be willing to pay to go from the status quo to some better outcome. Which might start out as private information in people's heads, but there is an objective answer and we can elicit the information needed to compute and maximize it.

a purely rational Alice should not expect/demand more than $1.00, which is the maximum she could get from the best possible (for her) split without side payments.

I don't know of any results that suggest this should be true! My understanding of the classic analysis of the Ultimatum game is that if Bob makes a take-it-or-leave-it offer to Alice, where she would receive any tiny amount of money like $0.01, she should take it because $0.01 is better than $0.

My current take is that CDT-style thinking has crippled huge parts of economics and decision theory. The agreement of both parties is needed for this $1,000,000,000 of surplus to exist, if either walk away they both get nothing. The Ultimatum game is symmetric and the gains should be split symmetrically.

If we actually found ourselves in this situation, would we actually accept $1 out of $1 billion? Is that how we’d program a computer to handle this situation on our behalf? Is that the sort of reputation we’d want to be known for?

Using Weights as Side Payments

Money is a pretty reasonable type of side-payment, but we could also let agents transfer weights in the joint utility function among themselves. This is the approach Andrew Critch explores in his excellent paper on Negotiable Reinforcement Learning, in which a single RL agent is asked to balance between the interests of multiple principals with different beliefs. The overall agent is an

H (_, ϕ)

maximizer, where the Harsanyi weights shift according to Bayes rule, giving better predictors more weight in future decisions. The principals essentially bet about the next observation the RL agent will make, where the stakes are denominated in

ϕ

One direction Andrew points towards for future work is using some kind of bargaining among sub-agents to determine what the overall agent does. One way to model this is by swapping out

H

maximization for

G

maximization, defining each agent's baseline if no trade takes place, and enriching

F

to include side payments.

^{^}

This can also be framed as picking a point on the Pareto frontier, and then letting agents pay each other for small shifts from there. Bargaining over $D \times P$ combines these into a single step.

^{^}

How do I know utilities can be compared? Exactly because when Bob offers Alice $$ 5, 000, 000$ for one of her tokens, she says "yep that sounds good to me!" Money is the unit of caring.

LESSWRONG
LW

LESSWRONG
LW

8

The Geometric Importance of Side Payments

8

8

Using Weights as Side Payments