Diffractor - LessWrong

That original post lays out UDT1.0, I don't see anything about precomputing the optimal policy within it. The UDT1.1 fix of optimizing the global policy instead of figuring out the best thing to do on the fly, was first presented here, note that the 1.1 post that I linked came chronologically after the post you linked.

Unifying Bargaining Notions (2/2)

Diffractor1y20

I'd strongly agree with that, mainly because, while points with this property exist, they are not necessarily unique. The non-uniqueness is a big issue.

Unifying Bargaining Notions (2/2)

Diffractor1y20

It was a typo! And it has been fixed.

Unifying Bargaining Notions (1/2)

Diffractor1y20

It is because beach/beach is the surplus-maximizing result. Any Pareto-optimal bargaining solution where money is involved will involve the surplus-maximizing result being played, and a side payment occuring.

Open Problem in Voting Theory

Diffractor1y42

I have a reduction of this problem to a (hopefully) simpler problem. First up, establish the notation used.

[n] refers to the set . $n$ is the number of candidates. Use $C$ as an abbreviation for the space $Δ [n]$ , it's the space of probability distributions over the candidates. View $C$ as embedded in $R^{n - 1}$ , and set the origin at the center of $C$ .

At this point, we can note that we can biject the following:
1: Functions of type $[n] \to [0, 1]$
2: Affine functions of type $C \to [0, 1]$
3: Functions of the form $λ x . ⟨ a, x ⟩ + c$ , where $x, a \in R^{n - 1}$ , and and $c \in R$ , and everything's suitably set so that these functions are bounded in $[0, 1]$ over $C$ . (basically, we extend our affine function to the entire space with the Hahn-Banach theorem, and use that every affine function can be written as a linear function plus a constant) We can reexpress our distribution $V$ over utility functions as a distribution over these normal vectors $a$ .

Now, we can reexpress the conjecture as follows. Is it the case that there exists a $μ : Δ C$ s.t. for all $ν : Δ C$ , we have

E_{x, y, a \sim μ \times ν \times V} [sgn (⟨ a, x - y ⟩)] \geq 0

Where $sgn$ is the function that's -1 if the quantity is negative, 0 if 0, and 1 if the quantity is positive. To see the equivalence to the original formulation, we can rewrite things as

E_{x, y, a \sim μ \times ν \times V} [1_{⟨ a, x ⟩ > ⟨ a, y ⟩} - 1_{⟨ a, y ⟩ > ⟨ a, x ⟩}] \geq 0

Where the bold 1 is an indicator function. And split up the expectation and realize that this is a probability, so we get

P_{x, y, a \sim μ \times ν \times V} [⟨ a, x ⟩ > ⟨ a, y ⟩] - P_{x, y, a \sim μ \times ν \times V} [⟨ a, y ⟩ > ⟨ a, x ⟩] \geq 0

P_{x, y, a \sim μ \times ν \times V} [⟨ a, x ⟩ > ⟨ a, y ⟩] \geq P_{x, y, a \sim μ \times ν \times V} [⟨ a, y ⟩ > ⟨ a, x ⟩]

And this then rephrases as

P_{x, y, U \sim μ \times ν \times V} [U (x) > U (y)] \geq P_{x, y, U \sim μ \times ν \times V} [U (y) > U (x)]

Which was the original formulation of the problem.

Abbreviating the function $E_{x, y, a \sim μ \times ν \times V} [sgn (⟨ a, x - y ⟩)]$ as $f (μ, ν)$ , then a necessary condition to have a $μ : Δ C$ that dominates everything is that

sup μ \in Δ C inf ν \in Δ C f (μ, ν) \geq 0

If you have this property, then you might not necessarily have an optimal $μ$ that dominates everything, but there are $μ$ that get a worst-case expectation arbitrarily close to 0. Namely, even if the worst possible $ν$ is selected, then the violation of the defining domination inequality happens with arbitrarily small magnitude. There might not be an optimal lottery-lottery, but there are lottery-lotteries arbitrarily close to optimal where this closeness-to-optimality is uniform over every foe. Which seems good enough to me. So I'll be focused on proving this slightly easier statement and glossing over the subtle distinction between that, and the existence of truly optimal lottery-lotteries.

As it turns out, this slightly easier statement (that sup inf is 0 or higher) can be outright proven assuming the following conjecture.

Stably-Good-Response Conjecture: For every $ν : Δ C$ , and $ϵ > 0$ , there exists a $μ : Δ C$ and a $δ > 0$ s.t.

inf ν^{'} : d (ν, ν^{'}) < δ f (μ, ν^{'}) > - ϵ

Pretty much, for any desired level of suckage and any foe $ν$ , there's a probability distribution $μ$ you can pick which isn't just a good response (this always exists, just pick $ν$ itself), but a stably good response, in the sense that there's some nonzero level of perturbation to the foe where $μ$ remains a good response no matter how the foe is perturbed.

Theorem 1 Assuming the Stably-Good-Response Conjecture, ${sup}_{μ} {inf}_{ν} f (μ, ν) \geq 0$ .

I'll derive a contradiction from the assumption that $0 > {sup}_{μ} {inf}_{ν} f (μ, ν)$ . Accordingly, assume the strict inequality.

In such a case, there is some $ϵ$ s.t. $0 > - ϵ > {sup}_{μ} {inf}_{ν} f (μ, ν)$ . Let the set $A_{μ} := {ν | f (μ, ν) > - ϵ}$ . Now, every $ν$ lies in the interior of $A_{μ}$ for some $μ$ , by the Stably-Good-Response Conjecture. Since $Δ C$ is a compact set, we can isolate a finite subcover and get some finite set $M$ of probability distributions $μ$ s.t. $\forall ν \exists μ \in M : f (μ, ν) > - ϵ$ .

Now, let the set $B_{ν} := {μ \in c . h (M) | f (μ, ν) < - ϵ}$ . Since $- ϵ > {sup}_{μ} {inf}_{ν} f (μ, ν)$ , this family of sets manages to cover all of $c . h (M)$ (convex hull of our finite set.) Further, for any fixed $ν$ , $f (μ, ν)$ is a continuous function $c . h (M) \to R$ (a bit nonobvious, but true nontheless because there's only finitely many vertices to worry about). Due to continuity, all the sets $B_{ν}$ will be open. Since we have an open cover of $c . h (M)$ , which is a finite simplex (and thus compact), we can isolate a finite subcover, to get a finite set $N$ of $ν$ s.t. $\forall μ \in c . h (M) \exists ν \in N : f (μ, ν) < - ϵ$ . And now we can go

- ϵ > max μ \in c . h (M) min ν \in N f (μ, ν) \geq max μ \in c . h (M) min ν \in c . h (N) f (μ, ν) = min ν \in c . h (N) max μ \in c . h (M) f (μ, ν) \geq min ν \in c . h (N) max μ \in M f (μ, ν) > - ϵ

The first strict inequality was from how all $μ \in c . h (M)$ had some $ν \in N$ which made $f (μ, ν)$ get a bad score. The $\geq$ was from expanding the set of options. The $=$ was from how $f$ is a continuous linear function when restricted to $c . h (M) \times c . h (N)$ , both of which are compact convex sets, so the minimax theorem can be applied. Then the next $\geq$ was from restricting the set of options, and the $>$ was from how every $ν \in Δ C$ had some $μ \in M$ that'd make $f (μ, ν)$ get a good score, by construction of $M$ (and compactness to make the inequality a strict one).

But wait, we just showed $- ϵ > - ϵ$ , that's a contradiction. Therefore, our original assumption must have been wrong. Said original assumption was that $0 > {sup}_{μ} {inf}_{ν} f (μ, ν)$ , so negating it, we've proved that

sup μ inf ν f (μ, ν) \geq 0

As desired.

Threat-Resistant Bargaining Megapost: Introducing the ROSE Value

Diffractor2y20

With this.

Threat-Resistant Bargaining Megapost: Introducing the ROSE Value

Diffractor2y20

What I mean is, the players hit each other up, are like "yo, let's decide on which ROSE point in the object-level game we're heading towards"
Of course, they don't manage to settle on what equilibrium to go for in the resulting bargaining game, because, again, multiple ROSE points might show up in the bargaining game.
But, the ROSE points in the bargaining game are in a restricted enough zone (what with that whole "must be better than the random-dictator point" thing) to seriously constrain the possibilities in the object-level game. "Worst ROSE point for Alice in the object-level-game" is a whole lot worse for her than "Worst ROSE point for Alice in the bargaining game about what to do in the object-level-game".

So, the players should be able to ratchet up their disagreement point and go "even if the next round of bargaining fails, at least we can promise that everyone does this well, right? Sure, everyone's going for their own idea of fairness, but even if Alice ends up with her worst ROSE point in the bargaining game, her utility is going to be at least this high, and similar for everyone else."

And so, each step of bargaining that successfully happens ratchets up the disagreement point closer to the Pareto frontier, in a way that should quickly converge. If someone disagrees on step 3, then the step-3 disagreement point gets played, which isn't that short of the Pareto frontier. And if someone doesn't have time for all this bargaining, they can just break things off at step 10 or something, that's just about as good as going all the way to infinity.

Or at least, it should work like this. I haven't proved that it does, and it depends on things like "what does ROSE bargaining look like for n players" and "does the random-dictator-point-dominance thing still hold in the n-player case" and "what's the analogue of that strategy where you block your foe from getting more than X utility, when there are multiple foes?". But this disagreement-point ratcheting is a strategy that address your worries with "ever-smaller pieces of the problem live on higher meta-levels, so the process of adding layers of meta actually converges to solving the problem, and breaking it off early solves most of the problem"

Regarding your last comment, yes, you could always just have a foe that's a jerk, but you can at least act so they don't gain from being an jerk, in a way robust against you and a foe having slightly different definitions of "jerk".

Threat-Resistant Bargaining Megapost: Introducing the ROSE Value

Diffractor2yΩ711-1

I think I have a contender for something which evades the conditional-threat issue stated at the end, as well as obvious variants and strengthenings of it, and which would be threat-resistant in a dramatically stronger sense than ROSE.

There's still a lot of things to check about it that I haven't done yet. And I'm unsure how to generalize to the n-player case. And it still feels unpleasantly hacky, according to my mathematical taste.

But the task at least feels possible, now.

EDIT: it turns out it was still susceptible to the conditional-threat issue, but then I thought for a while and came up with a different contender that feels a lot less hacky, and that provably evades the conditional-threat issue. Still lots of work left to be done on it, though.

Threat-Resistant Bargaining Megapost: Introducing the ROSE Value

Diffractor2yΩ340

For 1, it's just intrinsically mathematically appealing (continuity is always really nice when you can get it), and also because of an intution that if your foe experiences a tiny preference perturbation, you should be able to use small conditional payments to replicate their original preferences/incentive structure and start negotiating with that, instead.

I should also note that nowhere in the visual proof of the ROSE value for the toy case, is continuity used. Continuity just happens to appear.

For 2, yes, it's part of game setup. The buttons are of whatever intensity you want (but they have to be intensity-capped somewhere for technical reasons regarding compactness). Looking at the setup, for each player pair i,j, is the cap for how much of j's utility that i can destroy. These can vary, as long as they're nonnegative and not infinite. From this, it's clear "Alice has a powerful button, Bob has a weak one" is one of the possibilities, that would just mean $D_{a, b} >> D_{b, a}$ . There isn't an assumption that everyone has an equally powerful button, because then you could argue that everyone just has an equal strength threat and then it wouldn't be much of a threat-resistance desideratum, now would it? Heck, you can even give one player a powerful button and the other a zero-strength button that has no effect, that fits in the formalism.

So the theorem is actually saying "for all members of the family of destruction games with the button caps set wherever the heck you want, the payoffs are the same as the original game".

Threat-Resistant Bargaining Megapost: Introducing the ROSE Value

Diffractor2yΩ470

My preferred way of resolving it is treating the process of "arguing over which equilibrium to move to" as a bargaining game, and just find a ROSE point from that bargaining game. If there's multiple ROSE points, well, fire up another round of bargaining. This repeated process should very rapidly have the disagreement points close in on the Pareto frontier, until everyone is just arguing over very tiny slices of utility.

This is imperfectly specified, though, because I'm not entirely sure what the disagreement points would be, because I'm not sure how the "don't let foes get more than what you think is fair" strategy generalizes to >2 players. Maaaybe disagreement-point-invariance comes in clutch here? If everyone agrees that an outcome as bad or worse than their least-preferred ROSE point would happen if they disagreed, then disagreement-point-invariance should come in to have everyone agree that it doesn't really matter exactly where that disagreement point is.

Or maybe there's some nice principled property that some equilibria have, which others don't, that lets us winnow down the field of equilibria somewhat. Maybe that could happen.

I'm still pretty unsure, but "iterate the bargaining process to argue over which equilibria to go to, you don't get an infinite regress because you rapidly home in on the Pareto frontier with each extra round you add" is my best bad idea for it.

EDIT: John Harsanyi had the same idea. He apparently had some example where there were multiple CoCo equilibria and his suggestion was that a second round of bargaining could be initiated over which equilibria to pick, but that in general, it'd be so hard to compute the n-person Pareto frontier for large n, that an equilibria might be stable because nobody can find a different equilibria nearby to aim for.

So this problem isn't unique to ROSE points in full generality (CoCo equilibria have the exact same issue), it's just that ROSE is the only one that produces multiple solutions for bargaining games, while CoCo only returns a single solution for bargaining games. (bargaining games are a subset of games in general)

LESSWRONG
LW

Sequences

Posts

Wiki Contributions

Comments