At the risk of re-hashing some things which have already been covered a lot, I want to outline some of my current thinking on ethics/morality/meta-ethics. I haven't yet found a distinction between these terms which feels to me like more than a distinction of the moment. What I'm talking about, for the purposes of this post, is game-theoretic reasoning which tends to promote cooperation and get good results. I'll call this "game-theoretic morality" here.

I suspect there isn't something like an objectively correct game-theoretic morality. The pragmatically best approach depends too much on what universe you're in. Players can enforce weird equilibria in iterated Prisoner's Dilemma. If you find yourself in a strange playing field, all sorts of irrational-looking strategies may be optimal. That being said, we can try to capture working principles for the situations we find we tend to get into, and hope they don't generalize too badly.

Coalition Dynamics

I think of game-theoretic morality largely in terms of coalition dynamics. In some sense, the ideal outcome is for everyone to be on the same team, maximizing a combined utility function. Unfortunately, that's not always possible. A pure altruist, who values everyone equally, is exploitable; unqualified altruism isn't a winning strategy (or rather, isn't always a winning strategy), even from the perspective of global coordination. A more pragmatic strategy is to give consideration to others in a way which incentivizes joining your coalition. This often allows you to convince selfish agents to "grow the circle of empathy", creating overall better outcomes through coordination.

This line of thinking leads to things like Nash bargaining and the Shapley value. Everyone in the coalition coordinates to provide maximum value to the coalition as a whole, but without treating everyone equally: members of the coalition are valued based on their contribution to the coalition. If you want the setup to be more egalitarian, that's a matter of your values (which your coalition partners should take into account), but it's not part of the ideal game-theoretic morality.
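
To make "valued based on their contribution" a bit more concrete, here is a minimal sketch of a Shapley-value computation for a hypothetical three-player coalition game. The characteristic function below is made up purely for illustration; nothing in this post pins down this particular formalization.

```python
from itertools import permutations

# Hypothetical characteristic function: the value each subset of players can
# create on its own. These numbers are invented purely for illustration.
v = {
    frozenset(): 0,
    frozenset({"A"}): 10, frozenset({"B"}): 10, frozenset({"C"}): 0,
    frozenset({"A", "B"}): 30, frozenset({"A", "C"}): 20, frozenset({"B", "C"}): 20,
    frozenset({"A", "B", "C"}): 50,
}
players = ["A", "B", "C"]

def shapley_values(players, v):
    """Average each player's marginal contribution over all join orders."""
    totals = {p: 0.0 for p in players}
    orders = list(permutations(players))
    for order in orders:
        coalition = frozenset()
        for p in order:
            totals[p] += v[coalition | {p}] - v[coalition]
            coalition = coalition | {p}
    return {p: totals[p] / len(orders) for p in players}

print(shapley_values(players, v))  # {'A': 20.0, 'B': 20.0, 'C': 10.0}
# C contributes less to every coalition it joins, so C is valued less,
# even though everyone coordinates on the joint outcome.
```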

This point is similar to Friendship is Utilitarian. Even if you have egalitarian altruistic goals, and even if your potential allies don't, it can be overall better to form alliances in which you help the people who you expect will help you most in return.

If this sounds a little too cold-hearted, there's probably a good reason for that. When I said "things like Nash bargaining and Shapley value", I was purposefully leaving things open to interpretation. I don't know what the right formal model is for what I'm talking about. I suspect there's some use for extending benefit of the doubt, in general. For example, in the Prisoner's Dilemma, if your strategy is to estimate the probability p that the other person will cooperate and then cooperate with probability p yourself, the result is unstable when playing with other similar agents. However, if you cooperate with probability p+0.001, then both people are trying to be a little more cooperative than the other. You'll cooperate 100% of the time with others following the same strategy, while sacrificing very little in other situations. Common knowledge that you'll extend a little more trust than is "strictly justified" can go a long way!
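
Here is a toy simulation of the p+0.001 idea, under the deliberately simplified assumption that each player's estimate of the other is just the other's current cooperation probability (so this is not a serious opponent-modelling setup, just an illustration of how the extra trust compounds):

```python
EPSILON = 0.001

def respond(p_other_cooperates):
    """Cooperate with probability (estimated opponent cooperation) + epsilon, capped at 1."""
    return min(1.0, p_other_cooperates + EPSILON)

# Two such agents, each starting from an arbitrary mutual estimate of 0.5.
p1, p2 = 0.5, 0.5
for _ in range(1000):
    p1, p2 = respond(p2), respond(p1)

print(p1, p2)  # both reach 1.0: a little extra benefit of the doubt ratchets up to full cooperation
```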

By the way, in one sense, the "True Prisoner's Dilemma" is impossible between agents of the sort I'm imagining. They see the game set-up and the payoff table, and immediately figure out the Nash bargaining solution (or something like it), and re-write their own utility function to care about the other player. From this perspective, the classical presentation of Prisoner's Dilemma as a game between humans doesn't provide such bad intuitions after all.
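
For reference, the two-player Nash bargaining solution these agents would be computing picks the feasible payoff pair that maximizes the product of gains over the disagreement point d (in the Prisoner's Dilemma reading, d would naturally be the mutual-defection payoffs, though the post doesn't commit to this exact solution concept):

$$(u_1^*, u_2^*) \;=\; \arg\max_{(u_1, u_2) \in F,\ u_i \ge d_i} \; (u_1 - d_1)(u_2 - d_2)$$

where F is the set of jointly achievable payoff pairs.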

Preference Utilitarianism

Preference utilitarianism makes a lot more sense within this kind of coalition than alternatives like hedonic utilitarianism. We help allies in what they care about, not what we think they ideally should care about. You're allowed to care about the happiness of others. Allies in your coalition will support your wishes in this respect, to the extent that you've earned it (and perhaps a little more). But, as with egalitarianism, that's a matter of your personal preference, not a matter of game-theoretic morality.

Deontology

Another wrinkle in the story is timeless decision theory, which gives something more like rule utilitarianism than the more common act utilitarianism. This is quite close to deontology, if not identical. In particular, it sounds quite close to Kant's categorical imperative to me.

Arguably, timeless decision theory does not exactly give rule utilitarianism: trying to take the action which you would want relevantly similar decision makers to take in relevantly similar situations is not necessarily the same as trying to act according to the set of rules which are highest-utility. Creating rules (such as "do not kill", "do not lie") risks over-generalizing in a way which trying to follow the best policy doesn't. However, this over-generalization is good for humans: we can't expect to work out all the instances correctly on the spot, especially accounting for biases. Furthermore, clear rules are going to be better for coalition coordination than just generally trying to take the best actions (although there's room for both).

A common objection is that deontology is about duty, not about consequences; that even if rule utilitarians do arrive at the same conclusions, they do it for different reasons. However, from a coalition perspective, I'm not sure "duty" is such a bad way of describing the reason for following the rules.

Contractualism

The kind of reasoning here has some similarities to Scott Alexander's attempt to derive utilitarianism from contractualism.

Now, I won't try to pretend that I understand contractualism all that well, but I think orthodox contractualism (as opposed to Scott Alexander's version) does something more like a "min" operation than summing utilities. From the SEP article:

Since individuals must be objecting on their own behalf and not on behalf of a group, this restriction to single individuals' reasons bars the interpersonal aggregation of complaints; it does not allow a number of lesser complaints to outweigh one person's weightier complaint.

I surprise myself by thinking that something similar applies to ideal coalition dynamics.

Harsanyi's Utilitarian Theorem is a very strong argument for the utilitarian practice of aggregating utilities by summing them. However, when we take coalition dynamics into account, we see that there's a need to keep everyone in the coalition happy. Utilitarianism will happily kill a few group members or expose them to terrible suffering for the greater good. If coalition members can foresee this fate, they will likely leave the coalition.
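
For context, Harsanyi's theorem says (roughly) that if each individual's preferences and the social preferences all satisfy the von Neumann-Morgenstern axioms, and the social preferences respect a Pareto condition, then social welfare must be (up to an additive constant) a weighted sum of the individual utilities:

$$W(x) \;=\; \sum_{i=1}^{n} w_i \, u_i(x), \qquad w_i \ge 0.$$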

This situation is somewhat improved if the coalition members are using something like timeless decision theory, since they will have a greater tendency to commit to beneficial arrangements. However, assuming a typical "veil of ignorance" seems too strong -- this is like assuming that all the agents come from the same timeless perspective (a position where they're ignorant of which agent they'll become). This would allow a perfect Harsanyi coordination, but only because everyone starts out agreeing by assumption.

If there's a great degree of honor in the coalition, or other commitment mechanisms which enforce what's best for the group overall in the Harsanyi sense, then this isn't a concern. However, it seems to me that some sort of compromise between optimizing the minimum and optimizing the average will be needed. Perhaps it'd be more like optimizing the average subject to the constraint that no one is so badly off that they will leave, or optimizing the average in some way that takes into account that some people will leave.
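
Purely as a sketch of that compromise: writing r_i for what member i expects to get by leaving (a parameter I'm introducing here, not something pinned down in the post), the constrained version would be something like

$$\max_{x} \ \frac{1}{n}\sum_{i=1}^{n} u_i(x) \quad \text{subject to} \quad u_i(x) \ge r_i \ \text{ for all } i.$$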

Population Ethics

Perhaps the most famous objection to utilitarianism is the repugnant conclusion. However, from the coalition-dynamics perspective, the whole line of reasoning relies on improper comparison of utility functions of differing coalitions. You determine whether to expand a coalition by checking the utility of that act with respect to the current coalition. A small coalition with a high average preference satisfaction isn't better or worse than a large one with a medium average preference satisfaction; the two are incomparable. There's no difference between total utilitarianism and average utilitarianism if applied in the right way. A member is added to a coalition if adding that member benefits the existing coalition (from an appropriate timeless-rule perspective); adding members in this way can't result in lives barely worth living (at least, not in expectation).

This conclusion is likely weakened by benefit-of-the-doubt style reasoning. Still, the direct argument to the repugnant conclusion is blocked here.
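
Schematically, the membership rule described above might look like the following sketch. The names are mine, and coalition_utility stands in for whatever aggregation (e.g. a bargaining solution, evaluated from an appropriate timeless-rule perspective) the coalition actually uses:

```python
def should_admit(current_members, candidate, coalition_utility):
    """Admit a candidate only if doing so benefits the existing coalition.

    coalition_utility(members, evaluators) is a placeholder for however the
    coalition scores an arrangement from the standpoint of `evaluators`.
    """
    before = coalition_utility(current_members, evaluators=current_members)
    after = coalition_utility(current_members | {candidate}, evaluators=current_members)
    # The candidate's own welfare only matters insofar as existing members care
    # about it, so repeatedly applying this rule doesn't (in expectation) push
    # the coalition toward lives barely worth living.
    return after > before
```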

Conclusion

The rules say we have to use consequentialism, but good people are deontologists, and virtue ethics is what actually works.

-Eliezer Yudkowsky

Part of what I'm trying to get at here is that every major candidate for normative ethics makes points which are importantly true, and that they seem easier to reconcile than (I think) is widely recognized.

On the other hand, I'm trying to argue for a very specific version of utilitarianism, which I haven't even fully worked out. I think there's a lot of fertile ground here for investigation.

Comments

However, if you cooperate with probability p+0.001, then both people are trying to be a little more cooperative than the other. You'll cooperate 100% of the time with others following the same strategy, while sacrificing very little in other situations.

Really haven't thought much about this, but my brain wants to say that this strategy must clearly be exploitable somehow.

I won't try to argue that this strategy in particular is ideal. (FYI, this is the strategy called "nicerbot".) However, the general pattern I'm using it to point at, where you give just a little benefit of the doubt, is only slightly exploitable as a rule. This will often be worth it due to a number of cases where the strategy helps force good equilibria.

By the way, in one sense, the "True Prisoner's Dilemma" is impossible between agents of the sort I'm imagining. They see the game set-up and the payoff table, and immediately figure out the Nash bargaining solution (or something like it), and re-write their own utility function to care about the other player.

This seems strange to me. My intuitions about agent design say that you should practically never rewrite your own utility function. The thing that "re-write their own utility function" points to here seems to be something more accurately described as "making an unbreakable commitment", which seems like it could be done via a mechanism separate from literally rewriting your utility function. Humans seem to do something in that space (i.e. we have desires and commitments, both of which feel quite different and separate from the inside).

I agree, that's a more accurate description. The sense in which "true prisoner's dilemma" is impossible is the sense in which your utility function is the cooperative one you commit to. It makes sense to think in terms of your "personal" (original) utility function and an "acting" utility function, or something like that.

I still think this undermines the point of the "true prisoner's dilemma", since thinking of humans gives decent intuitions about this sort of reasoning.

I very much agree with the broad gist of the post, but also have many specific points that I disagree with. This feels like a post for which inline commenting, or a special content block that allows a comment thread to start from that place, would be extremely useful.

In the absence of that, I will write multiple replies to separate parts of the post, so that we can keep the discussion threads apart.