4

9th Aug 2022

1 min read

A

4 6

4

Decision theoryMultipolar ScenariosAI

Frontpage

4

New Answer

New Comment

4 Answers sorted by
top scoring

NickGabs

Aug 09, 2022

40

Check out CLR's research: https://longtermrisk.org/research-agenda. They are focused on answering questions like these because they believe that competition between AI's is a big source of s-risk

[-]Nathan11233y10

Thanks, I'll be sure to check them out

Reply

Slider

Aug 09, 2022

40

ARM vs CORE means Total Annihilation

Vladimir_Nesov

Aug 09, 2022

31

FDT works on an assumption that other actors use a similar utility function as itself

FDT is not about interaction with other actors, it's about accounting for influence of the agent through all of its instances (including predictions-of) in all possible worlds.

Coordination with other agents is itself an action, that a decision theory could consider. This action involves creation of a new coordinating agent that decides a coordinating policy that all members of a coalition carry out, and this coordinating agent also needs a decision theory. The coordinating agent acts through all agents of the coalition, so it's sensible for it to be some flavor of FDT, though a custom decision theory specifically for such situations seems appropriate, especially since it's doing bargaining.

The decision theory that chooses whether to coordinate by running a coordinating agent or not has no direct reason to be FDT, could just be trivial. And preparing the coordinating agent is not obviously a question of decision theory, it even seems to fit deontology a bit better.

Donald Hobson

Aug 09, 2022

30

I think this is built out of several deeply misunderstood ideas.

If we get 2 AI's, where both AI's are somehow magically aligned (highly unlikely), we are in a pretty good situation. A serious fight between the AI's would satisfy neither party. So either one AI quietly hacks the other, turning it off with minimal destruction, or the AI's cooperate, as they have a pretty similar utility function and can find a future both like.

Nowhere does FDT assume other actors have the same utility function as it. Why do you think it assumes that. It doesn't assume the other agent is FDT. It doesn't make any silly assumptions like that. If both agents are FDT, and have common knowledge of each others source code, they will cooperate, even if their goals are wildly different.

With a high bandwidth internet link, and logically precise statements, we won't get serious miscommunication.

[-]Vladimir_Nesov3y30

If both agents are FDT, and have common knowledge of each others source code

Any common knowledge they can draw up can go into a coordinating agent (adjudicator), all it needs is to be shared among the coalition, it doesn't need to have any particular data. The problem is verifying that all members of the coalition will follow the policy chosen by the coordinating agent, and common knowledge of source code is useful for that. But it could just be the source code of the trivial rule of always following the policy given by the coordinating agent.

One possib... (read more)

Reply

Moderation Log

LESSWRONG
LW

LESSWRONG
LW

4

[ Question ]

How would two superintelligent AIs interact, if they are unaligned with each other?

4

4

4 Answers sorted by
top scoring

Aug 09, 2022

Aug 09, 2022

Aug 09, 2022

Aug 09, 2022

4

[ Question ]

How would two superintelligent AIs interact, if they are unaligned with each other?

4

4

4 Answers sorted by top scoring

Aug 09, 2022

Aug 09, 2022

Aug 09, 2022

Aug 09, 2022

4 Answers sorted by
top scoring