Agent properties for safe interactions
Why another round of Prisoner’s Dilemma is unlikely to be helpful, and a suggestion for what to do instead
Cooperation failures in multi-agent interactions could lead to catastrophic outcomes even among aligned AI agents. Classic cooperation problems such as the Prisoner’s Dilemma or the Tragedy of the Commons have been...
Nov 25, 2025