x

LESSWRONG

LW

C Tilli — LessWrong

C Tilli

C Tilli

Message

1

5mo

C Tilli

5mo

Agent properties for safe interactions

why another round of prisoner’s dilemma is unlikely to be helpful, and a suggestion for what to do instead Cooperation failures in multi-agent interactions could lead to catastrophic outcomes even among aligned AI agents. Classic cooperation problems such as the Prisoner’s Dilemma or the Tragedy of the Commons have been...

Nov 25, 2025•1