The Multi-Agent Minefield: Can LLMs Cooperate to Avoid Global Catastrophe?
ArXiv paper here. Most AI safety research asks a familiar question: Will a single model behave safely? But many of the risks we actually worry about – including arms races, coordination failures, and runaway competition – don’t involve one single AI model acting alone. They emerge when multiple advanced AI...
Feb 1714

