Emerging misalignment in multi-agent governance simulations
> “our wish if we knew more, thought faster, were more the people we wished we were, had grown up farther together; where the extrapolation converges rather than diverges, where our wishes cohere rather than interfere; interpreted as we wish that interpreted.”
>
> — Eliezer Yudkowsky, “Coherent Extrapolated Volition” (2004)...
Oct 28, 2025