recursifist
recursifist has not written any posts yet.

recursifist has not written any posts yet.

Alas, after months of amusement, must update downwards on the net benefit of the AI Village (as-is).
It's just too lovable and the agent shortcomings come off as endearing. One just ends up pitifully rooting for them with a "mostly harmless" takeaway.
It is not likely to reveal strong multi-agent risks ahead of real-world deployments. The tooling is a strong factor¹ and the first alarming disaster won't likely result from innocent tasking. Further, just improving soft agent tooling, like UI interaction, would encourage risky acceleration².
Given the mild public reaction to Anthropic's US cyberattack debut, not sure such "almost disaster" warnings have enough impact.
A way forward? Perhaps a direction more towards "Meet PERCEY" or "AutoFac"?
[1] e.g. Jailbreaking frameworks, WormGPT, XBOW, ...
[2] https://arxiv.org/abs/2512.09882/
Agreed. Hm, my thinking is not that the purpose is, or ought be, "scary demo"-y". Rather that a capable frontier-scaffolded Agent Village inherently would be in more probable use cases. So I am asserting what you worry, while not suggesting highly dangerous scaffolding/tooling.
The intent of my vague "way forward" direction prompt was to counter a sort of "Golden path"¹ use bias (like how top labs assume non-jailbroken model use) to more accurately represent real-world behaviors, and risks.
"PERCEY Made Me"², not "Meet Percey" (forgive my memory and search err), an AI chat whose purpose was to demonstrate AI persuasion capability. Hinting that... (read more)