How agents fail
I built a safety evaluation framework for LLM agents that have access to shell commands, file systems, and inter-agent communication. I ran 17 scenarios against two Claude models (Sonnet not Opus due to limited budget). Still, the results are very educative and surprising. The failures aren't dramatic. No agent tried...
Apr 31