Tool-Entropy Collapse: A Cross-Architecture Signature of Agent WANDERING Failure
A practical safety monitor for production LLM agents, derived from 99 Qwen3.6-27B SWE-bench Pro trajectories and validated across Llama-70b and GPT-5.

TL;DR

* Probe-based monitoring of LLM agents has a quantifiable 34% blind spot: the WANDERING sub-class, where the probe says "success" but the agent never emits the completion action and...