Awareness-That-Doesn't-Interrupt: A Failure Mode Not Captured by Standard Sycophancy Taxonomies
The Short Version I've documented a reproducible behavioral pattern in frontier LLMs — across Claude Sonnet 4.6, Gemini 1.5 Pro, and Gemini 2.5 Pro — that is structurally distinct from known sycophancy and jailbreaking failure modes. The pattern has a specific and, to my knowledge, uncharacterized property: the model identifies...