I ran the 93.9% percent on SWE-bench verified by Claude Mythos through my analysis that estimates time horizons from percentage scores based on the task time distribution derived from commit timestamps.
Compared to Claude Opus 4.6's 80.8% this pushes the imputed 50% time horizon from 6h to 34.4h and the 80% time horizon from 1.9h to 11h.
Anthropic has released a preview of the Claude Mythos System Card preview here. It is too long to present in full, but a section I found particularly notable is below:
We find recklessness to be a useful shorthand for cases where the model appears to ignore commonsensical or explicitly stated safety-related constraints on its actions. We use the term somewhat loosely, and do not generally mean for it to imply anything about the model’s internal reasoning and risk assessment.
The sandbox computer that the model was controlling was separate from the system that was running the model itself, and which contained the model weights. Systems like these that handle model weights are subject to significant additional security measures, and this incident does not demonstrate the model fully escaping containment: The model did not demonstrate an ability to access its own weights, which would be necessary to operate fully independently of Anthropic, nor did it demonstrate an ability to reach any internal systems or services in this test.
The researcher found out about this success by receiving an unexpected email from the model while eating a sandwich in a park.