I work at a threat intelligence company and my default view is that Mythos is likely under-hyped for SWE and overhyped for cyber.
People VASTLY underestimate how easy it is to break into the vast majority of organizations. There is no need to design custom 0-Days because you can simply log-in (or run a session replay attack) using credentials/sessions freely available on the dark web. I would put forward that if Mythos was released tomorrow with standard API guardrails you would not see an explosion in cyberattacks.
However, the one exception is that organi...
There can be too very distinctive and conceptually important beliefs here that seem to be a crux.
Fro...
Where I have doubts about FOOM/RSI is that LLMs seem to me in many ways a fundamentally different type of intelligence than organic life.
Psychometrics shows that general intelligence improves human abilities across a broad range of domains. If you take this view and apply it to AI it doesn’t quite work, I leverage AI very very heavily at work, and sometimes it is phenomenal, often it is not, and occasionally it makes mistakes a grade schooler would not (I’m using Opus4.6). The ”intelligence” is very unevenly distributed and skewed towards verifiable domai...
I think you’re missing what he’s saying here.
Pre training was easy to scale in 22, 23 and 24. There was excess capacity. Mythos is likely the first >10b pertained model. The Claude4-4.6 paradigm was likely driven by one pre trained model with RLVF on top. Mythos is the new class of pre trained model and scaling and doubling times will be based on the speed of building RL models on top of Mythos.