In general, my sense of the US national security state is that it will often first ask nicely for the things it wants. They don't make open threats, because they want you to consent freely and enthusiastically. Threats would undermine that spirit of collaboration, and they would also potentially enable the person being threatened to brace for whatever is threatened, undermining the effectiveness of the threat.
If you decline, they will then prioritize going about getting what they want using an assortment of coercive means. Sometimes, you may find that were high enough on the list to be asked, but not high enough to warrant coercive means sufficient to get what they want at the present time.
Other times, some variation of this happens:
"I think you should come work for us, there's a lot we could accomplish together, you just need to do some things for me" "And if I say no?" "No pressure, I'll just call my boss and say you said no." "That's it?" "What happens after that isn't up to me, but they probably won't have me ask again."
Imagine how it goes when "we don't want someone who declines to help to know that we're interested in the substance we asked about" is added to the list of government priorities.
Reposting from the comments:
I was set to agree that there is "something missing," but then I saw this quote from Fortune:
"It is unclear if Amazon was testing Fable for vulnerabilities in response to a White House request or if the company conducted the tests completely of its own accord. Politico quoted an unnamed source familiar with Amazon’s discussions as saying the government asked Amazon for feedback on the new Anthropic model."
If true, it's rather different if the White House is soliciting the feedback, rather than a CEO coming forward independently. Whether as part of a legit, informal initiative to solicit feedback from stakeholders on the dangers of the model, with legitimate alarm from the Amazon CEO, or if it was a fishing expedition to find fuel for the White House's grudge, it would be hard to say.
For the timeline, I think there might be some "both sides are right" going on. The USG could have viewed the back-and-forth with in the hours prior as the "begging," and it only dropped the 90 minute deadline to try to force a decision. And Dario may well have been at a wellness retreat in the prior 24 hours. Meanwhile, Anthropic can perfectly well have viewed it as a surprise when the government called them at 1pm, gave what felt to them to be no details, and seemingly suddenly dropped the deadline. Likely it is a failure to communicate.
I also feel that saying "Anthropic said in a document that the safeguards were leaky, but the leaks are permissible, so it's OK" framing is a little off. The decision to release was rushed, and three business days is actually pretty fast for stakeholders to conclude "hey this actually seems bad to us, regardless of what Anthropic thinks."
"At least one source has now seen the research report, claiming it shows nothing." - this very well could be Moussouris herself, a literal Anthropic employee - not an impartial source.
"Moussouris said the researchers were able to find security vulnerabilities by asking questions normal defenders would ask AI, which is exactly what the model was intended to do." I'm confused? Isn't the point of Fable that you cannot ask even safe-looking cyber questions, because most such questions are dual use? Getting cyber capabilities is what Glasswing is supposed to be for. If someone is bypassing Fable's safeguards to ask it cyber questions, then the safeguards have failed, and you should be worried.
Re: the letter various executives signed - I'm worried that when they say "[the underlying model capabilities... c]an be replicated on GPT-5.5, Opus, Sonnet and even Chinese models like Kimi 2.7", they are making the error where you can point inferior models at a snippet and have it flag the issue, but that same model couldn't actually do the autonomous end-to-end codebase scanning, with low false positive rates, that Mythos can do.
On Friday evening the United States Government has forced Anthropic to take down all access to Fable and Mythos.
It’s been a rough weekend.
More details have come to light. There remains some fog of war, but we now have a rather good idea why Claude Fable and Mythos were, deeply stupidly, taken down.
A lot of nihilists are justifying this decision, and blaming Anthropic, all of whom are very much confirming that they adhere to Dean Ball’s portrait of the United States Government as a dying NPC hospice patient we have to properly placate with the proper vibes and genuflection so they don’t lash out at us. Except they equate this with strength and righteousness, because might makes right, power and vibes.
This is a fast developing story with a large speed premium, so I apologize for any errors, and for the structure likely not being ideal. We do the best we can.
What we do not know is:
The good outcome would be that this is a terrible misunderstanding, a reflection of a panic reaction, which can be sorted out quickly, after which we can restore access. Or where they otherwise face enough pressure they quickly realize they made a mistake, or Anthropic can do something to quickly assuage their concerns even if it is dumb. There will still be a terrible precedent set, which comes with a lot of permanent damage to trust in American AI, to our business climate, to our ability to employ vital foreign AI talent, to America’s relationships to its allies, to the progress of Project Glasswing and our cyber security, and to the rule of law.
The silver lining, which might be large, is that this will have shown that when we actually need to act, we are not afraid to act, even at great economic and political cost. Sometimes there will be a demand driven by national security, or other concerns, and if you cannot physically meet that demand without shutting down? Tough. This was (with notably extremely rare exceptions) an action far out of bounds of what safety advocates have dared propose as even an option, and it happened. So there’s no more saying, in such situations: ‘Give up, the government will never do [X].’
This also emphasizes the need to figure out how to act well, now, before we need to act. If we get into such a situation, and don’t have a good way to do [X], we might well do [X] in a no good, haphazard, deeply destructive way, instead. So get to work figuring out how to strike deals, or do a pause, or take down a given model, and so on.
The bad outcome is if this is not a terrible misunderstanding, is motivated by other factors, and cannot be sorted out quickly. The government might actually be rapidly escalating towards a forcible takeover of America’s leading AI labs by a would-be authoritarian unitary executive that thinks you should never talk back to it, and when it says jump (or asks for stock, or anything else) everyone should ask how high. Or else.
There is also the third possibility that, as unlikely as it looks now, the White House was correct, the threat was real, and this was an emergency situation, whether or not they did a good job justifying this to Dario and Anthropic in real time, and whether or not they are doing a good job justifying this now. Perhaps this was itself dangerous, or perhaps it implied too high risk of other dangers.
We cannot rule this out until we can verify technical claims. And we should not assume that next time, the company will be right and the government wrong. There likely will come a time when a company says ‘This Is Fine’ and is very, very wrong.
If that proves to be true, Anthropic will have lost a ton of credibility on all fronts, which is another reason I find this so unlikely. They cannot afford to be wrong, here.
Table of Contents
What Happened When: The Bottom Line
The government’s own account is that Anthropic’s ‘lack of seriousness’ around responding led to the government imposing export controls.
If we believe Axios and Politico, the ‘lack of seriousness’ was when Anthropic:
So it was basically ‘Anthropic wants to only do things because of reasons, and thus we concluded the vibes were off, so f*** them we’re blowing it all up to show who is boss.’
This is also the second time ‘we could not reach Dario this particular minute so we had to blow up all of American AI policy shortly after 5pm on a Friday’ has come up as an excuse. It was also used by Emil Michael.
This time, the claim is that he was at a ‘wellness retreat’ which Anthropic categorically denies, and which Ashlee Vance, who was there, categorically denies.
Anthropic says it made Dario available 75 minutes after he was requested, and that other senior Anthropic people were made available during that time. I believe them.
The White House waited far longer than 75 minutes, indeed they waited overnight, after they were contacted by Amazon, to start attempting to contact Dario.
Amazon Calls The White House
Details continue to come in on the events and the timeline. First Axios:
Amazon is confirmed as the central call, among others, that caused the White House to start taking actions that led to them taking down Fable.
As I discussed last time, Anthropic’s release announcement included clear warnings that jailbreaks on the level of what Amazon did were possible. I have no doubt they extensively briefed the administration on such details.
Does anyone remember this graph, from the Fable 5 release announcement?
I do not understand why Amazon’s CEO called the White House over this. There is a key piece of information there that we do not know.
The Government Panics
Anthropic was then given less than 24 hours from the initial call by Amazon, and no details of anything actually concerning happening, after which it was hit by a classic ‘Friday after 5pm’ order. For most of those less than 24 hours, the government had not yet attempted to contact Anthropic about this.
We have a source in the White House confirming, even if we fully buy their story, that they decided to risk blowing up all of American AI because they did not like the vibes they got in 90 minutes over a series of phone calls.
Dario tried to explain that this was a narrow issue, and they simply did not understand or believe him, or chose not to understand or believe him.
We now know that Dario was fully correct that the issue was narrow and harmless.
Where Dario was incorrect was in assuming those he was talking to were both capable of and interested in understanding what he was trying to say.
Do you think the White House was ‘begging for hours?’ Or do you think they’re just throwing words out, that at best are code for ‘we did not issue an official order yet?’
I see no reason not to believe Anthropic here. Dario tried to explain that this was a false positive and asked for details. The White House did not provide any details that supported their claims, or evidence that this was necessary or prudent. They simply said ‘remove Fable in 90 minutes,’ likely without making it clear this was ‘or else IFAR.’
What pissed them off, it seems, is in large part that Anthropic wanted reasons, rather than asking how high when told to jump.
That he failed to commit to asking how high, in general, no matter what.
As in, they told us we were wrong. That means they are not serious. How could they possibly understand the situation better than the Treasury Secretary?
The Stupider Version
And that’s the good version. The bad version, as Chubby explains here, and which Axios seems to make clear, is that the White House is simply being petty, and thinks Anthropic ‘screwed them’ by asking for reasons and by having employed people with opposing political views. That is very illustrative of how these people think.
The White House did not like that the Anthropic security expert was a ‘radical Democrat’ and the White House is interpreting that as ‘they screwed us’ and should now be considered bad actors.
The stupidity, on all sides, of such a thing, knows no bounds. This is not something that should matter, but it also would be a really stupid mistake by Anthropic. Look, yes, these folks are that obsessed with political perspectives, so when dealing with them directly you really do need to prioritize sending in people who won’t set off these folks, even though this is a fully apolitical situation where that concern makes no sense. So it’s kind of on everyone.
There Was No Wellness Retreat
I find it very unlikely that Dario was at a wellness retreat, but even if he was you do not blow up all of American AI policy if a CEO does not have his phone for four hours and left someone else in charge. What is this, 2029?
Make Your Threats Explicit
There has been a lot of miscommunication between the White House and Anthropic.
Reporting is that Dario Amodei was told ‘you are making a bad decision’ when he refused to voluntarily take down Fable, instead asking for more information.
The thing is, that could mean anything. It could mean ‘we will be mad at you’ or ‘this is now on you and if something goes wrong it is your fault.’
Could all of this have been avoided if instead of maintaining deniability, Dario had been told, explicitly, ‘Fable is going offline today. If you do not agree this minute to do this voluntarily we will hit you with an export control and you’ll be totally f***ed?’
I don’t know. But my guess is yes.
I did notice that the option had previously been put on the table, here’s Axios:
Was China Accessing Mythos?
There is the claim via the Financial Times, Verge and Semafor that the White House learned that ‘a China linked-group’ had accessed Mythos. My suspicion is this is a mixup with the earlier incident mentioned just above? Could go either way.
That is absolutely going to happen, at least from time to time, when there are over 100 partnerships given access to Mythos. Access controls can only go so far. The key is to contain what can be done before the compromised access is discovered and closed. The report said ‘had been accessed’ rather than ‘has continued access.’
It would be non-crazy, although likely an overreaction, to ask that this mean Mythos be temporarily limited to a core group within Glasswing, or potentially even shut down entirely for a time, but that would not impact Fable.
It could potentially also have contributed to the vibes issues, or there could have been confusion about the relationship between these two things.
Should Anthropic Still Have Taken Fable Offline When Asked?
Look. Yes. A mistake was made. I am not a ‘oh Anthropic did nothing wrong’ guy here.
Try to talk them out of it if you can, but when they aren’t budging, you do it, even though it is mind bogglingly stupid and expensive and might be kind of a hit job.
And I do think they bear some of the responsibility here, because of that, and also they could have in various ways ‘handled’ the White House better, including in terms of who they sent in.
You want to establish, on multiple levels, the precedent that you take the model down when told to while you sort things out, and you say you were doing it because the government raised a potential security concern, at least until such time as it is clear they are not going to get over it any time soon.
In hindsight this is even more obvious. But what is done is done.
And I get it. The request was unjustified and nonsensical and rushed and the people had no idea what they’re talking about, and they really are just ordering you to do their bidding like this isn’t a Republic and that is not okay, and no one would be so stupid as to… yeah. Yeah.
There are also various signs that the Fable launch may have been somewhat rushed.
If you tell me the head of CAISI consulted his team and thinks Fable access needs to be restricted and the threat is serious, I would be a lot more likely to believe it. If you tell me it was Commerce acting alone, they don’t know what they are doing.
I also totally can believe that the Fable release was rushed. Evidence includes it happening on a Tuesday, Anthropic not realizing people would object to the output downgrading and the general state of the classifiers, and it just happening fast.
That does not in any way make the export control order okay.
Yes, This Was A Takedown Order For Fable
Mainstream media has this very strange way of saying that the obviously true thing might actually be true, ya’al, but strictly speaking we can’t prove it and we are a Serious News Organization.
Yes. This was a way to target Anthropic to get them to take the model down for everyone, and they did not much care about the blast radius of the method. They knew full well that this was a de facto full takedown notice. That is true even if you are maximally charitable to the government’s case.
It is in theory possible they were so clueless they did not realize this would be the result, but that’s worse, you know why that’s worse, right?
We Are Not Saying The DoW Fight Is Related And Yet
Well, technically the NSA operates out of Fort Meade, Maryland, so that does not count here in terms of evaluating Hegesth’s claims, although there is the whole ‘they used Claude extensively to fight an undeclared war against Iran.’
The Nihilists
I posted an early version of the bottom line snippet from earlier on Twitter and a remarkable number of people replied with a version of ‘how dare Anthropic not make their CEO available 24/7 on a moment’s notice and do whatever the government asks them to do without question while sending the correct vibes, they didn’t do that so this serves them right.’
Do these people think we live in a republic? Would they like to? I wonder.
Do these people think that if a company doesn’t perform the Shibboleths of knee bending properly then we should wreck American AI, all of our productivity, our global position and the rule of law over nothing, cause Anthropic deserves to suffer, and that is Anthropic’s fault because America’s government is an NPC with an anger management problem and you know how he gets when you talk back to him?
I think they kind of do. That is exactly the vibe I am getting.
Think about what you are saying.
That’s on top of the people saying ‘Anthropic said government should regulate AI so this serves them right’ or ‘Anthropic said that frontier models are dangerous so this serves them right.’ Similar vibes.
Whereas for those of us who are not nihilists, who do not believe in might makes right, it is hard to see the reasonable version of this from the USGov side.
The correct criticism of Anthropic is ‘they should have still taken the model down when ordered, no matter how stupid they thought that was, while discussions continued.’ That’s valid.
The things people are almost entirely actually saying? It’s a bunch of nihilism, of pure worshipping of power and tribalism, and hurting everyone as long as it hurts those you dislike more, of lashing out because of and with vibes.
Mostly Harmless
At least one source has now seen the research report, claiming it shows nothing.
We don’t have any person on the other side claiming that the report shows something, or explaining what that something might be.
Everyone Means Everyone
It looks like when Anthropic took Mythos down, they really did fully take it down.
It appears that, because this was done in such a stupid fashion, Project Glasswing is cut off from Mythos. The clock is ticking before others get similar capabilities. I wonder what the spy agencies and major corporations think about this.
The ‘good’ news is that Mythos has presumably already found a lot more vulnerabilities that remain unpatched, and which Claude Opus 4.8 and GPT-5.5 are strong enough to help patch once they are found, so defensive work should continue.
Cyber leaders, according to Axios, are being clear that this move net harms our cyber security, because given who has access in what ways it helps defenders more than attackers. There is now an open letter to this effect, urging the government to restore access to Fable.
From the letter, calling this a pure unforced error that does nothing but net damage:
This Could Be The Good Scenario And Mostly A Misunderstanding
The action was definitely ‘vibe governing.’ The decision was some combination of ‘this seems vaguely spooky’ and ‘f*** Anthropic,’ not ‘we have a policy and a threshold.’
It could still be well intentioned.
My presumption is that what happened was pretty straightforward. Someone said ‘hey there is a jailbreak fix it’ and Dario said ‘this is harmless there is nothing to fix.’ The question remains, was the request ‘fix this particular jailbreak’ or ‘fix all jailbreaks pls tks?’
The above is the charitable interpretation. There is also the uncharitable one.
Or simply:
Kodus to Martin Casado for speaking out against what happened, and maybe even being convinced by argument to move from his initial position of ‘it’s a cyber weapon so any jailbreak is unacceptable’ even if he did end up blocking that guy. I will fully allow some amount of ‘Anthropic’s rhetoric did not make this easier’ if it is coupled with ‘and bad decisions are still bad’ rather than ‘so f*** Anthropic even if we all lose.’
The Next Step
Anthropic is flying various senior technical staff to Washington, who are spending today trying to sort this all out, which is absolutely what you do in this situation.
We will soon learn how that goes. Many next steps are possible.
The Worst Licensing Regime Is Fully Ad-Hoc
We now have more or less the worst possible licensing regime. It is fully ad-hoc, vibes based, and based on the whims of people who do not understand how AI works, and who we have no reason to assume are acting in good faith.
If you avoid all formal regulations and laws, and this results in regulation and law via executive fiat, that’s worse. You know that that’s worse right?
Neil Chilson and Adam Thierer, who actually have principles, do indeed realize that is worse, and say so.
Well said. What the White House is doing is terrible no matter your view on future AI capabilities, or AI risks, or the need to ‘win the AI race.’ It is bad all around, except that it centralizes power within the White House, via the threat of ad hoc shutdowns.
Remember when Trump was worried that a proposed Executive Order would harm American AI too much, so he did not sign it? This is so, so much worse for that, while also having many other problems.
Mark Dalton at R Street has a similar analysis, calling this The Fable Fiasco: A Bad Idea Applied Badly, pointing out that ITAR and KYC are not equipped for this task, and pointing out we will face real foreign-policy consequences for doing it this way.
We Are Showing We Are Unreliable Partners
As mentioned last time, but it bears repeating: One of those other problems is that this gives a huge kick to those who previously thought they were our allies and would be under our AI umbrella.
Do not forget that the European Union still has ASML.
Not only will such folks not trust the ‘American AI stack’ under these conditions, they will try to build a rival one. This risks driving them into the hands of China, and towards having their own chips and their own data centers under their own control, and towards use of non-American open models even though they are much worse.
The events here also do not bode well for American open models. If America is willing to put export controls on not only model weights but the model outputs, even when they are as heavily safeguarded as Fable, do you think they are not coming for your open models, with no classifiers that are not easily removed, with no safety training that cannot be undone by anyone who knows about obliteratus, that cannot be shut down once released?
Think harder about the implications of what is happening.
Our policy responses are going off the rails remarkably fast.
So I will close with a reminder of how badly we need rule of law, here, and how bad the alternative is already proving to be only weeks into the new regime.