America tried to sell China H20s and China decided they didn’t want them and now Nvidia is halting related orders with suppliers.
I've not been following this. Is it possible this is a national security concern, as in China has the same fear America does that the other country is hiding spywear in the chips.
I suspect DeepSeek is unusually vulnerable to the problem of switching hardware because my expectation for their cost advantage fundamentally boils down to having invested a lot of effort in low-level performance optimization to reduce training/inference costs.
Switching the underlying hardware breaks all this work. Further, I don't expect the Huawei chips to be as easy to optimize as the Nvidia H-series, because the H-series are built mostly the same way as Nvidia has always built them (CUDA), and Huawei's Ascend is supposed to be a new architecture entirely. Lots of people know CUDA; only Huawei's people know how the memory subsystem for Ascend works.
If I am right, it looks like they got hurt by bad timing this round same way as they benefited from good timing last round.
Edit: Finally found a reasonable description of what happened. They were programming Nvidia hardware in assembly. My hardware switch guess is confirmed - this has wiped out their primary advantage. If they continue to fade, I think we could fairly assess them as a casualty of politics.
Why We Haven’t Seen v4 or r2
Why are we settling for v3.1 and have yet to see DeepSeek release v4 or r2 yet? The real world so often involves people acting so much stupider than you could write into fiction. America tried to sell China H20s and China decided they didn’t want them and now Nvidia is halting related orders with suppliers. DeepSeek says that the main restriction on their development is lack of compute, and the PRC responds not by helping them get better chips but by advising them to not use the chips that they have, greatly slowing things down at least for a while.Introducing DeepSeek v3.1
In any case, DeepSeek v3.1 exists now, and remarkably few people care?Signs of Life
There are some impressive scores here. A true 66 on SWE would be very strong.How Should We Update?
Even if no one finds anything to do with it, I don’t downgrade DeepSeek much for 3.1 not impressing compared to if they hadn’t released anything. It’s fine to do incremental improvements. They should do a v3.1 here. The dumbest style of reaction is when a company offers an incremental improvement (see: GPT-5) and people think that means it’s all over for them, or for AI in general, because it didn’t sufficiently blow them away. Chill out. It’s also not fair to fully pin this on DeepSeek when they were forced to do a lot of their training this year on Huawei Ascend chips rather than Nvidia chips. Assuming, that is, they are going to be allowed to switch back. Either way, the clock is ticking on v4 and r2.