Open-weight models
DeepSeek V4: the benchmark is the distraction. The chip pivot isn't.
1.6 trillion parameters, open weights, cheaper than the competition — and the real news is none of that.
The answer
DeepSeek V4-Pro (1.6T params, MIT licence) shipped 24 April 2026 — but the chip story is what matters.
Let's get the easy part out of the way. On 24 April 2026 DeepSeek released two open-weight models under an MIT licence: V4-Pro (1.6 trillion total parameters, 49 billion active) and V4-Flash (284 billion total, 13 billion active). Both support 1 million-token contexts. V4-Pro lists at $1.74/M input (it launched on a 75% promo at $0.435/M); V4-Flash at $0.14. Weights are on Hugging Face, self-hostable, commercial-use allowed. Artificial Analysis put V4-Pro at #2 on the open-weight reasoning index, behind Kimi K2.6. Impressive? Yes. 'Best model in the world'? No — the CFR pegs the U.S. as still roughly seven months ahead. Those are the receipts.
The benchmark row nobody is reading
Here is the comparison the leaderboard crowd is missing:
| DeepSeek V4-Pro | Kimi K2.6 | DeepSeek V3.2 | |
|---|---|---|---|
| Total params | 1.6T | 1.1T | 671B |
| Active params | 49B | ~32B | ~37B |
| Context | 1M tokens | 1M tokens | 128K tokens |
| API input price (list) | $1.74/M | ~$2.00/M | $0.27/M |
| Inference vs prev. | 27% FLOPs | — | baseline |
| Chip ecosystem | Huawei Ascend | Nvidia | Nvidia |
(V4-Pro launched on a 75% promo at $0.435/M input.) The context jump from 128K to 1M tokens between V3.2 and V4-Pro is real and matters for enterprise use cases. The inference efficiency story — 27% of V3.2's FLOPs — is also real. Neither of these is the headline in most V4 coverage, and both will outlast the benchmark comparison by 18 months.
V4 is the first DeepSeek model optimised for Chinese domestic chips, particularly Huawei's Ascend 950. The company reportedly gave early access only to Chinese chipmakers — signalling movement toward reducing dependence on U.S. semiconductor technology.
Why the chip pivot is the actual story
Every previous DeepSeek model ran on Nvidia hardware. V4 does not. Per MIT Technology Review, DeepSeek reportedly gave early access only to Chinese chipmakers; Huawei announced same-day Ascend 950 support — not a coincidence. What this tells you is that DeepSeek is engineering its future model development around a chip ecosystem the U.S. cannot cut off via export controls. Whether Ascend 950 is as efficient as H100 for inference is a separate technical question that will be answered in the months that follow. The strategic intent is not ambiguous.
The adoption race is the one that matters
CFR fellow Michael Horowitz noted that 'second-best models carry enormous competitive value when they are cheap and open', reframing the AI rivalry around adoption scale rather than benchmark supremacy.
The CFR analysis — published five days after launch — is worth reading in full. Its core argument is that the right frame for V4 is not 'does it beat GPT-5?' but 'will it be the default model for the next generation of AI applications built outside the U.S. and Europe?' At $1.74/M input with self-hosting rights, the answer could easily be yes for a significant portion of global developers. That is the strategic consequence, and it is not resolved by whether V4-Pro ranks #1 or #2 on an intelligence index.
File V4 correctly: a genuine open-weight leap, not the state of the art, priced to win the global adoption race, and built for the first time on hardware the U.S. can't sanction. DeepSeek claims V4-Pro tops some coding tests outright — its own report lists a LiveCodeBench score of 93.5 versus Claude Opus 4.6's 88.8, though that's a self-reported, not-yet-independently-confirmed number — yet on the independently administered composite Intelligence Index V4-Pro still sits second to Kimi K2.6, which is exactly the point: this is an excellent open model, not a frontier-beater, and its leverage is cost and openness rather than a leaderboard crown. The benchmark column is interesting. The chip column is load-bearing.
Frequently asked questions
Is DeepSeek V4 better than GPT-5 or Claude?
Why does the Huawei chip partnership matter?
What are the distillation allegations against DeepSeek?
How cheap is DeepSeek V4 compared to US frontier models?
Sources
- DeepSeek V4 Preview Release — DeepSeek, 24 April 2026
- Three reasons why DeepSeek's new model matters — MIT Technology Review, 24 April 2026
- DeepSeek is back among the leading open-weights models with V4 Pro and V4 Flash — Artificial Analysis, 27 April 2026
- DeepSeek V4 signals a new phase in the US–China AI rivalry — Council on Foreign Relations, 29 April 2026
- DeepSeek previews new AI model that 'closes the gap' with frontier models — TechCrunch, 24 April 2026