Skip to main content
insidersfeed
Back to all news

Open-weight models

DeepSeek V4: the benchmark is the distraction. The chip pivot isn't.

1.6 trillion parameters, open weights, cheaper than the competition — and the real news is none of that.

The InsidersFeed DeskVerified April 2026

The answer

DeepSeek V4-Pro (1.6T params, MIT licence) shipped 24 April 2026 — but the chip story is what matters.

Let's get the easy part out of the way. On 24 April 2026 DeepSeek released two open-weight models under an MIT licence: V4-Pro (1.6 trillion total parameters, 49 billion active) and V4-Flash (284 billion total, 13 billion active). Both support 1 million-token contexts. V4-Pro lists at $1.74/M input (it launched on a 75% promo at $0.435/M); V4-Flash at $0.14. Weights are on Hugging Face, self-hostable, commercial-use allowed. Artificial Analysis put V4-Pro at #2 on the open-weight reasoning index, behind Kimi K2.6. Impressive? Yes. 'Best model in the world'? No — the CFR pegs the U.S. as still roughly seven months ahead. Those are the receipts.

The benchmark row nobody is reading

Here is the comparison the leaderboard crowd is missing:

DeepSeek V4-Pro Kimi K2.6 DeepSeek V3.2
Total params 1.6T 1.1T 671B
Active params 49B ~32B ~37B
Context 1M tokens 1M tokens 128K tokens
API input price (list) $1.74/M ~$2.00/M $0.27/M
Inference vs prev. 27% FLOPs baseline
Chip ecosystem Huawei Ascend Nvidia Nvidia

(V4-Pro launched on a 75% promo at $0.435/M input.) The context jump from 128K to 1M tokens between V3.2 and V4-Pro is real and matters for enterprise use cases. The inference efficiency story — 27% of V3.2's FLOPs — is also real. Neither of these is the headline in most V4 coverage, and both will outlast the benchmark comparison by 18 months.

V4 is the first DeepSeek model optimised for Chinese domestic chips, particularly Huawei's Ascend 950. The company reportedly gave early access only to Chinese chipmakers — signalling movement toward reducing dependence on U.S. semiconductor technology.

Source: MIT Technology Review · 24 April 2026

Why the chip pivot is the actual story

Every previous DeepSeek model ran on Nvidia hardware. V4 does not. Per MIT Technology Review, DeepSeek reportedly gave early access only to Chinese chipmakers; Huawei announced same-day Ascend 950 support — not a coincidence. What this tells you is that DeepSeek is engineering its future model development around a chip ecosystem the U.S. cannot cut off via export controls. Whether Ascend 950 is as efficient as H100 for inference is a separate technical question that will be answered in the months that follow. The strategic intent is not ambiguous.

The adoption race is the one that matters

CFR fellow Michael Horowitz noted that 'second-best models carry enormous competitive value when they are cheap and open', reframing the AI rivalry around adoption scale rather than benchmark supremacy.

Source: Council on Foreign Relations · 29 April 2026

The CFR analysis — published five days after launch — is worth reading in full. Its core argument is that the right frame for V4 is not 'does it beat GPT-5?' but 'will it be the default model for the next generation of AI applications built outside the U.S. and Europe?' At $1.74/M input with self-hosting rights, the answer could easily be yes for a significant portion of global developers. That is the strategic consequence, and it is not resolved by whether V4-Pro ranks #1 or #2 on an intelligence index.

File V4 correctly: a genuine open-weight leap, not the state of the art, priced to win the global adoption race, and built for the first time on hardware the U.S. can't sanction. DeepSeek claims V4-Pro tops some coding tests outright — its own report lists a LiveCodeBench score of 93.5 versus Claude Opus 4.6's 88.8, though that's a self-reported, not-yet-independently-confirmed number — yet on the independently administered composite Intelligence Index V4-Pro still sits second to Kimi K2.6, which is exactly the point: this is an excellent open model, not a frontier-beater, and its leverage is cost and openness rather than a leaderboard crown. The benchmark column is interesting. The chip column is load-bearing.

Frequently asked questions

Is DeepSeek V4 better than GPT-5 or Claude?
Not on raw capability across the board. The CFR put the overall U.S. AI lead at roughly seven months. Artificial Analysis ranked V4-Pro #2 in the open-weight reasoning index, behind Kimi K2.6. DeepSeek's own report claims it leads some coding benchmarks (a self-reported LiveCodeBench 93.5 vs Claude Opus 4.6's 88.8, not yet independently confirmed), but it trails the current frontier on the independent composite index.
Why does the Huawei chip partnership matter?
V4 is the first DeepSeek model engineered for Chinese domestic silicon (Huawei Ascend 950). Per MIT Technology Review, DeepSeek reportedly gave early access only to Chinese chipmakers. This signals that future DeepSeek development will proceed on hardware the U.S. export controls cannot reach — a structural shift, not a marketing decision.
What are the distillation allegations against DeepSeek?
Per the CFR, Anthropic and OpenAI accused DeepSeek of using more than 16 million interactions via over 24,000 fabricated accounts to extract capabilities from U.S. models. DeepSeek has not publicly addressed the allegations. CFR fellow Jessica Brandt said V4's capabilities 'reflect, at least in part, access to illicitly obtained U.S. intellectual property.'
How cheap is DeepSeek V4 compared to US frontier models?
V4-Pro lists at $1.74/M input — far cheaper than leading U.S. closed-frontier equivalents — and launched on a 75% promo at $0.435/M. V4-Flash is $0.14/M input. Both can be self-hosted for free under the MIT licence.

Sources

← All news