DeepSeek

China-based DeepSeek disrupted global AI markets in early 2025 with open-weight models competitive with US frontier labs but trained at a fraction of the cost.

AI· ·4 takes ·Jul 3, 2026

What it is

DeepSeek (深度求索) is a Chinese AI research laboratory headquartered in Hangzhou, Zhejiang, China. Liang Wenfeng, co-founder of High-Flyer Quant, a Chinese quantitative trading firm, spun DeepSeek off from High-Flyer in July 2023 and serves as CEO of both companies. High-Flyer remains the principal backer. DeepSeek develops large language models spanning dense and mixture-of-experts (MoE) architectures, releasing most weights publicly under permissive licenses. As of July 2026, the lab runs 101 model repositories on Hugging Face. Its stated mission is fundamental AGI research, and it has operated for most of its history without outside investors.

History

DeepSeek shipped DeepSeek Coder on 2 November 2023, and the base LLM series on 29 November 2023. V2 (May 2024, 236 billion total parameters) introduced multi-head latent attention and MoE. V3 (December 2024) scaled to 671 billion total parameters with roughly 37 billion activated per token, trained on approximately 14 trillion tokens at a claimed cost of US$6 million, far below the US$100 million or more that US labs spent on comparable models.

The inflection point came on 20 January 2025. DeepSeek released R1, a reasoning model trained via reinforcement learning using Group Relative Policy Optimization on a V3 backbone. R1 matched or exceeded OpenAI o1 on several public benchmarks while being available as an open-weight download. Markets reacted sharply: on 27 January 2025, Nvidia's stock fell approximately 17%, erasing roughly US$589 billion in market capitalisation, the largest single-day market-cap loss in US stock market history at the time. The Nasdaq fell more than 3% that day. Shortly after R1's release, Liang Wenfeng participated in a symposium with Chinese Premier Li Qiang, signalling Chinese government support.

Current state

DeepSeek's frontier product as of July 2026 is the V4 family, previewed on 24 April 2026. V4-Pro carries 1.6 trillion parameters (49 billion active) and V4-Flash runs 284 billion (13 billion active); both carry 1 million-token context windows and are released as open weights on Hugging Face. V4-Flash uses hybrid attention that cuts FLOPs to roughly 10% of V3's at long context. The lab is raising its first outside capital: as of May 2026, it was in talks for a round at a US$45 billion valuation, led by China's state-backed China Integrated Circuit Industry Investment Fund, with cloud giants Tencent and Alibaba also in discussions. Liang Wenfeng holds approximately 84% ownership; he sought outside capital primarily to offer employees equity as competitor poaching intensified. The lab has adapted its training stack to work with Huawei Ascend chips, insulating it partly from US chip-export restrictions.

Relationships

DeepSeek is the most downloaded Chinese open-weight AI lab globally and the primary competitive reference for Chinese peers, including Zhipu AI's GLM-5.2 and MiniMax M3. Internationally, its open-weight strategy intensifies pressure on Meta AI's Llama series, while its low-cost training claims challenge the capital-intensive narratives of US closed-source labs. DeepSeek's efficiency methods, particularly MoE routing, RL-based reasoning, and low-cost training, have influenced both US and Chinese competitor designs. The open-vs-closed frontier debate is substantially shaped by DeepSeek's continued decision to release weights, which spreads Chinese AI research globally even as US export controls constrain its hardware options.

What to watch

Whether the US$45 billion funding round closes and how state-backed capital changes DeepSeek's research autonomy. Whether the V4 final release delivers on preview benchmarks against GPT-5.x and Gemini 3.x. How China's domestic chip ecosystem, primarily Huawei Ascend and domestic DRAM, constrains or enables the next model generation. Whether US regulators move to target open-weight Chinese AI releases specifically. How successfully Liang Wenfeng's equity programme retains researchers against offers from larger Chinese tech firms.

the record · 3

DeepSeek (company website) — DeepSeek's official homepage: describes the company as a Chinese AI research lab dedicated to AGI, based in Hangzhou, with links to its chat product, developer API, mobile app, and open-source model repositories.

DeepSeek (Hugging Face organisation) — DeepSeek's Hugging Face organisation hosting 101 open-weight model repositories, including V4-Pro (1.6T parameters, 1.18M downloads) and V4-Flash (2.06M downloads); the primary distribution channel for all public model weights.

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning (arXiv 2501.12948) — DeepSeek's January 2025 technical paper describing R1: reinforcement learning via Group Relative Policy Optimization applied to a V3 backbone, achieving reasoning performance competitive with OpenAI o1 at a fraction of the training cost.

Regional takes · 1

▸ funding and company profile

TechCrunch · United States · en · May 6, 2026

May 2026 report on DeepSeek's first venture-capital raise, documenting the valuation climb from US$20 billion to US$45 billion, the state-backed lead investor, Tencent and Alibaba participation, and Liang Wenfeng's equity-retention rationale.

Source ↗