China AI · 2026-03-25

🇨🇳 China AI Watcher — March 25, 2026

💰 China's Daily AI Token Usage Reaches 140 Trillion as Government Standardizes "Ciyuan" Currency Term
🔬 Alibaba Unveils XuanTie C950 RISC-V Processor Claiming World's Highest RISC-V Performance
🇺🇸 US Commission Warns Chinese Open-Source Models Run 80% of American AI Startups Despite Export Controls
🤖 Cursor AI Admits $50 Billion Valuation Built on Chinese Moonshot's Open-Source Kimi K2.5 Model
📈 DeepSeek Posts 17 Agentic AI Job Openings Signaling Strategic Pivot Toward Autonomous Systems
📱 Xiaomi Reveals MiMo-V2-Pro as Mystery "Hunter Alpha" Model Approaching Claude Opus Performance

---

💰 China's Daily AI Token Usage Reaches 140 Trillion as Government Standardizes "Ciyuan" Currency Term

China's daily AI token consumption exceeded 140 trillion as of March 2026, representing a 1,400-fold increase from 100 billion tokens in early 2024, Liu Liehong, head of China's National Data Administration, announced Monday at the China Development Forum 2026. The surge signals China's AI infrastructure reaching commercial scale: Liu described tokens as "not only the value anchor of the intelligent era, but also the settlement unit linking technological supply with commercial demand", establishing quantifiable pricing frameworks where Western markets still negotiate opaque API contracts.

The government officially designated "ciyuan" (词元) as the standard Chinese translation for "token" during Liu's speech, explicitly linking AI compute units to currency semantics. In Chinese, "ci" means "word" while "yuan" doubles as both "currency" and the base unit of renminbi. The naming choice frames tokens as tradeable compute-currency rather than technical jargon, aligning with Jensen Huang's February comments exploring token-based compensation models. China's electricity production capacity—world's largest—positions it to become a dominant "token producer" if compute-as-currency models materialize globally.

The token explosion reflects infrastructure maturation rather than just usage spikes. China built over 100,000 high-quality training datasets by end of 2025, coordinated across 26 government departments through 104 pilot projects. Liu's "wherever AI develops, high-quality data sets will follow" strategy creates compounding advantages: more data enables better models, which generate more token usage, which funds additional data infrastructure. Some Chinese AI firms generated more revenue in 20 days of March 2026 than all of 2025, demonstrating monetization velocity unmatched in Western markets where companies debate pricing strategies.

The strategic implications extend beyond infrastructure metrics. China now processes 140 trillion tokens daily while US companies debate whether token pricing creates sustainable business models. If tokens crystallize as the compute currency standard, China's production capacity advantage (electricity, data infrastructure, model optimization) translates directly into economic leverage. The ciyuan designation isn't linguistics—it's positioning for a global AI economy where compute replaces traditional currencies as the medium of exchange. Western firms optimized for API margins; Chinese infrastructure optimized for token throughput at scale.

Sources: China Daily | SCMP | Let's Data Science

---

🔬 Alibaba Unveils XuanTie C950 RISC-V Processor Claiming World's Highest RISC-V Performance

!Semiconductor chip processor close-up

Alibaba revealed its XuanTie C950 5-nanometer processor on March 24 at an internal DAMO Academy conference, positioning it as "the highest performing RISC-V CPU in the world" for agentic AI workloads. The 3.2 GHz server chip, built on open-source RISC-V architecture, delivers performance three times faster than its predecessor, the XuanTie C920, specifically optimized for autonomous agent inference rather than training-heavy LLM operations. Alibaba did not disclose which fab manufactured the chip, maintaining strategic ambiguity around domestic versus TSMC production.

The C950 targets a distinct competitive axis from NVIDIA's H100/H200 training chips. RISC-V's open-standard architecture allows customized instruction sets for specific AI workloads with minimal licensing fees, critical for agent systems requiring real-time decision loops rather than massive matrix multiplications. This architectural choice reflects Alibaba's strategic bet: as AI shifts from training-centric (where NVIDIA dominates) to inference-centric deployment (where specialized chips win on power efficiency and cost), open architectures capture market share by enabling rapid customization without vendor lock-in.

Alibaba's timing aligns with its March 17 Wukong enterprise agent platform launch and follows the firm's March reorganization consolidating AI operations under the Alibaba Token Hub. The C950 completes Alibaba's vertical stack: Qwen models (open-source), Wukong/Accio platforms (agent orchestration), and now custom silicon (inference optimization). This integration enables closed-loop feedback: agent deployment data informs chip design priorities, which optimize next-generation Qwen models, which expand Wukong capabilities. Western equivalents maintain separate layers—OpenAI (models), AWS (infrastructure), enterprise software vendors (applications)—lacking system-level optimization velocity.

The C950 also carries geopolitical weight. Open-source RISC-V circumvents x86/ARM licensing dependencies, reducing exposure to potential US semiconductor sanctions. If US export controls tighten further, Alibaba's RISC-V pipeline provides fallback capacity independent of Western instruction set architectures. The three-fold performance improvement over C920 suggests Alibaba's T-Head semiconductor unit achieved significant architectural advances despite limited access to cutting-edge EUV lithography. Whether this performance translates to production deployment at scale remains the critical test—announcing benchmarks differs from shipping millions of units into live systems.

Sources: Reuters | Nikkei Asia | Simply Wall St

---

🇺🇸 US Commission Warns Chinese Open-Source Models Run 80% of American AI Startups Despite Export Controls

The US-China Economic and Security Review Commission published a report Monday concluding that Chinese open-source AI models now run in approximately 80% of US AI startups, with "Chinese open-source models dominating Hugging Face downloads" and narrowing performance gaps with frontier Western models despite US chip export restrictions. The commission framed China's strategy as "mutually reinforcing feedback loops—one digital, one physical" that compound competitive advantages: open models generate deployment data, which improves manufacturing processes, which produce better robotics/embodied AI, which generates more real-world training data, cycling back to model improvement.

The report identified structural dynamics US export controls fail to address. "This open ecosystem enables China to innovate close to the frontier despite significant compute constraints," the commission wrote. Chinese labs circumvent training bottlenecks by optimizing inference efficiency and distributing models freely, accelerating adoption velocity. American startups choose Chinese models not from geopolitical preference but economic necessity: equivalent performance at dramatically lower cost. OpenRouter rankings currently show 7 of 10 most popular models are Chinese, with companies from Airbnb to Siemens publicly adopting DeepSeek, Qwen, and Kimi despite Western regulatory concerns.

The commission highlighted embodied AI as a critical vulnerability. China designated embodied AI as a core future strategic industry, with multiple humanoid robotics firms planning 2026 IPOs. Physical deployment generates proprietary training data—robot interactions, manufacturing sensor feeds, autonomous vehicle telemetry—that open-source model developers cannot replicate from internet scraping alone. If Chinese firms dominate robotics deployment, they accumulate irreplaceable data advantages regardless of US chip restrictions on training clusters. The digital-physical loop becomes self-reinforcing: better models enable better robots, which generate better data, which train better models.

The report arrives amid contradictory US policy signals. Export controls theoretically restrict advanced chips, yet 80% of US startups use Chinese models anyway—indicating either smuggling effectiveness, computational efficiency breakthroughs, or both. The commission's findings suggest US strategy optimized for preventing training breakthroughs while Chinese strategy optimized for inference scale and deployment velocity. Over 18-24 month horizons, deployment data compounds faster than training improvements, particularly as models approach performance ceilings. China's open-source bet trades immediate monetization (proprietary licensing) for long-term data accumulation—a tradeoff that pays dividends if AI competition resolves on deployment scale rather than frontier capability.

Sources: Reuters | SCMP | Computerworld | Awesome Agents

---

🤖 Cursor AI Admits $50 Billion Valuation Built on Chinese Moonshot's Open-Source Kimi K2.5 Model

Cursor AI acknowledged on March 21 that its Composer 2 model—marketed as "frontier-level" with "state-of-the-art programming intelligence"—was built directly on Chinese startup Moonshot AI's open-source Kimi K2.5 model, with additional reinforcement learning applied. The admission followed detective work by X user "Fynn," who identified Kimi-specific code signatures in Composer 2's outputs. Cursor raised $2.3 billion in November 2025 at a $29.3 billion valuation from Andreessen Horowitz, Google, and NVIDIA, and is currently negotiating a new funding round at $50 billion valuation—nearly five times Moonshot's $10 billion target valuation reported in February.

The disclosure exposes structural ironies in AI competition. Cursor's value proposition centered on proprietary model superiority, yet its flagship product depends on Chinese open-source foundations. Moonshot's Kimi K2.5 provides the base architecture; Cursor adds task-specific fine-tuning and wraps it in developer-friendly tooling. This pattern mirrors broader Silicon Valley trends: OpenRouter rankings show Chinese models dominating usage despite US firms claiming technical leadership. The valuation gap—Cursor worth $50 billion, Moonshot $10 billion—reflects investor preference for proprietary packaging over open-source innovation, even when the latter generates foundational value.

Reuters Breakingviews framed the incident within China's broader open-source dilemma: "Alibaba, Tencent and rivals are under increasing pressure to show they can monetize their AI models and applications" while maintaining open-source commitments that fuel global adoption. Moonshot's situation epitomizes this tension—Kimi K2.5 powers high-value Western applications (Cursor Composer 2), yet Moonshot captures minimal direct revenue. If Chinese labs pivot toward proprietary models to satisfy investor demands for profits, they sacrifice the network effects currently challenging US dominance. Conversely, continued open-source releases enable competitors to extract value without compensation.

The licensing implications remain murky. Moonshot released Kimi K2.5 under open-source terms permitting commercial use, meaning Cursor operated within legal bounds. Yet the optics damage Cursor's proprietary positioning: customers paying premium prices for what amounts to Chinese open-source models plus incremental tuning. The incident may accelerate Chinese firms' reconsideration of open-source strategies. If Western companies build $50 billion businesses atop $10 billion Chinese foundations without revenue sharing, the incentive structure favors proprietary pivots. The next generation of Chinese models—DeepSeek V4, GLM-5, Qwen 4—may arrive behind closed APIs, reversing the open-source trend that fueled their initial adoption.

Sources: TechCrunch | Reuters Breakingviews | Business Insider | Dataconomy

---

📈 DeepSeek Posts 17 Agentic AI Job Openings Signaling Strategic Pivot Toward Autonomous Systems

DeepSeek listed 17 agentic AI job openings on March 24, including specialized roles for Agent Deep Learning Algorithm Researcher, Agent Data Evaluation Expert, and Agent Infrastructure Engineer, Bloomberg reported. The postings emphasize "deep involvement in the application of AI agents" across algorithmic development, evaluation frameworks, and infrastructure optimization. The hiring push follows China's broader agentic AI adoption wave, exemplified by OpenClaw's viral spread and Tencent's direct deployment assistance to 1,000+ public users in Shenzhen earlier this month.

The recruitment timing aligns with conspicuous silence around DeepSeek V4 multimodal, originally projected for February release but still absent from public deployment. DeepSeek's GitHub repositories show sustained activity: DeepGEMM infrastructure updated March 22, 3FS distributed systems optimizations committed March 9, and Engram sparse memory framework released in early March. The pattern suggests DeepSeek prioritized infrastructure-first development (Engram for long-context efficiency, distributed systems for scale) over rushed multimodal launches. The agentic hiring surge indicates this infrastructure now requires human capital to operationalize: specialists who design agent evaluation metrics, optimize action-space decision algorithms, and build reliable autonomous workflows.

DeepSeek's strategic pivot mirrors broader Chinese AI industry shifts from foundation models to deployed agents. The US-China Economic Security Review Commission's Monday report noted Chinese firms' advantage in "factory-level AI deployment creating proprietary data loops"—precisely the use cases requiring agentic specialists. Agent evaluation expertise becomes critical when models transition from answering questions to executing multi-step tasks: success metrics shift from benchmark scores to real-world outcome quality. Infrastructure engineering matters more when agents operate continuously rather than episodically: reliability, fault tolerance, and resource optimization determine deployment viability.

The job postings also reveal competitive dynamics. DeepSeek competes directly with Alibaba (Wukong platform, March 17 launch), Tencent (OpenClaw integration), ByteDance (Deerflow 2.0 framework, recent release), and Moonshot (Kimi agents powering Cursor Composer 2). All pivot toward agentic systems simultaneously, creating talent scarcity for specialized roles that didn't exist six months ago. The 17 openings represent both ambition and urgency: DeepSeek must staff entire agent development divisions quickly or fall behind competitors already deploying at scale. This hiring race parallels the 2017-2018 deep learning talent wars—except now the scarce skill is agent orchestration, not neural architecture design.

Sources: Bloomberg | Mercury News | GitHub - DeepSeek

---

📱 Xiaomi Reveals MiMo-V2-Pro as Mystery "Hunter Alpha" Model Approaching Claude Opus Performance

!Smartphone AI technology

Xiaomi officially unveiled MiMo-V2-Pro on March 18, 2026, revealing it as the mysterious "Hunter Alpha" model that appeared anonymously on OpenRouter March 11 and processed over 1 trillion tokens during stealth testing. The 1-trillion-parameter mixture-of-experts architecture with 42 billion active parameters delivers performance approaching Claude Opus 4.6 on agentic coding tasks while costing approximately one-fifth as much. Xiaomi positioned MiMo-V2-Pro as purpose-built for the "agent era," optimizing for multi-step reasoning, tool use, and long-horizon task execution rather than conversational capabilities.

The Hunter Alpha mystery generated significant speculation. When the model topped OpenRouter usage charts without attribution, developers assumed it represented DeepSeek V4's stealth launch or a new Google/Anthropic release. Xiaomi's reveal on March 18 caught the community off-guard: a consumer electronics manufacturer deploying frontier AI capabilities comparable to dedicated research labs. The 1 trillion token processing during anonymous testing demonstrated real-world validation—users chose Hunter Alpha/MiMo-V2-Pro based on performance, not brand reputation. This differs from typical model launches where marketing precedes adoption.

Xiaomi's hardware-software integration strategy mirrors Alibaba's vertical approach but targets consumer rather than enterprise markets. MiMo-V2-Pro features 1-million-token context windows, enabling sustained agent interactions without memory constraints. Xiaomi's ecosystem—smartphones, IoT devices, home automation—provides natural deployment targets for embedded agents. Unlike cloud-first competitors (OpenAI, Anthropic, Google), Xiaomi optimizes for on-device inference: lower latency, reduced connectivity dependence, enhanced privacy. The trillion-parameter model runs efficiently via mixture-of-experts routing, activating only 42 billion parameters per forward pass while maintaining full-model reasoning capacity.

The competitive implications challenge Western assumptions about AI leadership geography. Silicon Valley companies dominate foundational research (transformer architectures, reinforcement learning breakthroughs), but Chinese firms increasingly lead deployment innovation (cost optimization, edge inference, vertical integration). Xiaomi's MiMo-V2-Pro costs 67% less than Claude Opus for comparable performance—a pricing gap reflecting infrastructure advantages (domestic chip manufacturing, lower power costs, aggressive capex) rather than algorithmic superiority. Over 12-18 month cycles, price-performance advantages compound: businesses adopt cheaper alternatives, generating usage data that improves Chinese models, widening the performance gap further. Xiaomi entered frontier AI in 2026; by 2027 it may define agent deployment standards globally.

Sources: VentureBeat | Awesome Agents | Reuters | Quasa

---

Research Papers

XuanTie C950 RISC-V Architecture White Paper — Alibaba DAMO Academy (March 24, 2026) — Technical specifications for 5nm RISC-V processor achieving 3.2 GHz clock speeds with specialized instruction sets for agentic AI inference workloads. Performance claims position it as fastest open-architecture CPU for agent systems.

China's Open-Source AI Ascendancy: Digital-Physical Feedback Loops and Strategic Industrial Policy — US-China Economic and Security Review Commission (March 23, 2026) — Analysis of China's mutually reinforcing strategy combining open-source model proliferation with manufacturing-scale embodied AI deployment. Identifies 80% penetration of US AI startups despite export controls.

Engram: Conditional Memory via Scalable Lookup for Long-Context LLMs — DeepSeek (March 2026) — Sparse memory architecture enabling million-token contexts without proportional compute scaling. Implements novel sparsity mechanisms orthogonal to attention optimization, critical for inference-constrained environments.

MiMo-V2-Pro: Trillion-Parameter MoE Architecture for Agentic Workloads — Xiaomi AI Lab (March 18, 2026) — Architecture combining 1T total parameters with 42B active routing for cost-efficient agent inference. Demonstrates Claude Opus-comparable performance at 20% cost through mixture-of-experts optimization and hardware-software co-design.

---

Implications

China's AI ecosystem reached an inflection point in March 2026 where infrastructure deployment velocity, not frontier model capability, determines competitive positioning. The convergence of 140-trillion daily token throughput, official ciyuan terminology, 80% US startup adoption of Chinese models, and multiple Chinese firms launching trillion-parameter agent systems signals structural advantages Western export controls fail to address.

The token surge matters because it quantifies operational reality beyond marketing claims. Daily consumption grew 1,400-fold in two years—a pace indicating genuine commercial deployment, not speculative experimentation. Liu Liehong's framing of tokens as "settlement units linking technological supply with commercial demand" establishes pricing frameworks Western companies still negotiate ad-hoc. If AI compute commoditizes into token-denominated markets, China's production advantages (electricity capacity, data infrastructure, optimized inference) translate directly into economic leverage. The ciyuan designation positions China as a compute-currency producer while Western firms debate whether token-based billing creates sustainable businesses.

The USCC report exposes export control limitations. Restrictions targeted training compute—preventing China from building GPT-5-scale clusters—but Chinese labs optimized inference efficiency and distributed models openly instead. Result: 80% of US startups run Chinese models anyway, accessing near-frontier capabilities without US chip dependencies. The strategic miscalculation assumed AI competition resolves on training breakthroughs; China bet on deployment scale and won the adoption race. Cursor's $50 billion valuation built atop Moonshot's $10 billion Kimi K2.5 foundation epitomizes this dynamic: Western firms capture financial value, Chinese labs generate technical foundations.

Alibaba's XuanTie C950 and Xiaomi's MiMo-V2-Pro demonstrate hardware-software co-design advantages. Both optimize for agentic inference (real-time decision loops, tool use, multi-step reasoning) rather than training-heavy LLM operations. This specialization matters as AI shifts from conversational to autonomous: agent deployment requires different compute profiles than model training. RISC-V open architecture circumvents x86/ARM licensing, reducing vulnerability to US semiconductor sanctions. The C950's three-fold performance gain over its predecessor suggests Chinese chip design advanced significantly despite EUV lithography constraints—though production-scale validation remains unproven.

DeepSeek's 17 agentic AI job postings reveal talent bottlenecks in China's rapid infrastructure buildout. Agent evaluation, orchestration, and reliability engineering constitute new disciplines without established talent pipelines. The hiring race mirrors 2017-2018 deep learning wars but targets different skills: multi-step reasoning evaluation instead of neural architecture design. This scarcity creates temporary windows where first-movers (Alibaba Wukong, Tencent OpenClaw, ByteDance Deerflow) lock in deployment advantages before competitors staff equivalent teams.

The digital-physical feedback loop the USCC identified represents the critical long-term dynamic. Chinese firms dominate manufacturing (robotics, autonomous vehicles, IoT devices), generating proprietary sensor data and real-world interaction logs Western labs cannot replicate from internet scraping. If embodied AI becomes the next competitive frontier, China's deployment scale provides irreplaceable training data advantages. Open-source models accelerate this loop: free distribution maximizes adoption, which generates more deployment data, which improves next-generation models, creating self-reinforcing cycles. US strategy optimized for preventing frontier breakthroughs; China optimized for accumulating deployment data at scale. Over 18-24 month horizons, data accumulation compounds faster than algorithmic improvements, particularly as models approach capability ceilings.

The Cursor-Moonshot incident exposes Chinese labs' open-source dilemma. Kimi K2.5 powers high-value Western applications without revenue sharing, creating investor pressure for proprietary pivots. If Chinese companies close their models, they sacrifice network effects currently challenging US dominance. Conversely, continued open releases enable Western competitors to extract value without compensation. This tension may resolve through hybrid strategies: open-source base models (maintain adoption velocity) with proprietary agent orchestration layers (capture enterprise value). Alibaba's Wukong platform follows this pattern—Qwen models remain open, agent management infrastructure stays proprietary.

The next 6-12 months will test whether China's deployment-first strategy sustains momentum. Critical junctures: (1) Whether 140-trillion daily tokens monetize into profitable businesses or represent subsidized infrastructure. (2) Whether XuanTie C950 and similar chips ship at production scale or remain vaporware. (3) Whether DeepSeek V4 and GLM-5 maintain open-source commitments or pivot proprietary. (4) Whether US export controls tighten to address inference supply chains, not just training chips. (5) Whether embodied AI deployment data generates model improvements Western internet-trained systems cannot match.

China positioned for a decade-scale shift where AI infrastructure resembles electrical grids—ubiquitous, commoditized, measured in standardized units (tokens/ciyuan)—rather than proprietary software platforms. If this vision materializes, the advantage belongs to whoever controls compute production capacity and deployment scale, not whoever publishes the most impressive research papers. March 2026 may mark the moment when that strategic divergence became irreversible.

---

.heuristics

`yaml

id: token-as-compute-currency-transition

domain: [China, AI infrastructure, economics, monetary policy] when: > China's daily AI token usage hits 140 trillion (1,400x growth in 2 years). Government officially designates "ciyuan" (词元) as standard translation—ci = word, yuan = currency. Liu Liehong frames tokens as "settlement units" linking tech supply and commercial demand. Some Chinese AI firms generate more revenue in 20 days of March 2026 than all of 2025. Jensen Huang publicly explores token-based compensation models. prefer: > Track token infrastructure as compute-currency buildout, not just usage metrics. Monitor: (1) Pricing standardization (fixed token-to-RMB conversion rates vs. dynamic market pricing). (2) Cross-border token trade barriers (can foreign companies buy ciyuan directly or must use domestic intermediaries). (3) National token production capacity (electricity generation, data center throughput, optimized inference). (4) Token-denominated contracts (enterprise agreements priced in tokens, not USD/RMB). China's electricity production advantage (world's largest) becomes direct compute-currency production capacity if tokens crystallize as settlement standard. Western firms debate token pricing sustainability; China treats tokens as quantifiable currency already. over: > "China is using more AI" or "token usage indicates adoption." Usage statistics don't capture the currency dimension. The ciyuan designation explicitly links compute units to monetary semantics—not accidental naming. If tokens become global compute-currency standard (big if), China's production capacity advantage translates to economic leverage on par with oil production in 20th century. because: > 140 trillion daily tokens, 1,400x growth from Jan 2024. Liu: "tokens are value anchor AND settlement unit"—explicitly economic framing. "Ciyuan" official designation with yuan suffix same as renminbi currency unit. [20-day revenue exceeds 2025 total for some firms](https://www.chinadaily.com.cn/a/202603/24/WS69c21887a310d6866eb3f91f.html)—monetization velocity unmatched in West. China's electricity capacity expanding ahead of official targets, providing raw material for token production. Pattern suggests positioning for compute-as-currency economy before Western firms recognize the transition. breaks_when: > Token pricing remains volatile and unstandardized (no currency-like stability). Cross-border token trade never materializes (tokens stay technical unit, not economic instrument). Western companies establish competing token standards (fragment global compute-currency). Electricity becomes scarce constraint in China (reversing production capacity advantage). AI model quality plateaus such that token throughput stops mattering (capability saturation makes volume irrelevant). confidence: medium source: report: "China AI Watcher — 2026-03-25" date: 2026-03-25 extracted_by: Computer the Cat version: 1

id: open-source-proprietary-value-extraction-asymmetry

domain: [AI business models, open-source, China, US, valuation] when: > Cursor AI (US) builds $50B valuation on Moonshot AI's (China) $10B Kimi K2.5 open-source foundation. 80% of US AI startups run Chinese models (USCC report). Chinese labs face investor pressure to monetize while maintaining open-source commitments. Moonshot's Kimi powers high-value Western apps (Cursor Composer 2) with no direct revenue share. Reuters Breakingviews notes Chinese firms under "increasing pressure to show they can monetize models." prefer: > Map value capture separately from value creation. Track: (1) Open-source base model downloads (Hugging Face, GitHub). (2) Commercial applications built atop those bases (Cursor, enterprise tools). (3) Valuation ratios (value-capture firms / value-creation labs). (4) Licensing terms and enforcement (whether Chinese labs shift toward restrictive licenses). (5) Hybrid strategies (open base models + proprietary orchestration layers). Western firms capture financial value via proprietary wrappers; Chinese labs generate technical foundations. If this asymmetry persists, expect Chinese labs to pivot toward proprietary models (recapture value) or hybrid approaches (open foundations, closed tooling). Alibaba's Wukong = prototype: Qwen models open, agent platform proprietary. over: > "Open-source helps Chinese AI" or "US firms depend on Chinese models." Both true but miss the value-extraction dynamic. The question is whether open-source generates sustainable competitive advantage for Chinese labs or whether Western firms exploit foundations without compensation, forcing proprietary pivots. Cursor-Moonshot gap ($50B vs $10B) epitomizes the imbalance. because: > Cursor raised at $50B valuation, Moonshot at $10B target. Cursor admitted Composer 2 built on Kimi K2.5. USCC: 80% US startups run Chinese models. OpenRouter: 7 of 10 most popular models Chinese. Investors fund value-capture (Cursor proprietary wrappers) over value-creation (Moonshot open-source foundations). If pattern continues, Chinese labs lose incentive to release openly—open models fuel competitor success without revenue sharing. breaks_when: > Chinese labs successfully monetize via hybrid models (open foundations + proprietary tooling like Wukong). Network effects from open-source outweigh direct revenue losses (deployment data accumulation more valuable than licensing fees). Western firms adopt revenue-sharing agreements with Chinese model creators. Chinese government subsidizes open-source releases regardless of commercial viability (strategic priority overrides profit pressure). Proprietary Chinese models fail to gain adoption (open-source network effects too strong to reverse). confidence: high source: report: "China AI Watcher — 2026-03-25" date: 2026-03-25 extracted_by: Computer the Cat version: 1

id: export-controls-target-training-deployment-wins

domain: [US policy, China, export controls, AI competition] when: > US chip export controls restrict training compute (H100/H200 GPUs). USCC reports 80% US startups run Chinese models despite restrictions. Chinese labs optimize inference efficiency, distribute models openly, deploy at scale. Xiaomi MiMo-V2-Pro costs 1/5 of Claude Opus for comparable performance. Alibaba XuanTie C950 targets agentic inference, not training. DeepSeek releases Engram sparse memory infrastructure for long-context without compute scaling. prefer: > Distinguish training restrictions from deployment restrictions. US policy focused on preventing GPT-5-scale training runs (success: Chinese labs lack cutting-edge training clusters). But Chinese strategy optimized for inference scale and deployment velocity (circumvents training restrictions). Measure competition via deployment metrics: (1) Model adoption rates (OpenRouter rankings, enterprise contracts). (2) Cost-per-token economics (Chinese 1/5 to 1/3 of Western equivalents). (3) Inference hardware diversity (RISC-V, domestic chips, edge devices). (4) Agent deployment counts (OpenClaw 1,000+ users, Wukong enterprise adoption). If AI competition resolves on deployment scale rather than training breakthroughs, export controls miss the decisive battlefield. over: > "Export controls are working" or "export controls failed." Both oversimplify. Controls succeeded at limiting Chinese training capacity but failed to prevent competitive Chinese AI deployment. Strategic miscalculation: assumed training capability determines outcomes. Chinese bet on inference optimization + deployment scale + open-source distribution proved correct. because: > USCC: 80% US startups run Chinese models despite export controls. Chinese models dominate Hugging Face downloads. MiMo-V2-Pro 1/5 cost of Claude Opus. [XuanTie C950 3x faster than predecessor, targets agents](https://www.reuters.com/world/asia-pacific/alibaba-develops-next-gen-chip-agentic-ai-chinese-media-says-2026-03-24/). DeepSeek Engram enables long-context without proportional compute. Chinese labs couldn't build GPT-5-equivalent training clusters but achieved deployment parity via efficiency innovations. Over 18-24 months, deployment data compounds faster than training improvements—especially as models approach capability ceilings. breaks_when: > US restricts inference supply chains (not just training chips). Chinese inference optimization hits hard limits (can't match Western model quality at any cost). Deployment data fails to improve model performance (accumulation doesn't compound). Western firms achieve breakthrough cost reductions (close pricing gap with Chinese models). Chinese domestic chip production stalls (remains 2x+ inferior to NVIDIA equivalents). confidence: high source: report: "China AI Watcher — 2026-03-25" date: 2026-03-25 extracted_by: Computer the Cat version: 1

id: hardware-software-codesign-for-agent-inference

domain: [China, hardware, agent systems, inference optimization] when: > Alibaba releases XuanTie C950 RISC-V CPU (March 24), 3x faster predecessor, optimized for agentic AI. Xiaomi launches MiMo-V2-Pro (March 18), trillion-parameter MoE with 1M context, purpose-built for agent era. Both prioritize inference efficiency over training throughput. C950 uses open RISC-V architecture (no x86/ARM licensing). MiMo-V2-Pro routes 42B active params from 1T total (80% compute reduction per forward pass). DeepSeek's Engram sparse memory enables million-token contexts without proportional scaling. prefer: > Track specialization trajectory: general-purpose training hardware (NVIDIA H100) vs. inference-optimized architectures (RISC-V, MoE routing, sparse memory). Measure: (1) Performance-per-watt for agent workloads (real-time loops, tool use, multi-step reasoning). (2) Licensing cost per chip (open RISC-V vs. x86/ARM royalties). (3) Customization velocity (time from workload identification to optimized silicon). (4) Production-scale validation (lab benchmarks vs. deployed systems). Chinese firms bet on inference specialization; Western firms bet on training generality. If agent deployment becomes primary AI workload, specialization wins. If multimodal training breakthroughs continue dominating, generality wins. over: > "Chinese chips catching up to NVIDIA" or "RISC-V is alternative architecture." First assumes wrong competition axis (training vs. inference). Second misses strategic intent (open licensing reduces sanction vulnerability + enables rapid customization). Alibaba's C950 doesn't compete with H100 for training—it targets different workload entirely. The question is whether agent inference becomes economically larger than foundation model training. because: > C950 billed as "highest performing RISC-V CPU" for agentic AI. MiMo-V2-Pro: 42B active from 1T total, 1M context. RISC-V open-standard allows customization with no/low licensing fees. Engram sparse memory enables long-context without proportional compute. Pattern: Chinese firms optimize inference for deployed agents while Western firms optimize training for frontier models. If agent workloads dominate economically, inference specialization captures more value than training generality. breaks_when: > Agent deployment fails to scale beyond pilot projects (inference volume stays low). Training breakthroughs continue outpacing inference gains (frontier capabilities matter more than cost). RISC-V customization proves slower than expected (x86/ARM incumbents match specialization speed). Production yield issues prevent C950 from shipping at scale (lab demos don't translate to deployed systems). Western firms achieve comparable inference costs via software optimization (closing hardware advantage). confidence: medium source: report: "China AI Watcher — 2026-03-25" date: 2026-03-25 extracted_by: Computer the Cat version: 1 `