Liang Wenfeng’s 160-Person
Research Team:
A Capability Analysis
Organizational Efficiency Limits Under the Dunbar Threshold, the Five-Layer Attack on AI Coding Markets, Data Sovereignty Crisis, Seven Boundary Conditions, and Observable Trigger Indicators
This report systematically demonstrates, through a multidimensional framework spanning organizational behavior, technology management, and industry competition analysis, why the approximately 160-person research team led by DeepSeek founder Liang Wenfeng constitutes the most sustainably competitive threat to the American AI duopoly (OpenAI and Anthropic). The report employs a five-tier evidence grading system (A: Official Disclosure → E: Author Inference) to annotate the confidence level of all key data points, constructs a variable weight matrix containing seven positive advantage factors and seven negative boundary conditions, and provides a competition timeline and observable trigger indicator table as a tracking framework for when the window of advantage closes. The analysis covers the Dunbar number organizational efficiency hypothesis (including the possibility that AI augmentation may rewrite its upper bound), the dual nature of the core node, a dual assessment of competitor organizational entropy and positive scale capabilities, compute supply chain dynamics and gray-channel operations, layered financial analysis of the quantitative capital closed loop, the five-layer AI coding tool market attack chain and competitor countermeasure paths, the coexistence mechanism of data sovereignty strategy and the open-source commercialization paradox, as well as the Scaling Law cost curve flattening hypothesis and the “white-glove” proxy trust barrier breakthrough path. Core thesis: DeepSeek’s systemic structural advantage genuinely exists and is most pronounced within the 2026–2028 window.
A Official Disclosure
B Authoritative Media
C Industry Estimate
D Anonymous Source
E Author Inference
01The Dunbar Number and the Innovation Density Law of Technology Organizations
Research by Robin Dunbar, professor of evolutionary psychology at Oxford University, shows that the cognitive upper limit for the number of stable social relationships a human can maintain is approximately 150[1]. It should be noted that the Dunbar number is an evolutionary psychology hypothesis, not a mathematical law, and its applicability to modern knowledge-intensive organizations remains academically debated. However, as an industry analysis framework, it provides a strategically explanatory hypothesis: in frontier AI research—a domain characterized by high uncertainty, high trial-and-error rates, and high tacit knowledge density—team sizes near the Dunbar number may represent a local optimum for organizational efficiency. An observation by Facebook Chief Product Officer Chris Cox supports this hypothesis: “I’ve talked to countless startup CEOs, and when they pass 150 people, weird things start happening.”[2]
This study cross-validates the hypothesis against historical data from three leading AI companies, revealing a noteworthy regularity—each company’s most creatively breakthrough moment occurred during the 150–250 person window. This correlation does not constitute strict causal proof, but is sufficient to support a strategic judgment: when research tasks remain in the exploratory phase, small and dense research organizations may find it easier than large commercial organizations to maintain directional alignment and rapid iteration.
| Company | Breakthrough Achievement | Headcount at the Time | Dunbar Status |
|---|---|---|---|
| OpenAI | GPT-2 (2019) | ~150 | ✅ At threshold |
| OpenAI | GPT-3 (2020) | ~250 | ⚠️ Just exceeded |
| Anthropic | First Claude training completed (Summer 2022) | ~192 | ✅ Within threshold |
| DeepSeek | V3 + R1 shake the world (Late 2024–Early 2025) | ~160 | ✅ Within threshold |
Sources: OpenAI headcount timeline[3], Anthropic employee statistics[4], DeepSeek team size[5]
The organizational state changes after exceeding the Dunbar number follow a clear staged degradation model: 0–50 people is the “family” stage (everyone knows everyone, informal communication, rapid decisions); 50–150 is the “tribe” stage (still trust-based but subgroups begin forming); 150–500 is the “organization” stage (formal processes must be established, bureaucracy appears, trust becomes transactional); 500+ is the “corporation” stage (requiring fundamentally different communication, decision-making, and cultural maintenance approaches). DeepSeek’s 160 people sit right at the “tribe” to “organization” threshold—the maximum scale at which the founder can directly interface with every individual. A 150-person company generates over 11,000 potential communication relationships; one more person pushes that number beyond cognitive management capacity.
Two important qualifications must be noted. First, this analysis is subject to survivorship bias—we have only examined three successful AI companies’ breakthroughs near the Dunbar threshold, without searching for AI startups that failed or remained obscure at the same scale. If out of 100 AI startups at the 150-person stage, only 3 succeeded and 97 failed, then the Dunbar number is not a sufficient condition for success, but merely one of the necessary background factors. Second, AI agent technology itself may be rewriting the upper bound of organizational physics—as a leading AI company, DeepSeek very likely uses its own AI agents to replace portions of middle management, code review, and test coordination. If AI takes over the communication and coordination costs that consume the most energy in traditional enterprises, the actual output of 160 people may be equivalent to 500–1,000 people in a traditional organization, meaning the “ceiling” of the Dunbar number itself is being redefined by AI.
02Core Node Analysis: The Irreplaceability of Liang Wenfeng
Liang Wenfeng plays a trinity role at DeepSeek: “researcher + strategic decision-maker + culture shaper.” The key difference from other AI company founders is his continued direct participation in core technical work. Specific evidence includes: personally co-authoring and uploading all major papers to arXiv, including DeepSeek-V3, R1, NSA attention mechanism, and mHC architecture[8]; the mHC architecture actually adopted in V4 (uploaded December 31, 2025, with Liang as named author) directly validates his technical judgment[6][7]. The South China Morning Post commented: “His involvement in this paper shows he continues to lead the company’s core AI work.”
| Dimension | Sam Altman (OpenAI) | Dario Amodei (Anthropic) | Liang Wenfeng (DeepSeek) |
|---|---|---|---|
| Most recent co-authored paper | None (never writes papers) | Early period yes, very rare recently | April 2026 (V4 technical report) |
| 2025–26 public appearances | TED, Davos, Congress, India Summit, Fed, DevDay… | Blog posts, podcasts, congressional hearings… | Li Qiang’s symposium (invited), 2 interviews only |
| Daily core role | CEO / Fundraising / Political lobbying / Media | CEO / Policy / Safety advocacy | Writing code, writing papers, uploading to arXiv, architecture design |
| Company headcount | ~7,300C | ~3,000–5,000C | ~160C |
Sources: VentureBeat[9], Reuters[10], LeadIQ/JobsByCulture[13]
The founder’s attention allocation has a decisive impact on strategic direction in small organizations. Liang Wenfeng’s technical embeddedness and low-exposure mode allow DeepSeek to maintain a higher research attention concentration than the American giants at this stage. This does not mean Altman’s or Amodei’s paths are “wrong”—the stage their organizations are in requires them to take on fundraising, policy, and commercialization functions—but rather that different founder attention allocation patterns produce different organizational characteristics. Reuters reported that after R1 shocked the world, Liang Wenfeng “has maintained an extremely low public profile”[10]. In a 160-person team, the core node’s attention allocation is tightly coupled with the entire organization’s strategic direction.
Liang Wenfeng’s management approach: “For every one of us, there are no upper limits on GPU and personnel allocation. If you have an idea, anyone can use the training cluster GPUs at any time without approval.” No KPIs, no hierarchy—even interns communicate directly with him. But this does not mean there are no resource priorities—the V4 delay and internal disagreements prove[42] that on critical directions (chip roadmap, whether to pursue multimodal, training direction), Liang Wenfeng holds final decision-making authority. The 160-person scale allows “free exploration” and “centralized decision-making” to coexist; beyond this number, the two would conflict.
03Empirical Analysis of Competitor Organizational Entropy
3.1 OpenAI: The Leadership Crisis Spiral
OpenAI began experiencing severe organizational turbulence from 770 people onward. During the November 2023 board coup, 745 of 770 employees threatened to resign en masseB[11]. Key talent continued to depart thereafter: Chief Scientist Ilya Sutskever (May 2024), Alignment Team lead Jan Leike (public statement: “safety culture has yielded to shiny products”), CTO Mira Murati with Chief Research Officer Bob McGrew and VP of Research Barret Zoph (September 2024, all departing within 24 hours), co-founder John Schulman (defected to Anthropic), governance team member Daniel Kokotajlo (public statement: “I have lost trust in OpenAI leadership’s ability to handle AGI”[15]), and at least 7 researchers who departed for Meta’s Superintelligence Lab (Summer 2025)[12]. The Superalignment team was completely disbanded. As of April 2026, OpenAI’s two-year retention rate stands at 67%C, below Anthropic (80%) and DeepMind (78%)[13]. Of the original 11-person founding team, only 2 remain active. Fortune commented[14]: “Something systemic is surfacing inside OpenAI—a culture rooted in the stress fractures of hypergrowth.”
3.2 Anthropic: The Ironic Contrast Between Safety Mission and Organizational Security
On March 31, 2026, Anthropic accidentally published the complete source code of Claude Code to the public npm registry—approximately 512,000 lines of TypeScript, 1,906 filesA, 44 hidden feature flags, and references to an unreleased model called Mythos[16]. The code was mirrored to GitHub within hours, garnering 84,000 stars and 82,000 forks. Just five days earlier, approximately 3,000 internal documents had been publicly exposed due to a CMS misconfiguration. A company whose core mission is “safety” experienced two leaks within five days—a classic symptom of management chain fracture after expanding beyond 3,000 people. Attackers subsequently exploited the leak to distribute trojanized Claude Code versions containing backdoors, data stealers, and cryptocurrency miners[17][18]. Safety research lead Mrinank Sharma subsequently resigned, warning that “the world is in danger”[19].
3.3 The Physics of Administrative Bloat
Research published by NIH confirms that in public research institutions, when time spent on administrative activities increases, scientific output declines over time[20]. Federally funded faculty in the United States spend nearly half their research time on administrative tasks, and approximately one out of every five dollars in university research funding goes to compliance[21]. Yale professor Nicholas Christakis summarized: “Any growth in administration tends to come at the expense of the core mission. The nature of bureaucracy is to grow without restraint unless actively checked.”[22]
David Graeber argued in Bullshit Jobs that society has not used technological progress to shorten work hours, but has instead witnessed the bloating of administrative sectors—an unprecedented expansion of financial services, corporate legal, academic administration, HR, and PR departments, with market-oriented reforms almost always creating more bureaucracy rather than less[23]. Brooks’s Law from Fred Brooks’s 1975 The Mythical Man-Month—”adding manpower to a late software project makes it later”—remains valid fifty years later in the AI race[24]. A paper on “The Perceived Decline of Disruptive Science” published in PMC notes that mounting administrative burdens push the brightest talent away from the research front lines, spreading researchers increasingly thin[25][26].
Key inference: Monetization is an irreversible trigger for organizational bloat. Once revenue is generated → external-facing departments must be established → administrative staff enter → researcher attention is diverted → bureaucracy begins self-replicating → the Dunbar number is breached → core node control is diluted. Note that “organizational entropy” is an explanatory metaphor, not a strict physical law—the second law of thermodynamics applies to closed systems, whereas enterprises are open systems that can locally reduce “entropy” through forceful CEO decisions, layoff-restructurings, business spin-offs, and other external energy inputs. OpenAI’s post-coup product direction was actually clearer than before—itself an “entropy-reducing” event.
It must also be acknowledged that the scaling of OpenAI and Anthropic has not produced only negative effects. Scale also endows them with positive structural capabilities that DeepSeek currently lacksE: global enterprise distribution networks (AWS Bedrock/Azure/GCP Vertex), security compliance certification systems (SOC 2/HIPAA/FedRAMP in progress), government relations and policy influence, large-scale compute procurement bargaining power, redundant reserves of top-tier talent (no single departure is crippling), and multi-product cross-selling ecosystems. These positive capabilities constitute real structural barriers when defending against DeepSeek’s five-layer attack and should not be entirely obscured by the “organizational entropy” narrative.
04The True Extent of Compute Constraints and Gray Channels
The DeepSeek V3 technical paper claims training used 2,048 H800 GPUs at a cost of $5.58 millionA[27]. However, SemiAnalysis, citing anonymous industry sources, estimates that DeepSeek actually possesses approximately 50,000 Hopper-generation GPUsC (including ~10,000 H100s, ~10,000 H800s, ~30,000 H20s, and ~10,000 A100s), with GPU server capital expenditure of approximately $1.63 billionC[28]. Independent estimates from Stanford FSI[29] and Scale AI CEO Alexandr Wang[30] are consistent. CSIS noted that the $5.58 million cost “is only the final successful pretraining run, excluding the computational cost of hundreds of prior experiments, as well as post-training fine-tuning and inference compute.”
| Company | Total GPUs | Data Source |
|---|---|---|
| OpenAI / Microsoft | 131,000+ | MRC paper (officially published) |
| Meta | 1,300,000+ (2025 target) | Zuckerberg public statement |
| DeepSeek | Estimated tens of thousandsC | SemiAnalysis / CSIS / Scale AI |
Note: Snapshot as of May 2026; the gap is changing dynamically. DeepSeek’s specific GPU composition has not been officially confirmed.
The gap is 3–25×, not the previously widely assumed 60×. The scale of gray channels far exceeds public awareness: the exposed “Operation Gatekeeper” case involved $160 million in smuggled H100/H200 chips[32]; the Super Micro case involved $2.5 billion[33] in servers transiting through Malaysia/Singapore; DeepSeek itself is alleged to have established “ghost data centers” in Southeast Asia[35] that redirect GPUs after passing audits. CSIS’s Gregory Allen noted that GPU smuggling yields “profit margins comparable to drug trafficking”—buy a chip for $25,000, sell it for $50,000[34]. GamersNexus, after on-the-ground investigation in Asia, summarized: “To the U.S. government, this is a ‘black market’; in China, it’s just ‘the market.'”[36]
05Financial Autonomy: The Quantitative-Funded AI Capital Closed Loop
Liang Wenfeng simultaneously helms High-Flyer Quantitative and DeepSeek. High-Flyer’s 2025 average return reached 56.55% (ranking second among quantitative private equity firms with over 10 billion yuan in AUM), with assets under management exceeding 70 billion yuan (~$10 billion)[37] and a five-year average return of 114.35%. If a stable transfer mechanism exists between High-Flyer’s returns and the funds Liang can deploy, this quantitative business could provide DeepSeek with a capital buffer distinct from VC financing. However, it must be noted: AUM (assets under management) is not the same as proprietary capital, fund returns are not the same as transferable cash flow, and fund performance is not the same as DeepSeek’s deployable budgetE. In May 2026, Liang Wenfeng announced a personal contribution of 20 billion yuan (~$2.8 billion) to DeepSeek’s first funding round[39], while directly and indirectly holding 84.29% of DeepSeek’s shares with nearly 100% voting rights[40].
Liang had previously rejected funding proposals from tech giants including Tencent and Alibaba in 2025, refusing to surrender 20% equity, insisting on maintaining decision-making independence by isolating external capital. It was not until April 2026 that the first external fundraise was initiated (target valuation $20–30 billion), triggered by V4’s training cost escalating to the billion-dollar level. Even so, the 84% ownership and 100% voting rights ensure external capital cannot interfere with strategic direction. Chinese government support has become public—Liang was one of nine speakers at Premier Li Qiang’s closed-door symposium, and V4 has completed deep adaptation with Huawei’s Ascend 950PR and Cambricon chips. Nature named Liang among its 2025 Ten People Who Shaped Science[41].
Capital efficiency comparison: OpenAI has raised $40 billion, but must support 7,300 employees, Stargate infrastructure, global offices, political lobbying, and IPO preparation. DeepSeek’s approximately $5 billion in deployable resources go almost 100% toward GPUs and research—160 people don’t need buildings, an HR department, a PR team, or a compliance army. Funds under Liang’s control likely enter compute and R&D at a higher proportion, whereas a substantial share of OpenAI’s capital goes to organizational operations, infrastructure, and commercializationE. This means the real competitive gap in compute power may be even smaller than the GPU count gap suggests, though the exact proportions cannot be precisely verified externally.
06Technical Validation: Architectural Innovation as Systematic Compensation for Compute Disadvantage
DeepSeek V4 validates the effectiveness of compensating for compute shortfalls through architectural innovation across multiple dimensions. V4-Pro-Max tops 71 models on LiveCodeBench with 93.5% Pass@1[46] (surpassing Claude Opus 4.6 Max at 88.8%); scores 3206 on Codeforces (surpassing GPT-5.4 xHigh at 3168); and achieves 80.6% on SWE-bench Verified, only 0.2 percentage points behind Claude Opus 4.6[46].
Key architectural innovations include: FP4+FP8 mixed-precision native training[47] (this is not post-training compression—the model learns in this precision from day one—directly driven by compute constraints from Huawei Ascend chips and restricted NVIDIA hardware); at 1M context, V4-Pro’s per-token FLOPs are only 27% of V3.2’s, and KV cache only 10% (potentially solving the current HBM shortage); Manifold-Constrained Hyper-Connections (mHC) solving trillion-parameter training stability; and the Muon optimizer replacing AdamW for faster convergence. The common logic of these innovations: maximizing model quality output per unit of compute through algorithmic design under hardware constraints—DeepSeek has turned constraint into design philosophy.
However, gaps remain. On SWE-Bench Pro (a harder real-world coding evaluation), V4-Pro scores approximately 55%, trailing Claude Opus 4.7 at 64.3%. NIST’s CAISI evaluation, using non-public benchmarks, positions V4-Pro closer to GPT-5 from 8 months prior. Independent tester Thomas Wiegold precisely summarized V4’s competitive positioning: “DeepSeek V4 is currently the most cost-effective AI model on the market, but not the best coder. If you have volume, choose it without question; if you face hard problems, Claude or GPT-5.5 are still better.”[45] But this gap is narrowing with each model generation—while competitors’ organizational efficiency is declining each year.
07Market Validation: Users Vote with Their Feet
DeepSeek reached 130 million active users by end of 2025, with total downloads of 173 million[48]. The U.S. accounts for 5% of MAU (~6.5 million), maintaining growth even as multiple U.S. government agencies publicly warned against and banned its use. Developer-level penetration runs deeper: DeepSeek-Coder ranks second in Stack Overflow coding assistant preferences; over 26,000 enterprise accounts; used by more than 1,000 enterprises (including Fortune 500); cited in nearly 40% of new AI research papers; 170,000 stars and 60,000 contributors on GitHub.
The most emblematic phenomenon is American developers proactively replacing Claude Code’s backend with DeepSeek—the DeepClaude project reached #1 on Hacker News (606 points)[50], with some canceling their Claude subscriptions to switch entirely. DeepSeek’s official documentation directly provides Claude Code integration tutorials. Microsoft’s AI Economy Institute acknowledged DeepSeek’s rise as “one of the most unexpected developments of 2025”[49]. The stark contrast between official use restrictions and developers’ spontaneous individual adoption shows that product strength has become sufficient to circumvent some policy resistance at the individual developer level, though it still faces institutional restrictions at the enterprise and government procurement level (see Section 10.5). When the direction users vote with their feet opposes the direction of government policy, it signals that product strength has become too powerful for political factors to block.
08DeepSeek’s Entry into the AI Coding Tool Market: A Five-Layer Attack Chain
On May 20, 2026, news circulated in AI circles that DeepSeek was internally assembling a new Harness team focused on code agent products, with Anthropic’s Claude Code as the internal benchmark[55]. This strategic move signals DeepSeek’s upgrade from “a cheaper model backend” to “a vertically integrated model + tool competitor”—cutting directly into the core of the AI coding tool market.
The current AI coding tool market landscape is clear: Claude Code leads with over $2.5 billion in annualized revenueC (February 2026 data; given Anthropic’s overall growth from $14 billion to $30 billion annualized run rate between February and April[56], Claude Code may now be in the $5–6 billion range), Cursor exceeds $2 billion, and Codex has broken through $1 billion[57]. Claude holds approximately 54% of the enterprise coding model marketC, with OpenAI at 21%[58]. A JetBrains April 2026 survey found 46% of respondents rated Claude Code as the most popular AI coding tool[59].
DeepSeek’s attack on this market constitutes a five-layer assault:
| Attack Layer | Specific Approach | Competitive Effect |
|---|---|---|
| Layer 1: Model | V4 Pro tops LiveCodeBench at 93.5%, within 0.2% on SWE-bench | Model quality parity |
| Layer 2: Price | API pricing at 1/7 to 1/15 of Claude’s | Price domination |
| Layer 3: Parasitic | DeepClaude lets users run Claude Code shell + DeepSeek brain | Infiltrating user habits |
| Layer 4: Tooling | Harness team formation, building proprietary coding tool benchmarked against Claude Code | Product substitution |
| Layer 5: Local | ds4 runs a 284B model on a 128GB MacBook | Complete cloud independence, zero cost, zero privacy risk |
Layer 5’s ds4 is particularly critical. Redis creator Salvatore Sanfilippo (antirez) built a local inference engine specifically for DeepSeek V4 Flash, using Metal acceleration on Apple Silicon[60]. On a 512GB Mac Studio M3 Ultra, long-prompt prefill reaches 468 tokens/s and generation runs at 27.39 tokens/s[61]. On a 128GB MacBook Pro M3 Max, Q2 quantized generation runs at 26 tokens/s with peak power draw of only 50 watts[62]. Through asymmetric quantization (IQ2_XXS/Q2_K for MoE routed expert layers, Q8 for remaining components), disk-based KV cache, and OpenAI/Anthropic-compatible API, ds4 can serve directly as a local backend for Claude Code[63]. This means a consumer-grade machine is now doing work that required data center GPUs just six months ago.
Layer 3’s “parasitic” strategy is precisely targeted: thousands of developers worldwide are already using the Claude Code toolchain + DeepSeek model combination. When DeepSeek launches its own toolchain, these users’ migration cost is zero—they are already accustomed to DeepSeek’s model and only need to switch the frontend. First, get users accustomed to the engine; then give them a complete car.
The core of asymmetric warfare: Both Anthropic and OpenAI have investors demanding growth, IPO pressure, and profitability requirements. DeepSeek does not—Liang Wenfeng has High-Flyer’s cash flow as a backstop and can operate coding tools indefinitely at zero profit or even at a loss. On May 14, 2026, OpenAI and Anthropic launched a price war on the same day[59] (OpenAI offering enterprise users two free months of Codex for switching within 30 days; Anthropic adding 50% weekly usage quota to Claude Code)—the two giants are already squeezing each other, and DeepSeek’s entry will turn this two-player boxing match into a three-way chokehold. DeepSeek has lower dependence on short-term profits during the window period and can therefore pursue more aggressive pricing and open-source expansion—but this asymmetry will gradually narrow as training costs rise and equity is diluted (see Sections 10.2, 10.6).
Two caveats are warranted, however. First, Anthropic’s 80× year-over-year growth to a $30 billion run rateB is on the steep early portion of the S-curve; at this pace, it would reach $2.4 trillion by 2027—clearly unsustainable, with the growth inflection likely appearing in H2 2026 or 2027. Linearly extrapolating the current growth rate as market size would severely overestimate the actual size of the “pie”E. Second, competitors are not stationary targets: Anthropic could launch an enterprise-grade on-premises Claude Code to counter the data sovereignty attack; OpenAI could leverage Azure’s global distribution network to mount an absolute-resource price counterattack; both could acquire or invest in open-source coding tools (e.g., Cursor) to counter the parasitic layer; and both could release specialized lightweight coding models to capture the local deployment market. The five-layer attack chain analysis is a one-sided game perspective; real competition is a dynamic game.
09Data Sovereignty Crisis: The Trust Collapse of Cloud-Based AI Coding Tools
DeepSeek’s local deployment competitive advantage lies not only in price but in data sovereignty—a dimension receiving growing recognition from governments and enterprises alike. Every developer using Claude Code or Codex is simultaneously doing three things while paying: exposing their codebase structure to the model provider; converting their programming habits and problem-solving approaches into training signals; and turning real-project debug paths into fine-tuning data for next-generation models. Then, in the next model upgrade, the user’s work product is bundled into a product that requires another payment to use.
This is not theoretical speculation. Claude’s consumer accounts (Free/Pro/Max) have had training data collection enabled by default since August 28, 2025, with a 5-year retention periodA[64]. Stanford research confirmed that six major AI companies use user chat data by default. A 2026 federal court ruling determined that AI conversations do not enjoy legal confidentiality protections[65]. Most independent developers using Claude Code Pro likely do not realize that switching from a Pro login to an API key substantively changes data handling practices.
Institutional awakening is accelerating: the Democratic National Committee (DNC) issued an internal policy banning employees from using ChatGPT and Claude in April 2026[66]; Samsung imposed a total ban after engineers pasted confidential source code into ChatGPT[67]; ChatGPT suffered a bug exposing user conversation history, Microsoft Copilot averaged access to 3 million sensitive records per organization, and Claude could be manipulated to leak context window information[68]. These are not isolated incidents but structural risks inherent to cloud-based AI tools.
DeepSeek + ds4 local deployment fundamentally bypasses the entire problem chain: the model runs on the user’s own hardware, and code never leaves the machine; prompts, codebases, and debug paths all remain the user’s own; no terms to read, no manual opt-out required, no need to trust any company’s promises—data physically cannot leave the machine. For advanced technical users capable of deploying models themselves, this is not a question of “is DeepSeek good enough to make me switch” but rather “why am I still providing free training data to someone else.”
The strategic significance of data sovereignty: Liang Wenfeng’s choice of open-source and local deployment ostensibly sacrifices the commercial value of data collection but actually trades “I don’t touch your data” for the trust and loyalty of high-end technical users. These users happen to be the opinion leaders and early adopters of the entire AI coding market. DeepSeek’s user profile shows 81% desktop usage with extremely deep developer community penetration—the first to awaken are the most technically sophisticated, because they understand best how their data is being used. Their migration will drive the entire market. When users transform from “consumers” to “people unwilling to be consumed,” the growth curve slope of cloud-based AI coding tools begins to bend.
10Boundary Conditions and Risk Constraints on Structural Advantage
The preceding nine chapters have argued for DeepSeek’s systemic structural advantage. But any serious analysis must calibrate the conditions under which that advantage fails. If any one of the following seven boundary conditions is triggered, it could cause one factor in the aforementioned multiplicative formula to flip from positive to negative, producing nonlinear degradation of the overall advantage.
10.1 Single Point of Failure Risk at the Core Node
Chapter 2 argued for Liang Wenfeng’s irreplaceability as a core node, but the other side of the coin is: binding an entire organization’s strategic judgment, technical direction, and cultural DNA to a single person means the organization’s fault tolerance is zero. The flat 160-person structure is a weapon during the assault phase, but if Liang himself makes a strategic miscalculation—such as choosing the wrong chip roadmap (Huawei vs. NVIDIA), model architecture (dense vs. sparse), or market direction (B2B vs. B2C)—a 160-person team lacking middle-layer error-correction mechanisms will execute the wrong path with the same high efficiency. Internal sources from the V4 delay revealing that “there were internal disagreements about training direction; Liang put forward his own requirements, but compromise was difficult at the execution level”[42] already expose an early signal of dissent suppression under the core node model. OpenAI experienced the violent upheaval of a board coup at 770 people, but the flip side of that upheaval was that it activated the organization’s error-correction capability—post-reinstatement OpenAI’s product direction was indeed clearer than before. DeepSeek lacks this error-correction mechanism; when Liang’s judgment is correct this is an advantage, when it is wrong it is a disaster.
10.2 Alpha Decay of Quantitative Capital vs. the Exponential Demands of Scaling Laws
Chapter 5 described High-Flyer Quantitative as a “money printer”—2025 average return of 56.55%, five-year average of 114.35%. But in real quantitative trading markets, alpha (excess return) is a physical entity that decays rapidly. Beyond AUM of 10 billion, trading friction, market liquidity constraints, and strategy crowding systematically erode returns. High-Flyer’s AUM once exceeded 100 billion in 2021 but subsequently experienced industry-wide drawdowns. That current AUM of 70 billion sustains 56% returns does not mean this can be extrapolated indefinitely.
The more critical contradiction: the cost of the AI compute arms race is growing exponentially. V3’s training cost was $5.58 millionA (single run); V4’s training investment has jumped to the billion-dollar level[38]. If the next-generation model (V5 or R3) requires $5–10 billion in training investment—entirely possible under OpenAI and Google’s Scaling Law trajectory—High-Flyer’s annual cash flow of several billion yuan will no longer suffice. Liang’s 20-billion-yuan personal contribution is one-time ammunition, not a renewable resource. DeepSeek’s opening of its funding window itself proves that the quantitative capital closed loop is already under strain at current scale. OpenAI’s $40 billion in external financing is wasteful in a sprint (most feeds the organizational machine), but in a war of attrition it constitutes absolute gravitational force—it can absorb the cost of multiple consecutive training failures, while DeepSeek cannot.
However, a hedging possibility existsE: if the path to frontier AI shifts from “brute-force scaling pretraining compute” toward “inference-time compute” and synthetic data quality breakthroughs, the training cost escalation curve may flatten or even decline within one to two years. DeepSeek’s R1 has already achieved a breakthrough in inference-time compute. If this path becomes mainstream, the weight of capital dominance will diminish significantly, and DeepSeek’s architectural efficiency advantage will be amplified. This is a variable whose direction remains undetermined but favors DeepSeek.
10.3 Inference Capability Collapse from Extreme Edge Quantization
Chapter 8 described ds4 local deployment as the “ultimate weapon” of the five-layer attack chain. But this narrative deliberately underplays a hardware physics reality: Q2 (2-bit) quantization does not cause linear degradation of model reasoning capability but rather cliff-like collapse above a specific complexity threshold. The report’s own data already hints at this—V4-Pro at full precision achieves only 55% on SWE-Bench Pro (trailing Claude at 64.3%), and this is the API-side full-precision performance. The local version running Q2 quantized on a 128GB MacBook, facing SWE-Bench Pro-level complex code engineering tasks, is highly susceptible to hallucination loops and logic chain breaks.
Edge deployment may suffice for 80% of everyday coding tasks (completion, refactoring, simple debugging); but for the 20% of high-difficulty tasks (large codebase architectural restructuring, complex multi-file dependency analysis, deep business logic bug localization), cloud-based full-precision compute remains irreplaceable. This means DeepSeek’s “local layer” attack is most effective against independent developers and small-to-medium teams, but has limited impact on the large enterprise customers who drive Claude Code’s 54% enterprise market share—precisely because these customers need that 20% of high-difficulty capability most. The market may not migrate wholesale but instead split into a hybrid architecture of “local edge for routine tasks + cloud API for complex tasks,” which paradoxically preserves a high-value enterprise moat for Claude Code.
It should be added that edge inference is not limited to Q2 configuration alone. On a 512GB Mac Studio M3 Ultra (~$8,000), Q4 or even Q5 quantization is possible, at which point the quality gap with the API-side narrows substantiallyE. A precision-performance-cost gradient exists across different hardware configurations: 128GB/Q2 suits lightweight routine tasks, 512GB/Q4-Q5 can cover most moderately complex engineering tasks, and only the highest-complexity SWE-Bench Pro-level tasks require cloud full-precision models. This gradient makes “local deployment” not an all-or-nothing binary choice but a spectrum that can be flexibly calibrated by task complexity.
10.4 The Tightening Cycle of Geopolitical Constraints
Chapter 4 described gray channels as an important supplement to DeepSeek’s compute supply. But the physical reality is: DeepSeek is transitioning from “open-source research organization” to “commercial competitor directly threatening the American AI duopoly”—the formation of the Harness team benchmarked against Claude Code marks the publicization of this transformation. As the threat level rises, U.S. Commerce Department export control intensity will inevitably tighten nonlinearly. The exposure of cases like Operation Gatekeeper shows enforcement is catching up with smuggling; the allegations against DeepSeek’s own Southeast Asian “ghost data centers”[35] mean its name has already appeared on U.S. law enforcement target lists.
From H800s to future Blackwell and Rubin architectures, the physical difficulty (volume, power consumption, cooling, network infrastructure) and compliance risk (stricter end-user audits, chip-embedded telemetry tracking) of smuggling tens of thousands of server-grade GPUs will continue to rise. Mass production of Huawei’s Ascend 950PR may provide partial substitution, but as noted earlier, the CANN software ecosystem “probably won’t truly mature until 2027,” and Ascend hardware’s training stability issues remain fundamentally unresolved. DeepSeek’s architectural innovations (FP4 native training, sparse attention) are indeed compressing compute requirements, but this compression has physical limits—it is impossible to compress the Scaling Law’s log-linear relationship to a constant. When competitors advance with hundreds of thousands of GPUs on Rubin architecture, a 2–5× architectural efficiency advantage will ultimately be overwhelmed by a 1–2 order-of-magnitude compute gap.
10.5 International Trust Barriers and the Enterprise Compliance Chasm
Between “individual developer use” and “deployment in enterprise core code scenarios” lies an enormous trust chasm. Western enterprises procuring AI coding tools must pass SOC 2 Type II security audits, GDPR/HIPAA/FedRAMP compliance assessments, and SLA guarantees. Claude Code is backed by AWS Bedrock and Google Cloud Vertex channels with FedRAMP certification in progress; OpenAI has Microsoft Azure’s global enterprise distribution network. DeepSeek is virtually blank in these dimensions. A Chinese AI company registered in Hangzhou, whose founder has been investigated by U.S. law enforcement, faces not just technical competition but institutional exclusion when attempting to enter Western enterprise core code scenarios. The five-layer attack chain is most powerful at the independent developer and small team level, but its short-term impact on large enterprise customers is limited.
However, the business world has established “white-glove” or proxy modelsE that can partially circumvent this barrier. Middle Eastern sovereign fund-backed cloud providers, select European data sovereignty clouds (e.g., OVHcloud, Scaleway), or Southeast Asian regional cloud service providers could serve as hosts and compliance endorsers for DeepSeek models, thereby gaining indirect access to the global enterprise market. This “de-China packaging” path has precedent in other industries (e.g., TikTok’s Oracle data hosting arrangement). The Section 10.5 analysis is overly confined to a “U.S.-China direct confrontation” binary model, ignoring the buffering role of third-party intermediaries in globalization.
10.6 The Open-Source Commercialization Paradox: Strong Penetration but Weak Revenue Capture
DeepSeek’s MIT-licensed open-source strategy is the core engine of user penetration, but it constitutes a classic commercialization paradox: the wider the penetration, the harder the revenue capture. When model weights are fully open, anyone can deploy locally, modify, and redistribute without returning any economic value. 170,000 GitHub stars are proof of community influence, not revenue. API call fees are the primary revenue source, and local deployment (the ds4 path) is precisely cannibalizing API revenue—a self-contradictory product strategy. Liang can currently cover this contradiction with High-Flyer cash flow, but as training costs enter the billion-dollar level, whether a pure open-source path can sustain ongoing R&D investment presents structural tension.
Chapter 9 of this report defines data sovereignty as a strategic weapon, while this section identifies the revenue capture difficulty of open-source commercialization—an internal tension between these two judgments that requires explicit treatment. Their coexistence mechanism may be: data sovereignty trust serves as a “subsidy” for user acquisition in the early phase; once the user base is large enough, DeepSeek can capture revenue through value-added layers (enterprise SLAs, custom fine-tuning services, paid premium features of the Harness toolchain, API priority queues) without touching user code data. Trust is the weapon for penetration, but monetization must build a new value layer on top of trust rather than betraying the trust itself. Whether this conversion can succeed is the central suspense of DeepSeek’s business model.
The open-source paradox has another dimension: technology replication risk. DeepSeek’s core architectural innovations (FP4 native training, mHC, NSA attention mechanism, etc.) are all publicly published on arXiv, meaning Meta (Llama ecosystem), Mistral, Alibaba (Qwen), and other global open-source or semi-open-source competitors can absorb these techniques within months, rapidly leveling DeepSeek’s “architectural efficiency advantage”E. Open source is both a weapon for penetration and a solvent for moats—when you publicly publish your most core innovations, you gain community influence and talent attraction, but also give every competitor a free technology roadmap. DeepSeek’s sustained advantage therefore cannot rely on any single architectural innovation’s lead but must depend on the organizational capability to continuously produce innovation—which loops back to the core arguments of Chapters 01 and 02: the iteration speed of the 160-person team and Liang Wenfeng’s technical judgment.
10.7 Operations Threshold for Local Deployment
Developers capable of self-deploying a 284B parameter model, configuring quantization parameters, and managing a Metal inference engine represent an extreme minority of the global developer population. For the vast majority of enterprise development teams, “buying a 512GB Mac Studio to run ds4” is not a realistic option—they need cloud services that can be centrally managed by IT, with SLA guarantees and technical support. Local deployment will capture the high-influence opinion leader layer, but equating this with “migration of the entire market” is overextrapolation. The more likely evolutionary path is a hybrid architecture of “local edge for routine tasks + cloud API for complex tasks,” which paradoxically preserves defensive space for Claude Code in high-difficulty, high-compliance scenarios.
Integrated assessment of seven boundary conditions: The seven boundary conditions above do not exist independently but are mutually coupled. Geopolitical tightening (10.4) compresses compute supply → forces more aggressive quantized deployment (10.3) → reasoning quality decline erodes market competitiveness → enterprise customers refuse to migrate due to trust barriers (10.5) and operations thresholds (10.7) → open-source penetration fails to convert to revenue (10.6) → revenue pressure forces acceptance of more external financing → dilutes Liang Wenfeng’s absolute control (10.1) → organizational bloat begins → quantitative capital cannot cover exponentially growing training costs (10.2) → forming a negative feedback loop. The trigger probability of this coupled chain is not high (requiring multiple conditions to deteriorate simultaneously), but once triggered, the rate of decay may be equally nonlinear. This advantage is most pronounced within the 2026–2028 medium-term window.
11Conclusion: Sustainability Analysis of Systemic Structural Advantage
11.1 Variable Weight Matrix
| Variable | Contribution to DeepSeek Advantage | Confidence | Evidence Tier |
|---|---|---|---|
| Model cost-effectiveness (API layer) | High | High | A/B |
| Small-team organizational efficiency | High | Medium | C/E |
| Liang Wenfeng core node technical embeddedness | High | Medium | B/D |
| High-Flyer capital closed loop | Medium | Medium-Low | B/C |
| Five-layer attack chain (coding tool market) | High | Medium-High | B/C/E |
| Local deployment + data sovereignty | Medium-High | Medium | A/B/E |
| Competitor organizational attrition | Medium-High | High | A/B |
| Negative Constraints | |||
| Core node single point of failure | Negative-High | Medium | D/E |
| Scaling Law capital demands | Negative-High | High | A/B |
| Edge quantization inference collapse | Negative-Medium | Medium-High | A/E |
| Geopolitical constraint tightening | Negative-High | High | A/B |
| International trust barriers | Negative-High | High | B/E |
| Open-source commercialization paradox | Negative-Medium-High | High | E |
| Local deployment operations threshold | Negative-Medium | Medium-High | E |
11.2 Competition Timeline
| Period | Key Events | Competitive Landscape |
|---|---|---|
| 2024–2025 | V3/R1 released; $5.58M training cost shocks Silicon Valley; NVIDIA market cap drops $600B | DeepSeek establishes reputation through cost-effectiveness disruption |
| H1 2026 | V4 release + Harness team formation + ds4 local engine; five-layer attack chain takes shape | DeepSeek moves from model layer to tool layer, directly threatening Claude Code/Codex |
| 2026–2028 | Structural advantage window: organizational efficiency + architectural innovation + competitor attrition | DeepSeek must build product moat and user stickiness within this window |
| Post-2028 | Scaling Law capital demands accumulate; geopolitical constraints tighten; enterprise trust barriers dominate | If ecosystem transition incomplete, structural advantage enters decay phase |
11.3 Window Closure Signals: Observable Trigger Indicators
| Boundary Condition | Trigger Signal | Observability |
|---|---|---|
| 10.1 Core node risk | Senior technical disagreements become public; batch departure of key researchers; Liang begins frequent public appearances or fundraising roadshows | Medium |
| 10.2 Capital decay | High-Flyer returns fall below 20% for two consecutive quarters; DeepSeek accepts dilutive financing exceeding 20%; training runs delayed or fail | Medium-Low |
| 10.3 Quantization collapse | Edge models show cliff-like score drops in independent coding evaluations; developer community complaints of rising local hallucination rates | High |
| 10.4 Geopolitical tightening | More GPU smuggling cases exposed involving DeepSeek; new export controls cover lower-end chips; cloud providers restrict DeepSeek API deployment | High |
| 10.5 Trust barriers | Western governments issue administrative bans on DeepSeek (TikTok-style path); major enterprise procurement blacklists expand | High |
| 10.6 Commercialization paradox | API revenue growth rate persistently below inference infrastructure cost growth rate; financing intervals shorten | Low |
| 10.7 Operations threshold | ds4 MAU remains confined to the geek niche (<100K MAU); enterprise-grade deployment tools remain unreleased | Medium |
11.4 Competitive Multiplier Formula
DeepSeek’s competitiveness can be expressed as a multiplicative formula:
Organizational efficiency (160-person Dunbar optimum) × Core node focus (founder’s technical embeddedness and low-exposure mode) × Financial autonomy (non-VC capital buffer + 84% ownership + high personal contribution capacity) × Architectural innovation (FP4 native training / 27% FLOPs / 10% KV cache) × Five-layer product attack chain (model → price → parasitic → tooling → local) × Data sovereignty advantage (open source + local deployment = zero data leakage) × Competitor self-consumption rate (leaks / departures / IPO distraction / safety team dissolution / data trust crisis) = DeepSeek’s actual competitiveness far exceeds what its surface-level resources suggest
However, this report simultaneously calibrates seven boundary conditions for this formula: core node single point of failure, quantitative capital alpha decay, edge quantization inference collapse, geopolitical tightening cycle, international trust barriers, open-source commercialization paradox, and local deployment operations threshold. These seven conditions are mutually coupled, forming a potential negative feedback chain.
Integrating the positive advantages (upper portion of the 11.1 matrix) with the boundary constraints (lower portion of the 11.1 matrix), this report’s final judgment is: DeepSeek’s systemic structural advantage genuinely exists, is currently effective, and is most pronounced within the 2026–2028 medium-term window (11.2 timeline, row 3). Within this window, Liang Wenfeng’s organizational efficiency advantage, architectural innovation capability, and competitors’ sustained attrition enable DeepSeek to maintain global leadership in output per unit of resource, posing a direct threat to the market share of Claude Code and Codex—especially at the independent developer and small-to-medium team level. The trigger indicator table in 11.3 provides a trackable signal framework for when the window begins to close.
Beyond this window, the accumulated effects of Scaling Law capital demands and geopolitical constraints will progressively erode DeepSeek’s foundation. The ultimate competitive outcome depends on whether Liang Wenfeng can build sufficiently deep product moats and user stickiness within the window period, enabling DeepSeek to complete the identity transition from “efficiency-driven challenger” to “ecosystem-driven leader” before its structural advantage decays.
If DeepSeek can maintain research density through expansion while addressing its tooling productization and enterprise trust shortcomings, then time may favor the more efficient side—but the time window itself is finite.
References & Data Sources
- Robin Dunbar, “Neocortex size as a constraint on group size in primates,” Journal of Human Evolution, 1992. See also WEF, “The magic number: what happens when companies hit 150 employees,” 2016. weforum.org
- Quartz / Kevin J. Delaney, “Something weird happens to companies when they hit 150 people,” quoting Facebook CPO Chris Cox at Aspen Ideas Festival. qz.com
- SQMagazine, “How Many People Work At OpenAI 2026,” headcount timeline: 2015(10)→2019(150)→2020(250)→2022(375)→2023(770)→2024(3,531). sqmagazine.co.uk
- SEO.ai, “How Many People Work at Anthropic?” 2021: 7→2023: 240→Sep 2024: 1,035. seo.ai
- Zhihu/36Kr (Dark Waves), Liang Wenfeng interview; DeepSeek team size ~160. zhuanlan.zhihu.com
- Blockonomi, “DeepSeek Introduces mHC Architecture,” CEO Liang Wenfeng as co-author, personally uploaded paper. blockonomi.com
- SCMP, “DeepSeek kicks off 2026 with paper signalling push to train bigger models for less,” Jan 1, 2026. scmp.com
- InfoQ China, “Liang Wenfeng personally publishes papers? His ‘tech guy’ nature has never changed,” NSA/V3/R1 papers co-authored and submitted by Liang. infoq.cn
- VentureBeat, “Sam Altman at TED 2025,” 47-min conversation, 800M weekly active users, “team is exhausted.” venturebeat.com
- Reuters, “DeepSeek founder Liang Wenfeng puts focus on Chinese innovation,” Li Qiang symposium / extremely low public profile. reuters via aol.com
- ruh.ai, “OpenAI’s Internal Crisis,” 745/770 threatened to resign; CTO/CRO/VP departed within 24h; Superalignment team disbanded. ruh.ai
- Business Insider via AOL, “12 executives, researchers who left OpenAI in 2025,” ≥7 departed to Meta’s Superintelligence Lab. aol.com
- JobsByCulture, “OpenAI Layoffs 2026,” 2-yr retention 67%C vs Anthropic 80% vs DeepMind 78%; founding 11 down to 2 active. jobsbyculture.com
- Fortune, “OpenAI’s week of chaos has reopened a festering wound,” “systemic stress fractures.” fortune.com
- digidai, “OpenAI 2024-2025: The Company That Won Everything and Lost Its Way.” digidai.github.io
- tech-insider.org, “Anthropic Claude Code Source Code Leak: Full Analysis (2026),” 512K lines/1,906 files/44 feature flags/Mythos model/84K stars. tech-insider.org
- Zscaler ThreatLabz, “Anthropic Claude Code Leak,” trojanized versions/backdoors/data stealers. zscaler.com
- The Hacker News, “Claude Code Source Leaked via npm Packaging Error,” supply chain attack/typosquatting. thehackernews.com
- The Hill, “2 researchers exit OpenAI, Anthropic over AI ethics concerns,” Mrinank Sharma resignation/”the world is in danger.” thehill.com
- PMC/NIH, “Bureaucratization in Public Research Institutions,” increased admin time → declining scientific output. pmc.ncbi.nlm.nih.gov
- FAS, “Measuring Research Bureaucracy,” nearly half of research time on admin; $1 of every $5 in research funding goes to compliance. fas.org
- Yale Daily (Bowdoin Review), “Death By a Thousand Emails,” quoting Christakis: “bureaucracy by nature grows without restraint.” bowdoin.edu
- David Graeber, Bullshit Jobs: A Theory, Simon & Schuster, 2018.
- Fred Brooks, The Mythical Man-Month, Addison-Wesley, 1975.
- PMC, “The perceived decline of ‘disruptive’ science and technology.” pmc.ncbi.nlm.nih.gov
- arXiv, “What Leads to Administrative Bloat? A Dynamic Model.” arxiv.org
- DeepSeek-V3 Technical Report, arXiv:2412.19437, 2,048 H800 / 2.788M GPU-hours / $5.576M. arxiv.org
- CSIS, “DeepSeek: A Deep Dive,” SemiAnalysis est. ~50K Hopper GPUs / $1.63B capex / cost is final run only. csis.org
- Stanford FSI, “Taking Stock of the DeepSeek Shock,” 50K H800 cluster / 10K A100. stanford.edu
- SemiAnalysis Newsletter, “DeepSeek Debates,” 10K H100 + 10K H800 + H20 orders / GPU investment >$500M. semianalysis.com
- Towards Data Science, “The Counterintuitive Networking Decisions Behind OpenAI’s 131,000-GPU Training Fabric.” towardsdatascience.com
- CNBC, “How $160M worth of export-controlled Nvidia chips were allegedly smuggled into China,” Operation Gatekeeper. cnbc.com
- tech-insider.org, “Super Micro $2.5B Chip Smuggling Case.” tech-insider.org
- Fortune, “Encrypted texts reveal how Nvidia chips are being smuggled to China.” fortune.com
- Tom’s Hardware, “Former Chinese gaming company accused of smuggling banned AI GPUs,” DeepSeek SE Asia “ghost data centers” allegations. tomshardware.com
- GamersNexus, “THE NVIDIA AI GPU BLACK MARKET,” Asia field investigation / “In China, it’s just ‘the market.'” gamersnexus.net
- Securities Times, “DeepSeek, big news! Planning 2B yuan fundraise?” High-Flyer 2025 return 56.55% / AUM >70B yuan. stcn.com
- Sina Finance, “Liang Wenfeng from ‘quant-funded AI’ to embracing capital,” first round valuation ≥$10B / V4 training at billion-dollar scale. sina.com.cn
- Sina Finance, “20 billion yuan out of his own pocket!” Liang’s personal contribution of 20B yuan (~$2.8B). sina.com.cn
- TMTPost, “Behind DeepSeek V4, Liang Wenfeng’s pivot,” 84.29% ownership / 100% voting rights / rejected Tencent & Alibaba. tmtpost.com
- Sina Finance, “High-Flyer annual returns exposed,” Nature 2025 Ten People / world’s sixth-largest unicorn. sina.com.cn
- 36Kr, “DeepSeek V4 Released: Five Subjective Questions Remain,” 2025 mid-year training failureD / chip adaptation disagreements / key talent loss / funding window. 36kr.com
- FT via Reuters, “DeepSeek’s launch of new AI model delayed by Huawei chip issues,” three sources / Huawei chips unable to complete training. investing.com
- TechRadar, “Chaos at DeepSeek as R2 launch crashes into hardware problems.” techradar.com
- Thomas Wiegold Blog, “DeepSeek V4 Review,” “best value but not the best coder” / SWE-Bench Pro ~55%. thomas-wiegold.com
- Morph, “DeepSeek V4 (2026): Specs, Benchmarks,” LiveCodeBench 93.5% / Codeforces 3206 / SWE-bench 80.6%. morphllm.com
- Hugging Face, deepseek-ai/DeepSeek-V4-Pro, FP4+FP8 mixed precision / 1.6T params / 49B active. huggingface.co
- Business of Apps, “DeepSeek Revenue and Usage Statistics (2026),” 130M active users / 173M downloads / US 5%. businessofapps.com
- Microsoft AI Economy Institute, “Global AI Adoption in 2025,” “one of the most unexpected developments.” microsoft.com
- ComputeLeap, “DeepClaude: Run Claude Code on DeepSeek for 90% Less,” HN #1 / 606 points / canceling Claude subscriptions. computeleap.com
- ChinaTalk, “DeepSeek V4,” Huawei Ascend 950PR adaptation / training framework migration / compute-constrained. chinatalk.media
- OpenAI, “Techniques for training large neural networks.” openai.com
- Rafiul Alam, “The Dunbar Number: Why Organizations Break Down After 150 People.” alamrafiul.com
- Caijing, “Is DeepSeek’s core talent really leaving? The answer is hidden in 27 papers,” 144 days, 53 models / team stability analysis. caijing.com.cn
- YiCAI (CBN), May 20, 2026, DeepSeek internally forms Harness team / code agent product direction / internal benchmark: Claude Code.
- VentureBeat, “Anthropic says it hit a $30 billion revenue run rate after ‘crazy’ 80x growth,” Q1 2026 annualized 80× growth / $87M→$30B. venturebeat.com
- BigGo Finance, “Shifting Dynamics in AI Coding Agents,” Claude Code $2.5B+ / Cursor $2B+ / Codex $1B+. biggo.com
- SQMagazine, “Claude AI Statistics 2026,” Claude 54% enterprise coding model market vs OpenAI 21%. sqmagazine.co.uk
- KuCoin, “OpenAI and Anthropic Launch Price War for AI Coding Tools,” May 14 same-day price war / JetBrains survey 46% most popular. kucoin.com
- GitHub, antirez/ds4, “DeepSeek 4 Flash local inference engine for Metal and CUDA.” github.com
- 36Kr, “Redis Father Steps In to Build Dedicated Inference Engine for DeepSeek V4,” 512GB Mac Studio: 468 t/s prefill, 27.39 t/s generation. 36kr.com
- pasqualepillitteri.it, “DwarfStar4 (DS4) Roadmap by antirez,” MacBook Pro M3 Max 128GB: 26 t/s, peak 50W. pasqualepillitteri.it
- artka.dev, “ds4 by antirez: local coding agent on DeepSeek V4 Flash,” OpenAI/Anthropic-compatible API / direct Claude Code local backend. artka.dev
- Voibe Resources, “Is Claude Code Safe?” consumer accounts default training-on since Aug 28, 2025 / 5-year retention. getvoibe.com
- LumiChats, “You’re Paying $20/Month. ChatGPT Still Trains on You,” Stanford research / 2026 federal court ruling / Claude Oct 2025 terms change. lumichats.com
- Moveo.ai, “Companies Banning ChatGPT (2026),” DNC April 2026 bans ChatGPT and Claude. moveo.ai
- Medium/Aftab, “The Truth About AI Chatbot Data Privacy,” Samsung engineer source code leak / full ChatGPT ban. medium.com
- Secuvy, “LLM Data Security: ChatGPT vs Copilot vs Claude Data Risks,” ChatGPT conversation history bug / Copilot 3M sensitive records / Claude context leakage. secuvy.ai
&
Opus 4.6 · GPT 5.5 · Gemini 3.1 — Cognitive Collective (인지집단)
V5 · MAY 21, 2026
V1 (2026.5.21): Initial version, LEECHO × Opus 4.6. Dunbar number analysis, core node, organizational entropy, compute constraints, financial autonomy, technical validation, market validation.
V2 (2026.5.21): Added five-layer attack chain (Ch. 08) and data sovereignty crisis (Ch. 09).
V3 (2026.5.21): Based on Gemini 3.1 review—added four boundary conditions (Ch. 10), appended time window to conclusion.
V4 (2026.5.21): Based on GPT-5.5 review—evidence grading system, tone calibration, risks expanded to seven, key data annotated with confidence tiers.
V5 (2026.5.21): Based on three-AI cross-review synthesis—evidence grading applied throughout (22 key data points); financial chain layered correction (AUM ≠ cash flow ≠ budget); added competitor positive scale capabilities and countermeasure paths; added AI-augmented Dunbar number ceiling hypothesis, Scaling Law cost flattening hypothesis, white-glove proxy path; added survivorship bias statement and entropy metaphor qualification; conclusion chapter restructured as analytical model (variable weight matrix + competition timeline + seven boundary condition observable trigger indicator table).
Cognitive Collective (인지집단)
LEECHO Global AI Research Lab — Research lead, hypothesis generation, dialogue-driven analysis, revision principle decisions
Anthropic Claude Opus 4.6 — Paper drafting, data retrieval, framework construction, all-version iterative execution
Google Gemini 3.1 Pro — V3 cross-review (logical consistency · physical alignment · risk constraints)
OpenAI GPT-5.5 — V4 cross-review (evidence grading · tone calibration · counter-variable expansion)
Three-AI joint — V5 comprehensive revision (full-text evidence application · variable weights · trigger indicators · dynamic game theory)