SemiAnalysis Dissects Huawei's Kirin 9030: Process Technology Halted, So They Folded the Chip

marsbit發佈於 2026-06-15更新於 2026-06-15

文章摘要

SemiAnalysis has published a detailed teardown report on the HiSilicon Kirin 9030 Pro chipset found in Huawei's Mate 80 Pro. Fabricated using SMIC's most advanced N+3 node without EUV lithography, the analysis reveals significant technical achievements and strategic shifts. The report indicates SMIC's N+3 has achieved transistor density comparable to TSMC's N6 (113.4 vs 107.7 MTr/mm²), primarily through aggressive use of Self-Aligned Quadruple Patterning (SAQP) for its metal layers. This results in a notably small 32.5nm M0 metal pitch. However, SemiAnalysis notes this achievement comes with significantly higher process complexity, cost, and potential yield challenges compared to competitors using more advanced tools. The Kirin 9030 design maximizes this constrained density. While its GPU performance has improved ~70% and matches Qualcomm's 2022 flagship level, the CPU core's IPC lags behind current top-tier designs from Apple and Qualcomm, a gap attributed to the underlying manufacturing technology rather than design capability. Facing long-term restrictions on advanced tools, Huawei is charting a new path. The report highlights the company's "LogicFolding" roadmap, a 3D stacking technique aimed at shortening signal paths to boost performance and efficiency. The goal is to reach 5GHz frequency and a projected density of 295 MTr/mm² by 2031. SemiAnalysis concludes that export controls have not halted China's chip progress but have fundamentally altered its trajectory, mak...

Author: Tide Research

In the field of semiconductor reverse engineering, TechInsights had dominated for decades. Last weekend, Dylan Patel's SemiAnalysis officially released the first public teardown report from its STEEL lab (Teardown Engineering & Evaluation Lab), targeting one of the world's most-watched chips: the Kirin 9030 Pro, manufactured on SMIC's most advanced N+3 process, which powers the Huawei Mate 80 Pro.

The timing is intriguing. While TechInsights is being sold by private equity, SemiAnalysis' revenue has already surpassed that of this established giant. Dylan chose this moment to demonstrate its capabilities with a technically dense teardown report, complemented by real chip photos from its Oregon lab.

The report's headline is a bombshell: SMIC's N+3 process features a minimum metal pitch (M0 pitch) of only 32.5nm, smaller than the 36nm pitch of Intel's latest Panther Lake processors using the 18A process.

SMIC, without EUV lithography machines, achieved a finer metal pitch than Intel?

On its own, this headline could send shockwaves through the semiconductor industry. However, SemiAnalysis itself pours cold water on it in the report's second paragraph, calling it a "cherry picked metric."

This article will interpret this teardown report for you.

Catching Up in Density, at a High Cost

SMIC's N+3 process has indeed caught up with TSMC's N6 in terms of transistor density.

Through TEM (Transmission Electron Microscopy) cross-sectional analysis, STEEL Lab measured the Bohr density of N+3 at 113.4 MTr/mm², slightly higher than TSMC N6's 107.7 MTr/mm². The cell height shrank from 252nm in N+2 to 228nm, and the contacted gate pitch (CGP) reduced from 63nm to 57nm. Taken together, these numbers mean that SMIC, using pure DUV lithography without EUV, has achieved logic density comparable to TSMC's mature 7nm-class processes.

At what cost?

SMIC's M0 layer uses Self-Aligned Quadruple Patterning (SAQP), where a single photomask pattern undergoes four processing steps to achieve finer lines. TSMC's N6 requires only Double Patterning (SADP) for the same layer. Quadruple patterning means more mask counts, stricter overlay accuracy requirements, more complex process flows, and higher costs.

SemiAnalysis directly observed the consequence of SAQP in the cross-sectional images: the M0 trenches of N+3 show a pronounced inverted trapezoid profile (narrower at the bottom than the top), with a clear barrier layer enrichment zone at the trench bottom. While this morphology aids copper filling, the difficulty of process control skyrockets at this 32.5nm pitch.

To use a trader-friendly analogy: SMIC is printing banknotes of the same denomination, but each note costs multiples more to print than TSMC's, with greater yield risk. The density is the same, but the economics are entirely different.

Kirin 9030: Squeezing Every Square Millimeter of Silicon Under Constraints

HiSilicon's chip design prowess is another story.

Looking at die area, the Kirin 9030 is almost the same size as its predecessor, the 9020 (around 140mm²), but packs in more: the CPU upgraded from 1 big core + 3 medium cores to 1 big + 4 medium, GPU compute units increased from 4 to 6, an extra Tiny core was added to the NPU, and caches were expanded across the board. The density improvement from N+3 allowed Huawei to cram more logic into the same die size.

On performance, STEEL Lab referenced public benchmark data, offering a clear positioning: the Kirin 9030's GPU performance (Mali-G935) roughly matches flagship levels from 2022. Its 3DMark WLE score improved 70% over the previous generation, slightly surpassing the Snapdragon 8+ Gen 1, but compared to the current flagship Snapdragon 8 Elite Gen 5, the gap is 2.4x to 2.6x.

The CPU situation is more telling. The IPC of the big core, TaiShan Prime, is roughly at the level of Arm's Cortex-X2, a 2021 design. The IPC of Apple's M1 Firestorm core, released in 2020, remains 35% higher. The latest Apple M5 P-core's IPC is 60% higher, with absolute performance being 2.7x that of the Kirin core.

The root of the gap lies not in design, but in process technology. Apple and Qualcomm use TSMC's N4, N3P, processes that have fundamental advantages in the voltage-frequency curve: more transistors can be packed into the same area, higher frequencies can be achieved at the same power. Huawei's core designs are on par with the industry's previous generation leaders, but are trapped in manufacturing processes two generations behind.

When Process Scaling Stalled, Huawei Prepares to "Fold"

The most forward-looking part of the report is Huawei's τ scaling law and LogicFolding roadmap, presented at the 2026 ISCAS conference.

Traditional semiconductor scaling advances on a two-dimensional plane: shrinking transistors and narrowing metal lines. Moore's Law has progressed for decades, essentially doing just that. The τ scaling Huawei proposes shifts the optimization target from the spatial domain to the temporal domain, focusing on reducing time costs for data movement and processing, including transistor switching delay, signal propagation delay, and compute/memory access latency.

LogicFolding is the engineering implementation of this theory. Simply put, it splits the same logic block into top and bottom layers, stacking them face-to-face and connecting them via ultra-fine-pitch hybrid bonding. The direct benefit is shortening the longest signal paths. In modern chips, a significant portion of power and delay is spent driving long interconnects and repeater buffers. By folding logic vertically, critical paths become shorter, enabling higher frequencies and lower power consumption.

Huawei outlined an aggressive roadmap: While the Kirin 9030's big core runs at 2.75GHz, lab samples have achieved 3.39GHz, with a target of reaching 5GHz by 2031. Concurrently, through 3D stacking, the equivalent density aims to reach 295 MTr/mm², comparable to TSMC's 14A level.

SemiAnalysis remains skeptical. They point out that Huawei's density calculation method differs from traditional foundries: the density for 3D stacking is calculated based on package footprint, naturally yielding higher numbers when stacking multiple active logic layers. Using the same method to calculate AMD's MI450X (N2 top die + N3P base die) would yield a theoretical density of 460.2 MTr/mm², far exceeding Huawei's 2031 target.

However, the direction itself warrants attention. For Huawei, this path essentially involves a systems design company taking on tasks traditionally handled by foundries. AMD's V-Cache implements 3D stacking for cache, and AMD MI350X moves I/O and interconnects to the base die. What Huawei aims to do is more radical—directly splitting a single logic block and distributing it vertically. This represents a challenge of another magnitude in engineering difficulty.

Export Controls Reshape the Dimensions of the Race

SemiAnalysis' concluding point is blunt: Export controls have not stopped China's chip progress, but they have altered the path and the cost of that progress.

SMIC's N+3 proves that N6-level logic density can be achieved without EUV. But this path is more expensive, more complex, and has greater yield challenges. Every step forward increases the marginal difficulty: more masks, stricter overlay control, more costly multiple patterning. Theoretically, N+4 could reach 137.8 MTr/mm² (comparable to TSMC N5), and N+5, with the introduction of backside power delivery, could even approach Intel 18A's HP library density. But each step is harder, more expensive, and has less margin for error than the last.

Meanwhile, SMIC's N+2 and N+3 processes are being transferred to Huahong, potentially benefiting design houses like Alibaba's T-Head and Cambricon. Chip manufacturing knowledge is diffusing from a single foundry to an ecosystem, further diluting the effectiveness of sanctions against individual companies.

On the design side, Huawei and Peking University are already developing domestic EDA tool prototypes for LogicFolding. This doesn't equate to replacing the complete toolchains of Synopsys and Cadence, but domestic EDA is evolving towards "architecture-process-packaging co-optimization."

An interesting detail: STEEL found in its teardown that the DRAM in the Kirin 9030 Pro comes from Samsung (K4L2E165YD, LPDDR5X-9600, 1a node), while the 16GB Pro Max version shows packages from both Samsung and ChangXin Memory (CXMT). The CXMT chip packaging date is marked as week 45 of 2025, with process density equivalent to the industry's 1z level. This indicates that Chinese memory chips have begun entering Huawei's flagship supply chain, albeit still one to two generations behind Samsung and SK Hynix in process technology.

For investors, the signal truly worth tracking is whether Huawei's 3D stacking roadmap can, at a controllable cost, bring Chinese-produced chips to a "good enough" threshold for scenarios like smartphones, AI inference, and networking equipment.

Once "good enough" is established, the strategic value of this supply chain will be repriced.

你可能也喜歡

对话大摩数字战略主管：比特币涨到一百万美元不是不可能，但我希望它慢一点

摩根士丹利数字资产战略主管Amy Oldenburg分享了她对比特币的看法与机构参与路径。她认为，比特币的早期价值逻辑根植于新兴市场对去中心化金融的需求，例如非洲的移动货币解决了传统银行缺失下的安全与普惠问题。尽管摩根士丹利已推出现货比特币ETF（MSBT）并创下发行纪录，但机构大规模采用仍面临挑战。主要原因包括：银行控股公司面临的严格资本监管、财务顾问因比特币价格横盘而持谨慎态度，以及客户教育不足。她指出，当前市场注意力被黄金、AI等热门资产分散，比特币需要时间或一场真正的体系危机来证明其“数字黄金”的价值储备地位。对于未来，Oldenburg预测比特币将缓慢增长而非突然飙升，并认为价格达到100万美元并非不可能，但她希望上涨过程更平稳以减少波动性。她强调，理解比特币与其他加密资产的差异、以及自我保管与通过ETF持有敞口的区别至关重要。最后，她认为行业仍处早期，密码朋克精神与机构服务可以共存，共同推动数字资产领域的长期发展。

marsbit8 分鐘前

marsbit8 分鐘前

加密市场的并购交易正异常活跃

加密市场并购交易正异常活跃，本月并购案例已达10笔，同期融资仅14笔，并购占一级市场交易总数比例升至约42%，为历史最高水平。这并非行业繁荣的信号，而是融资市场持续低迷的反映。自2024年11月以来，月度融资交易数量已从约100笔大幅下滑至50笔左右，并购则成为最稳定的交易形式。估值走低、时间与试错成本考量、获取牌照合规资源、打通产业链是推动并购的四大主因。头部公司如Coinbase、Kraken、Ripple、MoonPay等正借此窗口，以更低成本和更强议价权收购团队、技术、牌照与市场入口。交易基础设施（如衍生品平台）、支付与稳定币、RWA资产发行与分销是当前并购的核心方向。并购升温为创业者提供了除代币上市外的另一条退出路径，可能促使团队更关注产品、收入和可被整合的战略价值。但同时，行业资源正加速向巨头集中，加密世界可能走向更中心化的格局，合规壁垒抬高，创业门槛随之提升。

链捕手14 分鐘前

加密市场的并购交易正异常活跃

加密市场并购交易异常活跃，近期并购案例占一级市场交易总数比例已达约42%，创历史新高。这主要反映融资市场持续颓势，并购并未取代融资热度，而是在融资收缩后成为最稳定的交易形式。并购持续高涨的原因包括：项目估值足够便宜，买方议价权增强；收购可节省时间与试错成本，快速补齐关键能力；能获取重要牌照与合规资源；有助于巨头打通产业链上下游，实现集团化扩张。当前并购重心集中在四类领域：交易基础设施（如衍生品平台）、支付与稳定币（构建支付网络）、合规牌照、以及资产发行与分销（掌握交易源头）。这表明头部公司正从单点产品向综合金融生态演进。并购升温为创业者提供了除代币上市外的另一条退出路径，鼓励团队更关注产品、收入和可被整合的战略价值。但同时，行业正变得更加中心化，资源向少数巨头集中，合规壁垒抬高，创业门槛显著提升，加密行业可能逐渐形成类似传统金融的格局。

marsbit15 分鐘前

Solana 隐私生态全景图，从计算到 AI 的完整隐私栈

Solana隐私生态仍处于早期阶段，但正在快速发展。其独特架构（如ZK压缩）有望实现无需持久Rollup的大规模可组合隐私协议。理想的“最终隐私栈”可能是全同态加密（FHE）与零知识证明（ZK）的结合。目前隐私计算主要由Arcium和MagicBlock提供。Arcium利用多方计算（MPC）构建可定制的执行环境，处理加密数据并正在开发保密代币标准（C-SPL），应用场景包括私密支付、数据分析及医疗保健。MagicBlock则基于可信执行环境（TEE）创建私密临时Rollup，确保交易的机密性、可扩展性和可组合性。两者基础设施催生了诸多应用。私密转账与余额方面，Umbra基于Arcium构建，提供加密代币账户，实现金额、余额和关联关系的隐私，并支持选择性审计。Privacy Cash采用类似Tornado的屏蔽池处理SOL，Hush则整合了质押收益和私密兑换功能。为消除链上痕迹，encifherio通过包装代币和TEE保护兑换隐私；Vanish Trade利用屏蔽流动性路由隐藏交易策略；Darklake构建ZK原生流动性基础设施和暗池，防止前端运行。更高级的应用如私密预测市场（如Melee Markets）利用Arcium加密订单簿，保护参与者策略。在私密AI领域，Loyal结合Magic Block和Arcium技术，在链上加密存储和处理用户数据、对话及交易，确保用户拥有数据控制权。总体而言，Solana隐私生态正从基础计算层向复杂应用层构建，覆盖支付、DeFi、预测市场及AI等多个垂直领域。

Foresight News21 分鐘前

Foresight News21 分鐘前

Orbixbit.com 加密货币交易所评测

加密货币市场获利日益困难，促使许多用户开始尝试功能类似的小型交易平台，Orbixbit便是其中之一。该中心化交易所成立于2018年，提供多种主流及山寨币交易，以其简洁界面、低交易费用、快速验证和支持多种交易风格（如现货、合约、保证金交易）吸引用户。平台适合新手和经验者，并提供自动化交易工具。安全方面，Orbixbit采用双因素认证、冷钱包存储、数据加密和多步提现验证等措施，并声称受塞浦路斯证监会（CySEC）监管。平台仅支持加密货币充值与提现，流程简单，但处理速度可能受区块确认时间影响。总体而言，Orbixbit凭借低费率、直观界面和多样交易工具积累了用户。它提供实时行情、图表和被动盈利选项，但小币种选择有限，部分交易对流动性不及大型交易所。对于寻求兼具基础与高级功能的交易平台用户，它是一个值得考虑的选项。

TheNewsCrypto58 分鐘前

交易

現貨

合約

SemiAnalysis Dissects Huawei's Kirin 9030: Process Technology Halted, So They Folded the Chip

文章摘要

Catching Up in Density, at a High Cost

Kirin 9030: Squeezing Every Square Millimeter of Silicon Under Constraints

When Process Scaling Stalled, Huawei Prepares to "Fold"

Export Controls Reshape the Dimensions of the Race

相關問答

你可能也喜歡

对话大摩数字战略主管：比特币涨到一百万美元不是不可能，但我希望它慢一点

加密市场的并购交易正异常活跃

加密市场的并购交易正异常活跃

Solana 隐私生态全景图，从计算到 AI 的完整隐私栈

Orbixbit.com 加密货币交易所评测

交易

熱門文章

如何購買CHIP

相關討論

熱門問答

熱門分類

熱門標籤