τ Scaling: Huawei's New Growth Engine Designed for the Post-Moore Era

marsbitPublicado a 2026-05-25Actualizado a 2026-05-25

Resumen

**Tau Scaling: Huawei's New Growth Engine for the Post-Moore Era** For 60 years, progress in semiconductors was driven by Moore's Law – making transistors smaller, denser, and cheaper. This path has now stalled due to plummeting returns below 7nm, astronomical lithography costs, and rising per-transistor expenses. After six years and testing 381 production chips, Huawei’s semiconductor team proposes a fundamental shift: **stop competing on size, start competing on time**. This is the core of their "τ (Tau) Scaling" theory. It treats *time* as the key optimization metric, compressing characteristic delays (τ) across all levels – from transistor switching (picoseconds) to data center tasks (seconds), spanning 12 orders of magnitude. **What is τ Scaling?** It holistically minimizes delay/time constants (τ) across four layers: transistors (switching speed), circuits (signal delay), chips (compute/memory access), and systems (end-to-end communication). The goal is to align optimization from process and circuit design to architecture and systems using this unified metric. **Mobile Application: LogicFolding** Without advancing the process node, this technique vertically stacks chips using ultra-precision hybrid bonding, distributing critical paths across layers ("stacking floors"). Results include a 55% transistor density increase, 41% better energy efficiency, over 40% higher SRAM frequency, and a roadmap targeting 4GHz by 2029. **AI Data Center Application: Full-Link Latency ...

For the past 60 years, the semiconductor industry has been driven by shrinking transistor sizes (Moore's Law) for progress—making them smaller, denser, and cheaper.

But now this path is stalling:

Benefits of processes below 7nm plummet
Lithography machine costs are astronomical
Design cost for a single advanced-node chip exceeds $10 billion
Cost per transistor is no longer falling, but rising

Huawei's semiconductor team, after 6 years and verification across 381 mass-produced chips, has identified a new direction:

Stop competing on size, start competing on time.

They propose the τ Scaling theory (τ Scaling):

Treat "time" as the core optimization metric, compressing the characteristic time τ across the entire technology chain—from transistor switching (picoseconds) to data center tasks (seconds), covering 12 orders of magnitude.

Simply put:

It used to be about who is smaller; now it's about who is faster, has lower latency, and higher efficiency.

1. What Exactly is τ Scaling?

τ represents the delay/time constant at each layer, divided into four levels:

Transistor: Switching speed
Circuit: Signal transmission delay
Chip: Computation and memory access latency
System: End-to-end communication and synchronization time

The goal is to compress τ stack-wide. Process, circuits, architecture, and systems are optimized using the same set of metrics, ending siloed optimization.

2. Mobile Implementation: LogicFolding

Without advancing the process node, chips are stacked vertically. Ultra-precise hybrid bonding splits critical paths across multiple layers, essentially "adding floors" to the chip.

Transistor density: Increased by 55% in one generation, from 155→238 million/mm²
Energy efficiency: Improved by 41%, with main frequency rising nearly 13%
SRAM frequency: Increased by over 40%
Kirin 2026 target: 3.1GHz main frequency, aiming for 4GHz by 2029

3. AI Data Center Implementation: End-to-End Latency Reduction

In AI clusters, 80% of energy consumption and 70% of cost come from data movement. The core is reducing communication time.

1. Unified Bus

Removes multi-layer protocols, slashing remote access latency from tens of microseconds to about 100 nanoseconds—500 times faster.

2. Hi-ONE Optical Interconnect

Single module achieves 8Tb/s. Replaces copper with fiber optics, extending distance from 1 meter to 100 meters, suitable for 10,000-card clusters.

3. 3D Folding

Solves the problem of "area scaling outpacing I/O scaling" in 2.5D packaging. Moves memory, power delivery, and optical interfaces to vertical layers, scaling them in sync with computing power.

Prediction: AI hardware integration density to increase over 100x by 2035

4. Logic and Memory Re-integration

CPUs and memory developed separately in the past. In the AI era, where data movement is more critical than computation, memory and logic must be tightly 3D integrated. Industry influence is shifting towards memory and packaging.

5. Remaining Challenges

EDA tools need to adapt to 3D stacking design
Optimization needed for wafer-to-wafer process variations and vertical interconnect loss
New standards for energy efficiency and benchmarks required

Conclusion

The size-centric era of Moore's Law is over; the era of time scaling has begun.

Without fixating on cutting-edge lithography machines, continuous improvements in performance and energy efficiency are still achievable through 3D stacking, system architecture, and interconnect optimization.

This will be the core trajectory for semiconductors in the next decade.

Preguntas relacionadas

QWhat is the core concept of τ Scaling proposed by Huawei?

Aτ Scaling shifts the optimization focus from transistor size to time (τ) as the key metric. It aims to compress the characteristic time τ across the entire technology stack, from transistor switching speeds to system-level task latencies, to achieve faster performance, lower latency, and higher efficiency.

QHow does LogicFolding, as an application of τ Scaling in mobile chips, improve performance?

ALogicFolding vertically stacks chips using ultra-precise hybrid bonding to distribute critical paths across multiple layers. Without upgrading the manufacturing process, it increases transistor density by 55% (e.g., from 155 to 238 million/mm²), boosts energy efficiency by 41%, raises the main frequency by nearly 13%, and increases SRAM frequency by over 40%.

QWhat are the key technological implementations for reducing latency in AI data centers according to the τ Scaling approach?

AThe key implementations are: 1) Unified Bus, which reduces remote access latency from tens of microseconds to about 100 nanoseconds; 2) Hi-ONE optical interconnects, offering 8 Tb/s per module and extending reach from 1 meter to 100 meters; and 3) 3D Folding, which vertically integrates components like memory and power to scale with computing power and avoid 2.5D packaging bottlenecks.

QWhy does the article argue that the semiconductor industry is shifting focus from scaling size to optimizing time?

ABecause traditional scaling (Moore's Law) has become unsustainable: benefits plummet below 7nm, lithography costs are exorbitant, chip design fees exceed $1 billion, and the cost per transistor is rising instead of falling. Therefore, the new paradigm is to compete on speed, latency, and efficiency by optimizing the time factor τ across all levels.

QWhat are some of the remaining challenges for implementing the τ Scaling strategy mentioned in the article?

AChallenges include: adapting EDA tools for 3D stacked design; optimizing process variations between wafers and losses in vertical interconnects; and establishing new standards for energy efficiency and benchmarking to support this new approach.

Lecturas Relacionadas

After $HYPE Hits a New High, Is It Worth Considering the Stock of "HYPE Version MicroStrategy" $PURR?

**HYPE Hits New Highs: Is $PURR, the "HYPE Version of MicroStrategy," Worth Considering?** The stock of Hyperliquid Strategies (NASDAQ: $PURR), a publicly-traded company that exclusively buys and holds the cryptocurrency HYPE, has gained over 100% year-to-date, mirroring HYPE's own 150% surge to new all-time highs. This has sparked discussions about PURR being a more "capital-efficient" play than MicroStrategy's bitcoin strategy, given its reported ~$1 billion unrealized gain on a ~$220 million investment. The article clarifies that PURR is essentially a pure-play wrapper for HYPE, with no other business. It resulted from a 2025 SPAC merger led by firms like Paradigm and Atlas Merchant Capital, bringing traditional finance veterans to its board. Its value is entirely derived from the price of HYPE. While PURR offers a crucial compliance bridge for US-based institutional and retirement accounts unable to access HYPE directly, the analysis questions the "capital efficiency" narrative. The outsized gains are attributed to HYPE's exceptional performance, not superior corporate strategy. For investors who can buy HYPE directly, holding PURR introduces unnecessary risks: potential shareholder dilution from future stock offerings, incomplete passthrough of staking rewards, market hour mismatches, and counterparty risk via its single custodian. A key metric is its mNAV (modified net asset value). Current calculations show PURR trades at a discount to its HYPE holdings, but this could flip to a premium depending on the execution of registered share issuances. The article concludes that PURR is primarily a "conduit product." The investment thesis hinges entirely on one's bullishness on HYPE itself, not on the PURR wrapper, which adds friction and risk for those with direct crypto access.

marsbitHace 12 min(s)

After $HYPE Hits a New High, Is It Worth Considering the Stock of "HYPE Version MicroStrategy" $PURR?

marsbitHace 12 min(s)

The Real Progress and Investment Opportunities of Decentralized AI Computing Power Networks in 2026

In 2026, the AI compute market is marked by centralized GPU consolidation and a significant GPU shortage for smaller players. In this context, Decentralized Physical Infrastructure Networks (DePIN), valued at $9.4B+, have emerged as a viable, revenue-generating alternative. Leading protocols like Aethir ($150M ARR), io.net (130k+ GPUs), Akash, Bittensor, and Render are carving out distinct niches, moving beyond hype to deliver verifiable income primarily from non-crypto-native clients. The key advantage of decentralized GPU networks lies in serving latency-tolerant, cost-sensitive workloads like AI inference, fine-tuning, data preprocessing, and agent operations, offering substantial cost savings (45-80%) compared to major cloud providers. However, reliability variance, lack of robust SLAs, and fragmented tech stacks remain significant adoption hurdles. The sector is maturing with critical 2026 shifts: 1) Evolution of tokenomics towards demand-driven, revenue-linked models (e.g., Render's BME, io.net's IDE), and 2) Clearer enterprise adoption pathways, with traditional firms integrating decentralized compute. For new entrants, opportunities are now concentrated in specialized tooling layers (orchestration, verification, SLA management), vertical applications (e.g., bio-med, content generation), and innovative token designs tied to real usage, rather than generic GPU aggregation. The convergence with the emerging AI Agent economy presents a significant future growth vector.

marsbitHace 13 min(s)

The Real Progress and Investment Opportunities of Decentralized AI Computing Power Networks in 2026

marsbitHace 13 min(s)

The Iran Deal Is Not a Finale, But a 60-Day Political Respite

The reported "Iran deal" is not a comprehensive peace agreement, but a proposed 60-day memorandum of understanding. During this period, Iran would clear mines from the Strait of Hormuz, the US would lift its naval blockade, and Iran would receive a sanctions waiver to sell oil, with formal nuclear negotiations to follow. However, the core structural conflicts remain unresolved: Iran's stockpile of highly enriched uranium, ultimate control of the Strait, the sequence of sanctions relief versus nuclear concessions, and the potential for Israeli unilateral action to derail the process. The article highlights deeper geopolitical currents: China is indirectly shaping talks via Pakistan to restore Iranian oil flows and curb US dominance, while trade routes through Oman and the UAE create leaks in the US blockade. Parallel US diplomatic efforts aim to draw India into its security framework, though India remains cautious. Ultimately, the deal offers short-term political breathing room for both Washington and Tehran. The true test will come after the 60-day window, when the irreconcilable demands—US insistence on nuclear rollback versus Iran's determination to retain the Strait as a deterrent—will resurface, determining if this is a path to peace or merely a managed pause in the conflict.

marsbitHace 23 min(s)

The Iran Deal Is Not a Finale, But a 60-Day Political Respite

marsbitHace 23 min(s)

We Captured Thousands of Job Postings and Discovered ByteDance is Reviving Smartphone R&D

This article analyzes ByteDance's recent hiring activities, revealing a potential restart of smartphone hardware development. By scraping and analyzing thousands of ByteDance job postings, the authors identify three key categories: roles for the "Doubao Phone Assistant" (an AI agent), for a "Mobile OS" (system-level development), and for hardware/engineering positions in Shenzhen (a manufacturing hub). The piece traces the context to the 2025 launch of the "Doubao Phone," a concept device that integrated an AI agent directly into a smartphone, allowing it to see the screen, operate apps, and perform tasks like shopping or booking tickets. While innovative as an early AI Agent prototype, it faced operational restrictions from major platforms like WeChat and Alipay. The new hiring signals a deeper commitment. "Doubao Phone Assistant" roles focus on core Agent capabilities (task execution, memory, cross-app operation). "Mobile OS" positions involve deep system work (kernel, chip adaptation, power/thermal management) necessary for a responsive, always-on AI. Shenzhen-based hardware roles (structure design, testing, production) suggest preparation for physical device manufacturing. The article concludes that in the AI era, where phones may become an Agent's "body," controlling the operating system and hardware is critical. For a company like ByteDance, being merely an app within others' ecosystems is no longer sustainable if it aims to own the next-generation user interface. Therefore, while a consumer phone brand isn't confirmed, ByteDance is decisively moving beyond app development into the complex domain of system-level and hardware-integrated AI.

marsbitHace 43 min(s)

We Captured Thousands of Job Postings and Discovered ByteDance is Reviving Smartphone R&D

marsbitHace 43 min(s)

AI Impact on SaaS Software Stocks: Deconstructing the Bottom-Fishing Logic of Salesforce, ServiceNow, and Snowflake

"AI Nightmare for SaaS Stocks: Unpacking the Bottom-Fishing Logic for Salesforce, ServiceNow, and Snowflake" A deep dive analysis argues that the recent collapse in SaaS software stocks, dubbed the "SaaS Doom," presents a contrarian buying opportunity. The market panic, triggered by fears that AI will disrupt traditional per-user subscription models through "seat compression" and AI agents bypassing software UIs, has led to extreme selling in the software sector. The analysis evaluates three major players under a unified framework: 1. **Salesforce (CRM):** Positioned as a "margin of safety" play. Trading at historically low valuations (13-14x forward P/E), with strong cash flow and a massive buyback, it offers value. Its key challenge is transitioning from a "seat economy" to an AI-driven "task economy" with its Agentforce platform. 2. **ServiceNow (NOW):** The "clearest AI narrative" play. Its "AI Control Tower" strategy aims to become the governance and orchestration layer for enterprise AI agents, benefiting from AI proliferation. Backed by Nvidia's CEO, it trades at a relatively low valuation post-correction. 3. **Snowflake (SNOW):** The "high-risk, high-reward" bet. Its consumption-based model aligns with rising AI workloads, and its RPO growth is strong. However, it faces intense competition (e.g., Databricks), is not yet GAAP profitable, and carries the highest valuation. The conclusion counters the simplified "AI kills software" narrative. AI is eliminating software that sells only functional interfaces but rewarding platforms that provide essential infrastructure, data, and governance. The current sell-off may have created a buying opportunity for resilient software leaders positioned as future AI infrastructure platforms.

marsbitHace 1 hora(s)

AI Impact on SaaS Software Stocks: Deconstructing the Bottom-Fishing Logic of Salesforce, ServiceNow, and Snowflake

marsbitHace 1 hora(s)

Trading

Spot

Futuros

Artículos destacados

Cómo comprar ERA

¡Bienvenido a HTX.com! Hemos hecho que comprar Caldera (ERA) sea simple y conveniente. Sigue nuestra guía paso a paso para iniciar tu viaje de criptos.Paso 1: crea tu cuenta HTXUtiliza tu correo electrónico o número de teléfono para registrarte y obtener una cuenta gratuita en HTX. Experimenta un proceso de registro sin complicaciones y desbloquea todas las funciones.Obtener mi cuentaPaso 2: ve a Comprar cripto y elige tu método de pagoTarjeta de crédito/débito: usa tu Visa o Mastercard para comprar Caldera (ERA) al instante.Saldo: utiliza fondos del saldo de tu cuenta HTX para tradear sin problemas.Terceros: hemos agregado métodos de pago populares como Google Pay y Apple Pay para mejorar la comodidad.P2P: tradear directamente con otros usuarios en HTX.Over-the-Counter (OTC): ofrecemos servicios personalizados y tipos de cambio competitivos para los traders.Paso 3: guarda tu Caldera (ERA)Después de comprar tu Caldera (ERA), guárdalo en tu cuenta HTX. Alternativamente, puedes enviarlo a otro lugar mediante transferencia blockchain o utilizarlo para tradear otras criptomonedas.Paso 4: tradear Caldera (ERA)Tradear fácilmente con Caldera (ERA) en HTX's mercado spot. Simplemente accede a tu cuenta, selecciona tu par de trading, ejecuta tus trades y monitorea en tiempo real. Ofrecemos una experiencia fácil de usar tanto para principiantes como para traders experimentados.

359 Vistas totalesPublicado en 2025.07.17Actualizado en 2025.07.17

Discusiones

Bienvenido a la comunidad de HTX. Aquí puedes mantenerte informado sobre los últimos desarrollos de la plataforma y acceder a análisis profesionales del mercado. A continuación se presentan las opiniones de los usuarios sobre el precio de ERA (ERA).