NVIDIA Starts Selling the Method of Making Shovels

marsbitPubblicato 2026-04-15Pubblicato ultima volta 2026-04-15

Introduzione

NVIDIA is accelerating its own GPU design process with AI tools, reducing tasks that took 8 engineers 1010 months to a single GPU running overnight, while achieving better results. However, it is not using AI to fully design GPUs yet. Instead, it is embedding its technology across the chip industry. A key move is its $2020 billion investment in Synopsys, a leading EDA (Electronic Design Automation) tool provider, and partnerships with Cadence and Siemens. This integrates NVIDIA’s hardware acceleration into essential chip design software, making its GPUs the fastest platform for EDA tools. Competitors like AMD and Intel now face a dilemma: use slower tools or rely on NVIDIA’s hardware to design competing chips. Chinese GPU firms, despite high market valuations, struggle with massive losses and dependence on restricted foreign EDA tools. NVIDIA’s strategy extends from AI-enhanced design to manufacturing, creating a self-reinforcing cycle where its hardware becomes essential for developing faster chips—even for those aiming to compete with it.

Author: Ada, Deep Tide TechFlow

San Francisco, San Jose Convention Center, GTC live.

NVIDIA's Chief Scientist Bill Dally sat on stage, facing Google's Jeff Dean. Halfway through their conversation, Dally dropped a number: "Previously, migrating a standard cell library containing about 2500 to 3000 units required a team of 8 engineers taking about 10 months."

He paused.

"Now it only takes a single GPU card, running overnight."

There were no gasps from the audience, because those who understood the statement knew what it meant. The work of 8 engineers over 10 months was devoured overnight by a GPU produced in-house. And Dally added: the results matched or even exceeded human design in the three key metrics of area, power consumption, and latency.

The next day, news outlets interpreted it as "NVIDIA uses AI to design GPUs."

But the truth of the matter is far more intriguing than the headline.

What is NVIDIA running internally?

What NVIDIA runs internally isn't a black box either; it's a set of toolchains refined over several years.

NB-Cell is a reinforcement learning-based program, specifically tackling the grueling work of standard cell library migration. Prefix RL aims to solve the long-standing research challenge of placement in the lookahead stage of carry-lookahead chains. Dally stated that the layouts generated by this system are something "humans could never have conceived of," with key metrics improving by about 20% to 30% compared to human design.

Then there are the two internal LLMs, Chip Nemo and Bug Nemo. NVIDIA fed these two large models with the RTL code, architecture documents, and design specifications of every GPU in its history. According to Dally's description, this is equivalent to distilling NVIDIA's twenty years of muscle memory from G80 to Blackwell into an internal model. New hires essentially interface directly with a senior engineer possessing twenty years of experience.

So, can "AI design GPUs" now?

Quite the opposite. Dally's exact words were: "I would love to one day just say 'design me a new GPU,' but we are still far from that step."

NVIDIA did not use AI to design a GPU. But it did something else that will make the entire industry unable to function without it.

$2 Billion Entry into the Heart of EDA

On December 1, 2025, NVIDIA invested $2 billion in Synopsys, one of the three EDA giants. The two parties signed a joint development agreement to embed NVIDIA's accelerated computing stack into Synopsys's entire EDA workflow. Blackwell and the next-generation Rubin GPU are to be deeply integrated with Synopsys.ai.

Synopsys's position needs some explanation. Nearly every advanced process chip globally—Apple's M-series, AMD's MI-series, Google's TPU—runs on toolchains from either Synopsys or Cadence during the design phase. These two, plus Siemens EDA, monopolize the underlying tools for chip design. You might not use Qualcomm's chips, you might not use TSMC's production lines, but you cannot escape the software from these three companies.

Three months after the investment in Synopsys, NVIDIA brought Cadence, Siemens, and Dassault on board, announcing that they are all developing AI-driven chip design tools based on NVIDIA GPUs.

The benchmark data released by NVIDIA is quite staggering: Synopsys PrimeSim is 30x faster on Blackwell, Proteus is 20x faster, Sentaurus achieves a 12x speedup on B200 compared to CPU. MediaTek used H100 to accelerate Cadence Spectre by 6x. Astera Labs used Synopsys + NVIDIA to accelerate chip verification by 3.5x.

One detail is worth highlighting separately: Cadence's Millennium M2000 platform is labeled as "exclusively based on NVIDIA Blackwell, specifically built for the EDA market."

The word "exclusively" is most telling. This means that EDA tools, which previously ran on CPUs (Intel, AMD could play), now require buying NVIDIA cards if you want the fastest EDA.

The True Shape of the Flywheel

NVIDIA's flywheel, as most people understand it, goes like this: sell GPUs to AI companies, AI companies train large models, large models prove GPUs are irreplaceable, more people buy GPUs.

This flywheel is scary enough. But there's another layer beneath it.

NVIDIA uses its own tools to design the next generation of GPUs, creating a generational gap in design efficiency, while simultaneously tying the entire industry's EDA toolchain to its own hardware. Competitors want to catch up, but even the tools for catching up must be rented from NVIDIA's ecosystem.

The anxiety hidden behind AMD's earnings report that caused its stock to plunge is precisely this. Even though NVIDIA and Synopsys ostensibly say the "investment does not carry any obligation to purchase NVIDIA hardware," the market understands: accelerated EDA features debut first on NVIDIA hardware. AMD and Intel can only rely on a path "optimized for their biggest competitor's platform."

Imagine an AMD engineer in the future wanting to design a chip to rival Blackwell. They open Synopsys's tool, which runs fastest on NVIDIA GPUs. So, they either endure a design cycle twice as long, or buy a bunch of NVIDIA cards to design the chip meant to defeat NVIDIA.

The shovels are still being sold. But the way they are sold has changed.

The Real Situation of Domestic GPUs

At this point, it's necessary to present some sobering numbers.

In the same fiscal year 2025 that NVIDIA's net profit exceeded $70 billion, the domestic GPU "Four Little Dragons"—Moore Threads, MetaX, Biren, and Enflame—were queuing up before the IPO window.

Moore Threads' prospectus shows that from 2022 to 2024, the cumulative net loss was 5 billion yuan, with another 271 million yuan loss in the first half of 2025. As of June 30, the accumulated uncovered loss was 1.478 billion yuan. The company's management itself estimates that achieving consolidated profitability will be possible no earlier than 2027. MetaX is slightly better, with cumulative losses exceeding 3 billion yuan over three years. The worst is Biren, with losses exceeding 6.3 billion yuan over three and a half years. Its revenue in the first half of 2025 was only 58.9 million yuan, not even a fraction of Moore Threads' 702 million yuan during the same period.

Look at the R&D intensity. Moore Threads' R&D expenses as a percentage of revenue were 2422.51% in 2022, and still as high as 309.88% in 2024. The money spent on R&D in a year is more than three times the revenue. This isn't business operation; it's life support, sustained by continuous输血 (transfusions) from the primary market and the recently opened Sci-Tech Innovation Board (STAR Market) window.

The tool situation is even more卡脖子 (a chokehold). Empyrean Technology's (Huada Jiutian) 2022 IPO prospectus showed its tools only partially support 5nm advanced processes. Primarius Technologies (Gailun Electronics) can cover 7nm/5nm/3nm nodes, but only makes point tools, far from a full flow.

Empyrean founder Liu Weiping was very candid: "Domestic EDA still has obvious deficiencies in supporting advanced processes, especially current ones like 7nm, 5nm, 3nm. Currently, domestic EDA can achieve 14nm level. Although 7nm process technology is mastered, the deep integration of 7nm with practical applications still requires协同发力 (collaborative effort) from the entire industry chain."

In other words, full-flow EDA for advanced processes is basically unusable domestically. Domestic GPU companies still use Synopsys and Cadence to design chips. In 2025, Trump once announced export controls on all critical software. Although not substantially implemented, EDA tools for advanced processes below 7nm remain under strict control. When the license gets cut off, the switch is in someone else's hands.

The capital market's reaction is surreal enough. On its listing day, MetaX's stock closed at 829.9 yuan, a single-day increase of 692.95%. After Moore Threads went public, its stock price once rose to become the third highest in the A-share market,仅次于 (only behind) Kweichow Moutai and Cambricon. Some media calculated its total market capitalization at about 359.5 billion yuan based on the stock price at the time.

The real business behind the numbers is this: a group of companies that are still burning money and losing, still reliant on controlled foreign toolchains to continue designing chips, are being priced in the secondary market as the successors to the "domestic NVIDIA."

And the set of tools these companies use to design chips is becoming part of the NVIDIA ecosystem. NVIDIA's $2 billion binding with Synopsys, Cadence Millennium M2000's "exclusively based on NVIDIA Blackwell" label, make the act of catching up itself a paradox.

A Complete Chain from Design to Manufacturing

Back to that GTC conversation.

Dally was very humble throughout. "AI is still far from designing chips on its own"—NVIDIA has been saying this for four or five years. But the phrasing changes every year. Four years ago it was "AI can assist design," three years ago it was "AI can automate certain steps," this year it's "does the work of 8 people in 10 months overnight." Pushing forward a step every year, leaving behind a "still far from the ultimate goal" each time. Looking back three years later, the previous "still far" has been achieved, and the new "still far" is defined at a place all competitors still cannot reach.

What NVIDIA has done in the past twelve months is essentially one thing: applying AI to the most valuable, deepest moat segments of the chip industry chain, and then selling these tools layer by layer to the entire industry.

The front end of chip design is being taken over by internal LLMs like Chip Nemo; the mid-design tasks like standard cell library migration and layout optimization are being taken over by NB-Cell and Prefix RL; the entire EDA toolchain is being tied to its own GPUs through the $2 billion deal with Synopsys and Cadence's "exclusively based on Blackwell"; the computational lithography on the manufacturing end is being taken over by cuLitho, which TSMC is already using.

From design to manufacturing, every segment has been re-done by NVIDIA using AI. Every segment ultimately leads to the same conclusion: if you want the fastest tools, you have to buy NVIDIA's cards.

For all opponents who want to build a chip that can defeat Blackwell, the most awkward thing has already happened. The EDA tools needed to design this chip have their fastest versions running on NVIDIA GPUs; the computational lithography needed to manufacture this chip has its fastest algorithm library provided by NVIDIA; the compute power used to train the design AI is still NVIDIA's cards.

The one you want to defeat is now renting you all the tools needed to defeat it. The rent is paid yearly, and the contract price increases annually.

Domande pertinenti

QWhat was the key achievement of NVIDIA's AI tool NB-Cell as described in the article?

ANB-Cell, a reinforcement learning-based program, automated the migration of standard cell libraries, completing in one night on a single GPU what previously took 8 engineers 10 months, with results matching or exceeding human designs in area, power, and latency metrics.

QHow did NVIDIA strengthen its position in the EDA (Electronic Design Automation) industry through its partnership with Synopsys?

ANVIDIA invested $2 billion in Synopsys, integrating its accelerated computing stack into Synopsys's entire EDA workflow, and later collaborated with Cadence and Siemens to develop AI-driven chip design tools primarily optimized for NVIDIA GPUs, making NVIDIA hardware essential for the fastest EDA performance.

QWhat is the financial and operational situation of China's domestic GPU companies like Moore Threads and Biren as highlighted in the article?

AChina's GPU 'four dragons'—Moore Threads, Biren,沐曦(Muxi), and燧原(Suray)—face significant losses: Moore Threads had a cumulative net loss of RMB 5 billion over three years, while Biren lost over RMB 6.3 billion in three and a half years, with R&D costs far exceeding revenues and reliance on foreign EDA tools under export control risks.

QWhat broader strategy is NVIDIA employing across the chip design and manufacturing chain according to the article?

ANVIDIA is using AI to optimize every critical stage of the chip industry—from front-end design with tools like Chip Nemo, to mid-design automation with NB-Cell and Prefix RL, EDA tool integration via partnerships, and manufacturing with cuLitho for computational lithography—ensuring that the fastest tools require NVIDIA GPUs, creating a self-reinforcing ecosystem.

QWhat paradox does the article highlight for competitors like AMD or Intel trying to challenge NVIDIA's dominance?

ACompetitors must use EDA tools and design workflows that are fastest on NVIDIA GPUs (e.g., Cadence's Millennium M2000 exclusively on Blackwell) to design chips intended to outperform NVIDIA, meaning they either accept slower development cycles or buy NVIDIA hardware to compete against NVIDIA, embedding dependency into their own R&D process.

Letture associate

Silicon Valley 'Startup Guru' Steve Hoffman: Web3 + AI Could Be a Trap

Silicon Valley investor and "Godfather of Startups" Steve Hoffman warns that combining Web3 with AI is likely a trap, not a promising venture. In an interview, Hoffman argues that while AI is a foundational technology touching all industries, Web3 adds complexity, friction, and regulatory risk without solving mainstream consumer or business needs. He advises founders to focus on deep, specialized applications where startups can out-iterate giants, rather than on generic features easily replicated by large tech companies. Hoffman observes that Silicon Valley will lead foundational AI research, while China excels at rapid, large-scale application and commercialization, particularly in robotics. He stresses that AI-driven autonomous agents capable of collaborative, multi-step tasks are 2-4 years away, which will cause significant job displacement. The solution is not to slow AI but to redesign business models around human-AI collaboration and reform social systems like education and retraining. For startups, Hoffman recommends focusing on vertical, expertise-heavy domains to build defensibility. He sees major opportunities in AI fraud detection and cybersecurity. Key founder mindsets include systemic thinking over feature-focus, relentless customer centricity, building adaptive teams, and deeply understanding AI's capabilities and limits. Hoffman is also leading a non-profit initiative to establish university centers aimed at training future leaders in responsible, human-value-aligned AI innovation.

marsbit1 h fa

Silicon Valley 'Startup Guru' Steve Hoffman: Web3 + AI Could Be a Trap

marsbit1 h fa

Token Inefficient, Economy Tokenless

The article "Tokens Aren't Economical, Economics Aren't Tokenized" analyzes a pivotal shift in the AI industry from a technology-driven narrative to one dominated by capital efficiency. It highlights two concurrent trends: a severe capital shortage due to the exorbitant and recurring costs of compute (e.g., OpenAI's high burn rate) and a wave of corporate spin-offs where major tech companies are separating their AI units (like Kuaishou's Kling and Baidu's Kunlunxin). The core argument is that AI's "anti-internet" business model, where user growth increases costs rather than profits, has created a disconnect between high valuations and actual cash flow. Spin-offs address this by allowing AI assets to be valued independently. Within a parent company, they are seen as cost centers, but as standalone entities, they are priced based on their growth potential and scarcity in the primary market, leading to massive valuation premiums (e.g., Kling's estimated value tripling post-spin-off). The industry is at an inflection point, moving from "model worship" to "value realization." The competition is evolving from a pure compute (GPU) race to a broader focus on systemic efficiency and full-stack engineering (involving CPUs and orchestration) to achieve viable commercialization. The year 2026 is framed as a critical moment where the industry must definitively answer how to economically translate AI capability into tangible business value, reshaping the sector's future power structure.

marsbit1 h fa

Token Inefficient, Economy Tokenless

marsbit1 h fa

Crossing the 'Memory Wall': The Wafer-Level Revolution and Computing Power Routes in the AI Inference Era

In 2026, a historic shift occurred in AI as major cloud providers' inference spending surpassed training spending for the first time, signaling a move from "building large models" to "using large models." This shifts the core challenge from computing power to the "memory wall"—the bottleneck of data movement (model weights, activations, KV Cache) between external DRAM and processors, where energy and latency from data transfer far exceed computation itself. Companies like Nvidia face GPU idle time due to bandwidth limits. In contrast, Cerebras Systems adopts a radical "wafer-scale" approach with its Wafer-Scale Engine (WSE). Instead of cutting a silicon wafer into many chips, Cerebras uses almost the entire wafer as one massive chip (WSE-3). This design provides 44GB of on-chip SRAM, delivering memory bandwidth thousands of times higher than traditional HBM (e.g., 21 PB/s vs. Nvidia B200). For LLM inference, weights are streamed layer-by-layer from external MemoryX storage to the chip, avoiding HBM bottlenecks. This results in token generation speeds 1.5–5 times faster than Nvidia's B200 in some models and significant advantages in first-token latency and long-context tasks. Additionally, Cerebras's architecture offers much lower interconnect power consumption (0.15 pJ/bit vs. GPU's ~10 pJ/bit). However, Cerebras faces challenges: SRAM scaling has slowed with advanced nodes, limiting future capacity gains; the chip requires specialized liquid cooling and custom software stacks; and its external I/O bandwidth (150 GB/s) is low compared to NVLink, hindering multi-system scaling for very large models. Competition is intensifying. Major players are pursuing three paths: 1) Developing proprietary inference ASICs (e.g., Google TPU, Microsoft Maia), 2) Leveraging advanced packaging (e.g., TSMC's SoW) to democratize wafer-scale-like integration, potentially eroding Cerebras's process advantage within a few years, and 3) Exploring optical interconnects for ultimate bandwidth. Commercially, Cerebras is transitioning from a hardware vendor to a service provider, facing the immense challenge of building high-power, specialized data centers to meet large contracts (e.g., 250MW/year from 2026–2028). In conclusion, the AI inference era presents a fundamental architectural trade-off. Cerebras opts for extreme physical optimization for low-latency, single-task performance, while Nvidia prioritizes versatility and massive cluster throughput. The path forward remains uncertain, with technology and business models still evolving in the race toward advanced AI.

marsbit1 h fa

Crossing the 'Memory Wall': The Wafer-Level Revolution and Computing Power Routes in the AI Inference Era

marsbit1 h fa

Has Bitcoin's 'Rebound Ended', Officially Entering the Late Bear Market Phase?

**Title: Has Bitcoin's Rebound Ended, Entering the Late Bear Market Phase?** **Summary:** Bitcoin's price has declined by 13% this week, signaling a potential return to late-stage bear market conditions. The price fell to around $67k, positioned between the Realized Price and Realized Cap Weighted Average. For the first time since early 2022, the Short-Term Holder cost basis has dropped below this key average, confirming a hallmark of late-cycle bear markets. Profitability metrics have collapsed sharply. The 7-day average of the Realized Profit/Loss ratio plummeted from a local high of 3.16 to 0.29, mirroring the February panic sell-off. Critically, the 90-day average never breached the threshold of 2, indicating the recent rally to $82k was a bear market bounce, not a structural shift. Realized losses surged to $1.35 billion daily, with $770 million coming from Long-Term Holders selling at a loss. This accelerating redistribution of supply from weak to strong hands is a necessary but ongoing process for a market bottom. The rally stalled almost precisely at the aggregate cost basis (~$83k) of US spot Bitcoin ETF investors, turning that level into strong resistance and leaving the average ETF holder underwater again. Spot market flows have turned decisively negative, showing sellers are dominating order books despite the price drop. While a significant futures long liquidation event cleared over $400 million in leverage, providing a potential reset, sustained spot demand is yet to materialize. Options markets continue to price in higher future volatility (Implied Volatility) than recent price action (Realized Volatility) has shown, with a persistent skew towards put options, indicating ongoing demand for downside protection. In conclusion, multiple metrics point to a fragile market structure. Resistance at the ETF cost basis, accelerating realized losses, dominant spot selling, and cautious options pricing all suggest the bear market trend persists. A sustainable recovery likely requires a resurgence of spot demand, ETF holders returning to profit, and a clear reduction in selling pressure.

marsbit1 h fa

Has Bitcoin's 'Rebound Ended', Officially Entering the Late Bear Market Phase?

marsbit1 h fa

TechFlow Intelligence Agency: Anthropic Calls for Global Pause in AI Development While Preparing for Trillion-Dollar IPO; SpaceX IPO Roadshow Heats Up, But S&P 500 Rejects Fast-Track Inclusion

In today's TechFlow Intelligence Briefing, several major tech stories highlight a growing theme of trust and credibility gaps across AI, crypto, and finance. AI company Anthropic has publicly called for a global pause in AI development, citing risks from Claude's "recursive self-improvement." Ironically, this coincides with reports the company is preparing for a massive IPO targeting a near $1 trillion valuation. This perceived hypocrisy, coupled with widespread user complaints about Claude's declining performance, is sparking debate over whether the safety warning is genuine or a competitive tactic. Meanwhile, in a substantive security move, Anthropic open-sourced a framework for AI-powered vulnerability discovery. In the crypto market, Bitcoin's price drop below $61,000 triggered over $1.16 billion in liquidations, flipping the market into a state where more BTC is held at a loss than at a profit, a historical bearish signal. On the corporate front, SpaceX's highly anticipated IPO is generating immense Wall Street excitement, with Goldman Sachs projecting 100x revenue growth by 2030. However, the S&P 500 has refused to fast-track the company's inclusion post-IPO, potentially limiting immediate institutional demand. Separately, ByteDance's AI app Doubao lost over 6 million monthly active users after introducing a subscription model, highlighting the challenges of AI monetization. Other notable developments include Nvidia certifying HBM4 memory from Samsung, SK Hynix, and Micron; Cloudflare's acquisition of front-end tooling company VoidZero; and its CEO warning that bot traffic now exceeds human traffic online. The underlying narrative connects these events: a trust crisis. From AI firms' contradictory actions and crypto volatility to the clash between SpaceX's hyped narrative and institutional rules, a pattern is emerging where stated intentions and actual practices are increasingly misaligned.

marsbit1 h fa

TechFlow Intelligence Agency: Anthropic Calls for Global Pause in AI Development While Preparing for Trillion-Dollar IPO; SpaceX IPO Roadshow Heats Up, But S&P 500 Rejects Fast-Track Inclusion

marsbit1 h fa

Trading

Spot
Futures

Articoli Popolari

Come comprare ADA

Benvenuto in HTX.com! Abbiamo reso l'acquisto di Cardano (ADA) semplice e conveniente. Segui la nostra guida passo passo per intraprendere il tuo viaggio nel mondo delle criptovalute.Step 1: Crea il tuo Account HTXUsa la tua email o numero di telefono per registrarti il tuo account gratuito su HTX. Vivi un'esperienza facile e sblocca tutte le funzionalità,Crea il mio accountStep 2: Vai in Acquista crypto e seleziona il tuo metodo di pagamentoCarta di credito/debito: utilizza la tua Visa o Mastercard per acquistare immediatamente CardanoADA.Bilancio: Usa i fondi dal bilancio del tuo account HTX per fare trading senza problemi.Terze parti: abbiamo aggiunto metodi di pagamento molto utilizzati come Google Pay e Apple Pay per maggiore comodità.P2P: Fai trading direttamente con altri utenti HTX.Over-the-Counter (OTC): Offriamo servizi su misura e tassi di cambio competitivi per i trader.Step 3: Conserva Cardano (ADA)Dopo aver acquistato Cardano (ADA), conserva nel tuo account HTX. In alternativa, puoi inviare tramite trasferimento blockchain o scambiare per altre criptovalute.Step 4: Scambia Cardano (ADA)Scambia facilmente Cardano (ADA) nel mercato spot di HTX. Accedi al tuo account, seleziona la tua coppia di trading, esegui le tue operazioni e monitora in tempo reale. Offriamo un'esperienza user-friendly sia per chi ha appena iniziato che per i trader più esperti.

1.3k Totale visualizzazioniPubblicato il 2024.12.10Aggiornato il 2026.06.02

Come comprare ADA

Discussioni

Benvenuto nella Community HTX. Qui puoi rimanere informato sugli ultimi sviluppi della piattaforma e accedere ad approfondimenti esperti sul mercato. Le opinioni degli utenti sul prezzo di ADA ADA sono presentate come di seguito.

活动图片