"Nvidia Concept Stock" CoreWeave Co-founder Interview: AI Demand Seems to Intensify Daily

marsbit | Published on 2026-06-18 | Last updated on 2026-06-18

Abstract

An interview with CoreWeave co-founder Brannin McBee and VP Nick Robbins highlights the continuing intensification of AI demand, driven by agents, reasoning, and enterprise adoption. They note a structural shift: bottlenecks are expanding beyond GPU availability to powered data center shells, CPUs, storage, electrical work, and supply chain execution. CoreWeave, which serves top AI labs and cloud giants, emphasizes its customer-driven model for anticipating and building for next-generation needs. The company has redesigned its data centers to accommodate more CPUs and storage alongside GPUs, anticipating increased demand for Nvidia's Vera CPUs and Vera Rubin servers. While GPU procurement remains critical, competition now hinges on comprehensive engineering and delivery capability. CoreWeave leverages its execution track record and performance to compete with hyperscalers and other neocloud providers. The current primary constraint is the availability of powered data center shells, not components such as HBM memory; component cost increases are passed on to customers under CoreWeave's contract model. Vera Rubin deployment is expected to ramp significantly through 2027, following a pattern similar to previous Nvidia platforms.

Editor's Note: This interview offers a window into the AI computing power cycle: demand has not cooled down following the previous wave of GPU procurement frenzy; instead, it is being continuously driven higher by agents, inference, and enterprise-grade AI applications.

This article features an interview with Brannin McBee, Co-founder and Chief Development Officer of CoreWeave, and Nick Robbins, Vice President of Corporate Development and Investor Relations, discussing the current state of AI demand and the neocloud market. The core message from CoreWeave's executives is direct: AI demand seems to be intensifying daily in new ways, and the real bottleneck is shifting from "having GPUs" to more complex infrastructure issues, including powered data center shells, CPUs, storage, electricians, supply chain execution capabilities, and how much customers are willing to pay for next-generation computing power.

CoreWeave's uniqueness lies in its position in the middle of the AI infrastructure chain: it serves leading clients like OpenAI, Anthropic, Meta, Google, Microsoft, and Nvidia, while also directly sensing demand changes from research labs, enterprise customers, and hyperscale cloud providers. Therefore, what it observes is not just "GPU shortages," but structural changes in AI workloads themselves. With the rise of agentic AI and reasoning models, computing power demand is no longer solely centered around GPUs; the importance of CPUs and storage is also increasing. New-generation data center designs must reserve space for Vera CPUs, Vera Rubin servers, and more storage.

This also explains why AI infrastructure competition is shifting from mere chip procurement to more comprehensive engineering delivery capabilities. Those who can secure powered data centers, deploy servers, streamline supply chains, and optimize cost per token faster are closer to the core of this AI capital expenditure cycle. CoreWeave repeatedly emphasizes being "customer-driven," which reflects a larger judgment: AI cloud providers are no longer just selling computing power; they are proactively reconstructing the next-generation AI factory based on the roadmaps of their most advanced clients.

For investors and industry observers, the most notable aspect of this interview is not any single data point, but the direction of change in AI infrastructure demand: GPUs remain important, but bottlenecks are spreading; Nvidia is still the core, but CPUs, HBM, storage, and data center power supply capabilities are becoming new variables; AI demand continues to grow, but future success may depend on who can deliver complex infrastructure consistently, stably, and at scale.

The following is the original text:

CoreWeave is regarded as an innovative early market leader in the neocloud (new type of cloud services) domain.

It is the only cloud service provider to have received the highest "Platinum Rating" from AI research firm SemiAnalysis. Founded in 2017, CoreWeave provides large-scale GPU computing power to startups and large enterprises.

Key Context recently interviewed Brannin McBee, Co-founder and Chief Development Officer of CoreWeave, and Nick Robbins, Vice President of Corporate Development and Investor Relations, to discuss the current state of AI demand and the neocloud market.

The following are edited highlights from the conversation:

AI Demand Continues to Intensify

Tae: When did the wave of demand for agentic AI begin to surge?

Brannin: We saw the real beginnings in the fourth quarter of last year. At that time, we were engaging in engineering-level discussions with clients about the products they expected to bring to market in the first quarter of this year.

This has always been a crucial perspective for how we view customer demand. We have a deeply interconnected engineering relationship with our customers. It is this relationship that allows us to see trends ahead of time, rather than reacting after changes occur.

If you look at the product landscape of the AI market, I'd say the first quarter was a massive inflection point for inference and AI consumption, and that acceleration is still continuing now.

Tae: What is the current state of AI demand? Compared with a few months ago, has there been any sign of a slowdown in recent weeks?

Nick: It seems to be intensifying in new ways every day.

Tae: Could you discuss the rising trend of CPU demand relative to GPUs in the agentic AI wave? Would you deploy racks of Vera CPUs alongside Nvidia GPU servers?

Brannin: CoreWeave has been running CPUs since 2023. We've always had a full cloud product offering. So the question isn't whether we just started adding CPUs, but rather, what do customers actually need? And is this demand rising in relative terms? The answer is, very clearly, yes.

As agent and reasoning capabilities truly emerge within models, storage demands are also increasing compared to previous generations. I believe this trend will continue.

Nick: To your question, the answer is yes. You will absolutely see lots of Vera CPUs deployed alongside lots of Vera Rubin servers. Last year, we fundamentally redesigned our base data center solution to make room for more storage and more CPUs to be deployed alongside GPUs.

We did this because we are in a very unique position within the entire ecosystem. We are the only independent cloud provider serving all the most advanced technology users. No other independent AI cloud provider can say that Anthropic, OpenAI, Meta, Google, Microsoft, Nvidia, etc., are all their customers.

This creates a beneficial flywheel, or positive feedback loop, for our business: we understand where customers are taking the technology and plan accordingly.

The Bottleneck Is No Longer Just GPUs

Tae: Will you primarily use Nvidia Vera CPUs in the future?

Nick: That depends on the specific workload. Our actions are driven by customer demand. We do expect to be an early and significant adopter of Vera CPUs, which we have disclosed. Currently, our fleet is actually predominantly AMD, but this may change over time based on customer needs. There is very strong customer interest in Vera CPUs.

Brannin: This is also a good reminder to talk about how our contracts work. As you probably know, over 98% of our revenue is contract-driven. We are not guessing what infrastructure customers want. Customers tell us very explicitly what configurations they need. Everything is customer-driven. It is the customers defining what we build.

Tae: Let's talk about the competitive landscape. How do you enter the market and compete against neoclouds like SpaceX, Nebius, Oracle, and hyperscalers like Azure, AWS, and Google?

Brannin: In terms of differentiation, I prefer to look at it from a third-party validation standpoint. Nine out of the top ten AI labs globally, excluding China, are using our platform. SemiAnalysis has consistently ranked us alone at the highest level for performance. I don't think the GPU allocation we receive is due to any personal friendship with Jensen.

This indicates that suppliers have deep confidence in our execution track record and engineering capabilities, believing we can best showcase their product capabilities globally.

Nick: We win hyperscaler customers because we are exceptionally good at execution. We can build these systems incredibly fast, and they run exceptionally well. We win research lab customers because we provide the highest-performing versions of the technology and perform best in per-token efficiency.

We win enterprise customers because the infrastructure simply runs well, and we've built an excellent, best-in-class orchestration layer, which is also recognized by ratings such as the Platinum Rating.

But, increasingly important, among AI cloud providers we have built out the most mature layer of capabilities covering inference and development tools, helping enterprises actually put AI into production.

This means we are building and delivering products that ultimately help enterprises with relatively lower technical maturity transform data into models, and models into agents that can run internally, while we can cross-sell CoreWeave cloud services in the process.

Tae: What is the current bottleneck? Is it data center powered shells? GPUs? Or electricians?

Brannin: It's powered shells. More specifically, the components inside these shells. You specifically mentioned electricians, which is absolutely correct. It's a complex area.

But importantly, we already have 49 of these sites live and operational. We are not pinning our hopes on one or two sites. We've done this 49 times.

This is a very deep execution track record.

It also means we have accumulated a vast amount of knowledge on how to handle supply chain issues, which suppliers in that supply chain are good to work with, and which ones aren't.

Tae: What can you share about the cost and shortage of HBM memory? How are you managing this? Do customers need to bear the increased costs?

Nick: The answer is yes. Our business model is designed to lock in the GPU price we charge customers at the same time we sign GPU purchase orders and determine our cost. More broadly, that's the server price, and the server price obviously includes HBM cost.

This is how we isolate ourselves from day-to-day price fluctuations.

If our component costs rise in the next transaction, we reflect that in the price we believe we can charge customers, thereby protecting our margins. We are very well protected in passing these costs on to customers. This is something we watch very closely.

Currently, sourcing components is not the biggest bottleneck. The biggest bottleneck is the powered shell. But that answer may shift back and forth at some point in the future.
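The pricing mechanism Nick describes can be illustrated with a small arithmetic sketch. It is only an illustration under stated assumptions: the `Contract` class, the `sign_contract` helper, the 25% target margin, and the cost figures are all hypothetical and do not come from the interview or from CoreWeave's disclosures. The sketch shows only the general idea that the customer price is fixed at the moment the server cost is fixed, so a later rise in component costs affects the next contract rather than the margin on an existing one.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Contract:
    """Hypothetical fixed-price contract signed alongside a server purchase order."""

    server_cost: float    # cost locked in at signing; assumed to include HBM
    target_margin: float  # fraction of the customer price kept as margin (assumed)

    @property
    def customer_price(self) -> float:
        # The price is derived once, from the cost known at signing time.
        return self.server_cost / (1.0 - self.target_margin)

    def realized_margin(self) -> float:
        # Margin as a fraction of the customer price.
        return (self.customer_price - self.server_cost) / self.customer_price


def sign_contract(server_cost_at_signing: float, target_margin: float) -> Contract:
    """Lock in cost and price together (hypothetical helper)."""
    return Contract(server_cost_at_signing, target_margin)


# First deal: server cost is 100 units when the purchase order is signed.
first = sign_contract(100.0, 0.25)

# Component costs rise before the next deal; the new cost is priced into
# the new contract, while the first contract's terms remain unchanged.
second = sign_contract(120.0, 0.25)

print(f"first:  price={first.customer_price:.2f}, margin={first.realized_margin():.2%}")
print(f"second: price={second.customer_price:.2f}, margin={second.realized_margin():.2%}")
```

In this toy model the margin stays the same across both contracts because each price is set from the cost in effect at signing; whether customers accept the higher price in practice is a commercial question the sketch does not address.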

Tae: How do you expect the deployment ramp for Vera Rubin to unfold? What will the second half of this year look like?

Nick: We are clearly the first company globally to have spun up and fully validated VR, the Vera Rubin rack. We did the same with GB200 and GB300 last year. I expect VR to start appearing later this year.

I expect the really massive, very strong deployment ramp to run through all of 2027. This pace is similar to GB: GB started appearing in 2025, but the truly massive ramp was actually throughout 2026. That is to say, quite a bit was deployed by the end of last year, but this year is the year of truly massive GB deployment.

I expect VR to follow a very similar cadence over the next 12 to 18 months.
