"Nvidia Concept Stock" CoreWeave Co-founder Interview: AI Demand Seems to Intensify Daily

marsbitPublished on 2026-06-18Last updated on 2026-06-18

Abstract

Interview with CoreWeave co-founder Brannin McBee and VP Nick Robbins highlights the ongoing intensification of AI demand, driven by agents, reasoning, and enterprise adoption. They note a structural shift: bottlenecks are expanding beyond GPU availability to powered data center shells, CPU, storage, electrical work, and supply chain execution. CoreWeave, serving top AI labs and cloud giants, emphasizes its customer-driven model to anticipate and build for next-gen needs. The company has redesigned data centers to accommodate more CPU and storage alongside GPUs, anticipating increased demand for Nvidia's Vera CPU and Vera Rubin servers. While GPU procurement remains critical, competition now hinges on comprehensive engineering and delivery capability. CoreWeave leverages its execution track record and performance to compete with hyperscalers and other neocloud providers. Current primary constraint is powered data center shell availability, not components like HBM memory, though cost fluctuations are passed to customers under their contract model. Vera Rubin deployment is expected to ramp significantly through 2027, following a pattern similar to previous Nvidia platforms.

Editor's Note: This interview offers a window into the AI computing power cycle: demand has not cooled down following the previous wave of GPU procurement frenzy; instead, it is being continuously driven higher by agents, inference, and enterprise-grade AI applications.

This article features an interview with Brannin McBee, Co-founder and Chief Development Officer of CoreWeave, and Nick Robbins, Vice President of Corporate Development and Investor Relations, discussing the current state of AI demand and the neocloud market. The core message from CoreWeave executives is direct—AI demand seems to be intensifying daily in new ways, and the real bottleneck is shifting from "having GPUs" to more complex infrastructure issues: data center powered shells, CPUs, storage, electricians, supply chain execution capabilities, and how much customers are willing to pay for next-generation computing power.

CoreWeave's uniqueness lies in its position in the middle of the AI infrastructure chain: it serves leading clients like OpenAI, Anthropic, Meta, Google, Microsoft, and Nvidia, while also directly sensing demand changes from research labs, enterprise customers, and hyperscale cloud providers. Therefore, what it observes is not just "GPU shortages," but structural changes in AI workloads themselves. With the rise of agentic AI and reasoning models, computing power demand is no longer solely centered around GPUs; the importance of CPUs and storage is also increasing. New-generation data center designs must reserve space for Vera CPUs, Vera Rubin servers, and more storage.

This also explains why AI infrastructure competition is shifting from mere chip procurement to more comprehensive engineering delivery capabilities. Those who can secure powered data centers, deploy servers, streamline supply chains, and optimize cost per token faster are closer to the core of this round of AI capital expenditure cycles. CoreWeave repeatedly emphasizes being "customer-driven," which reflects a larger judgment: AI cloud providers are no longer just selling computing power; they are proactively reconstructing the next-generation AI factory based on the roadmaps of their most advanced clients.

For investors and industry observers, the most notable aspect of this interview is not any single data point, but the direction of change in AI infrastructure demand: GPUs remain important, but bottlenecks are spreading; Nvidia is still the core, but CPUs, HBM, storage, and data center power supply capabilities are becoming new variables; AI demand continues to grow, but future success may depend on who can deliver complex infrastructure consistently, stably, and at scale.

The following is the original text:

CoreWeave is regarded as an innovative early market leader in the neocloud (new type of cloud services) domain.

It is the only cloud service provider to have received the highest "Platinum Rating" from AI research firm SemiAnalysis. Founded in 2017, CoreWeave provides large-scale GPU computing power to startups and large enterprises.

Key Context recently interviewed Brannin McBee, Co-founder and Chief Development Officer of CoreWeave, and Nick Robbins, Vice President of Corporate Development and Investor Relations, to discuss the current state of AI demand and the neocloud market.

The following are edited highlights from the conversation:

AI Demand Continues to Intensify

Tae: When did the wave of demand for agentic AI begin to surge?

Brannin: We saw the real beginnings in the fourth quarter of last year. At that time, we were engaging in engineering-level discussions with clients about the products they expected to bring to market in the first quarter of this year.

This has always been a crucial perspective for how we view customer demand. We have a deeply interconnected engineering relationship with our customers. It is this relationship that allows us to see trends ahead of time, rather than reacting after changes occur.

If you look at the product landscape of the AI market, I'd say the first quarter was the moment of a massive inflection point for inference and AI consumption, and this acceleration is still continuing now.

Tae: What is the current state of AI demand? Compared to a few months ago, has there been absolutely no sign of slowing down in recent weeks?

Nick: It seems to be intensifying in new ways every day.

Tae: Could you discuss the rising trend of CPU demand relative to GPUs in the agentic AI wave? Would you deploy racks of Vera CPUs alongside Nvidia GPU servers?

Brannin: CoreWeave has been running CPUs since 2023. We've always had a full cloud product offering. So the question isn't whether we just started adding CPUs, but rather, what do customers actually need? And is this demand rising in relative terms? The answer is, very clearly, yes.

As agent and reasoning capabilities truly emerge within models, storage demands are also increasing compared to previous generations. I believe this trend will continue.

Nick: To your question, the answer is yes. You will absolutely see lots of Vera CPUs deployed alongside lots of Vera Rubin servers. Last year, we fundamentally redesigned our base data center solution to make room for more storage and more CPUs to be deployed alongside GPUs.

We did this because we are in a very unique position within the entire ecosystem. We are the only independent cloud provider serving all the most advanced technology users. No other independent AI cloud provider can say that Anthropic, OpenAI, Meta, Google, Microsoft, Nvidia, etc., are all their customers.

This creates a beneficial flywheel, or positive feedback loop, for our business: we understand where customers are taking the technology and plan accordingly.

The Bottleneck Is No Longer Just GPUs

Tae: Will you primarily use Nvidia Vera CPUs in the future?

Nick: That depends on the specific workload. Our actions are driven by customer demand. We do expect to be an early and significant adopter of Vera CPUs, which we have disclosed. Currently, our fleet is actually predominantly AMD, but this may change over time based on customer needs. There is very strong customer interest in Vera CPUs.

Brannin: This is also a good reminder to talk about how our contracts work. As you probably know, over 98% of our revenue is contract-driven. We are not guessing what infrastructure customers want. Customers tell us very explicitly what configurations they need. Everything is customer-driven. It is the customers defining what we build.

Tae: Let's talk about the competitive landscape. How do you enter the market and compete against neoclouds like SpaceX, Nebius, Oracle, and hyperscalers like Azure, AWS, and Google?

Brannin: In terms of differentiation, I prefer to look at it from a third-party validation standpoint. Nine out of the top ten AI labs globally, excluding China, are using our platform. SemiAnalysis has consistently ranked us alone at the highest level for performance. I don't think the GPU allocation we receive is due to any personal friendship with Jensen.

This indicates that suppliers have deep confidence in our execution track record and engineering capabilities, believing we can best showcase their product capabilities globally.

Nick: We win hyperscaler customers because we are exceptionally good at execution. We can build these systems incredibly fast, and they run exceptionally well. We win research lab customers because we provide the highest-performing versions of the technology and perform best in per-token efficiency.

We win enterprise customers because the infrastructure simply runs well, and we've built a very excellent, best-in-class orchestration layer, which is also recognized by things like the Platinum Rating.

But increasingly importantly, among AI cloud providers, we have built out the most mature layer of capabilities covering inference and development tools, helping enterprises actually put AI into production.

This means we are building and delivering products that ultimately help enterprises with relatively lower technical maturity to transform data into models, and models into agents that can run internally, while we can cross-sell CoreWeave cloud services in the process.

Tae: What is the current bottleneck? Is it data center powered shells? GPUs? Or electricians?

Brannin: It's powered shells. More specifically, the components inside these shells. You specifically mentioned electricians, which is absolutely correct. It's a complex area.

But importantly, we already have 49 of these sites live and operational. We are not pinning our hopes on one or two sites. We've done this 49 times.

This is a very deep execution track record.

It also means we have accumulated a vast amount of knowledge on how to handle supply chain issues, which suppliers in that supply chain are good to work with, and which ones aren't.

Tae: What can you share about the cost and shortage of HBM memory? How are you managing this? Do customers need to bear the increased costs?

Nick: The answer is yes. Our business model is designed to lock in the GPU price we charge customers at the same time we sign GPU purchase orders and determine our cost. More broadly, that's the server price, and the server price obviously includes HBM cost.

This is how we isolate ourselves from day-to-day price fluctuations.

If our component costs rise in the next transaction, we reflect that in the price we believe we can charge customers, thereby protecting our margins. We are very well protected in passing these costs on to customers. This is something we watch very closely.

Currently, sourcing components is not the biggest bottleneck. The biggest bottleneck is the powered shell. But that answer may shift back and forth at some point in the future.

Tae: How do you expect the deployment ramp for Vera Rubin to unfold? What will the second half of this year look like?

Nick: We are clearly the first company globally to have spun up and fully validated VR, the Vera Rubin rack. We did the same with GB200, GB300 last year. I expect VR to start appearing later this year.

I expect the really massive, very strong deployment ramp to run through all of 2027. This pace is similar to GB: GB started appearing in 2025, but the truly massive ramp was actually throughout 2026. That is to say, quite a bit was deployed by the end of last year, but this year is the year of truly massive GB deployment.

I expect VR to follow a very similar cadence over the next 12 to 18 months.

Trending Cryptos

Related Questions

QAccording to CoreWeave executives, how is AI demand changing currently?

AAI demand seems to be intensifying in new ways every day, with a significant inflection point in Q1 for inference and AI consumption, and this acceleration continues.

QWhat is becoming a new, more complex bottleneck in AI infrastructure, beyond just GPU availability?

AThe main bottleneck is now powered data center shells (and their internal components like electricity supply), alongside a shift where CPU and storage demands are rising in relative importance compared to GPUs.

QWhy does CoreWeave believe it has a unique position in the AI ecosystem?

AIt is the only independent AI cloud serving all the most advanced users, including OpenAI, Anthropic, Meta, Google, Microsoft, and Nvidia, which creates a beneficial feedback loop for understanding and planning for client needs.

QHow does CoreWeave's business model protect it from fluctuations in component costs like HBM memory?

ATheir business model locks in the GPU/server prices charged to clients at the same time they place procurement orders, isolating them from daily price volatility and allowing them to pass on cost increases to protect their margins.

QWhat is CoreWeave's expected deployment timeline for the Nvidia Vera Rubin (VR) platform?

AThey expect Vera Rubin to start appearing later this year, with a very robust, large-scale deployment ramp occurring throughout 2027, following a similar rhythm to the GB200/GB300 rollout.

Related Reads

Matrixdock Featured Again in SBMA’s 《Crucible》: Discussing How Tokenisation Enhances Efficiency in the Precious Metals Market

Matrixdock's research article, titled "Why Tokenisation Matters for the Bullion Industry and How Carrying Costs Fit In," has been featured again in the SBMA's industry publication *Crucible*. Authored by Matrixdock lead Eva Meng, the piece examines how tokenisation enhances the efficiency and utility of the precious metals market. The article argues that tokenisation builds upon the accessibility improvements brought by gold ETFs, not by redefining gold's value but by enabling it to function within digital finance. It extends gold's role beyond a portfolio holding, potentially facilitating instant settlement, digital collateral, and operation in 24/7 markets. A key focus is transparently handling the unavoidable carrying costs (storage, insurance) of physical assets like gold and silver. Matrixdock introduces the Fungible Reserve Standard (FRS) framework, based on an "Economic Purity Principle," which aims to reflect these real-world economic costs clearly within the token mechanism, rather than bundling them opaquely. The platform's practical applications are highlighted, including its gold token XAUm and its silver token XAGm, the first built on the FRS framework. As the tokenised gold market surpassed $6 billion in February 2026, the industry's focus is shifting from initial proofs of reserves to broader concerns of market efficiency and capital utilization. Tokenisation is positioning gold and other precious metals to become active components within the evolving digital financial system.

marsbit14m ago

Matrixdock Featured Again in SBMA’s 《Crucible》: Discussing How Tokenisation Enhances Efficiency in the Precious Metals Market

marsbit14m ago

New Huo Research: Dense Bottom Fishing in the $60K BTC Range, a 'High Value-for-Money Zone' Sees a Handover Surge

Bitcoin has shown a significant oversold rebound this week, with extreme panic in the crypto market easing. Multiple data points indicate a notable market bottom is forming. On the market front, the net outflow from Bitcoin spot ETFs has continued to shrink, and the negative premium between Coinbase and USDT is steadily correcting. Industry fundamentals suggest the shutdown cost for mainstream miners is concentrated between $30,000 and $50,000, potentially solidifying a阶段性 industry cost floor—a classic signal of market bottoms in previous cycles. Institutional capital is notably positioning against the trend. For instance, Sinohope Group's weekly OTC trading volume surged over 8 times环比, with active platform users doubling, both reaching record highs. This confirms a sharp increase in large capital transaction activity and a spike in off-exchange funding demand. Sinohope Research also observed on-chain data showing that funds from entities with public company attributes and long-term "whale" wallets are actively accumulating Bitcoin around the key $60,000 price level. The research institute has maintained since mid-May that a high-value investment window has reopened, and the market is now undergoing a shift from panic selling to long-term holding. Looking ahead, the core drivers for an upward market move will be liquidity release and macro policy developments. The successful and strong performance of SpaceX's IPO has reignited market optimism, and the massive liquidity frozen during its subscription period is now being unlocked. This substantial capital is expected to seek new value opportunities, potentially flowing into currently undervalued assets like Bitcoin. On the macro and policy front, the tone set in Kevin Warsh's upcoming speech at the FOMC meeting is crucial for near-term monetary policy expectations. Furthermore, the potential passage of the CLARITY Act by late July could significantly boost institutional confidence for capital entry. Considering these bottoming signals alongside favorable liquidity and policy factors, Sinohope Research remains optimistic about the market's subsequent trajectory.

marsbit18m ago

New Huo Research: Dense Bottom Fishing in the $60K BTC Range, a 'High Value-for-Money Zone' Sees a Handover Surge

marsbit18m ago

Uncovering the 'God of Investment Research' Behind Citrini: Perpetual Substack Chart-Topper, a Single Report Evaporated Trillions from US Stocks

Revealing the "Research God" Behind Citrini: A Non-Finance Founder Shaking Up Markets Citrini, an independent research firm consistently ranked #1 on Substack's finance charts with nearly 250,000 subscribers, has gained significant attention in the current bull market. Its founder, James van Geelen, holds dual degrees in biology and psychology from UCLA, with a background as an emergency medical technician and a healthcare entrepreneur before founding Citrini. The firm is known for its deep, narrative-driven analyses focusing on long-term "super trends" like AI, geopolitics, and macro policy. Citrini made headlines in February with its report "The 2028 Global Intelligence Crisis," a thought experiment on AI's potential societal impact. Despite being labeled a scenario analysis, it triggered a widespread sell-off in software and related stocks, briefly wiping hundreds of billions from the US market. Other notable reports include an on-the-ground analysis of the Strait of Hormuz and accurate calls on the copper foil industry's importance for AI/semiconductors. Geelen champions "second-order thinking," focusing on the indirect consequences of events. His investment style is thematic and often contrarian, seeking opportunities others miss. Citrini operates with a founder-driven, anonymous elite team model, recently adding specialists in macroeconomics and semiconductor analysis. The firm also manages a model portfolio, Citrindex, which has reportedly achieved over 200% cumulative returns.

marsbit22m ago

Uncovering the 'God of Investment Research' Behind Citrini: Perpetual Substack Chart-Topper, a Single Report Evaporated Trillions from US Stocks

marsbit22m ago

Ethereum Q1 2026 Review: On-chain Activity Hits New Highs, Tokenized Assets Lead the Industry

Ethereum Q1 2026 Review: Chain Activity Hits Record High, Tokenized Assets Lead the Industry In Q1 2026, the Ethereum ecosystem displayed a dual narrative: record-high on-chain activity coincided with declining USD-denominated metrics. User adoption surged, with monthly active addresses reaching 13.2 million (up 53.5% QoQ) and layer-1 transactions hitting 200.4 million (up 38% QoQ). However, the total value locked (TVL) decreased 11% to $316.2B, and protocol revenue fell 16.9% to $2.0B, largely due to a broader crypto market downturn. Tokenized assets emerged as a key strength. Their total market cap reached $203.4B, with significant growth in tokenized funds (+73.1% YoY) and commodities (+325.9% YoY). Ethereum dominated cross-chain comparisons, holding over 61% of stablecoin value and 84% of tokenized commodities value among top chains. A major development was the impact of network scaling. The "Blob" upgrade significantly increased data capacity, causing average transaction fees on layer-1 to plummet 47.9% QoQ despite higher usage. This demonstrates the "Jevons Paradox" in action: cheaper block space stimulates demand. The report highlights Ethereum's established position as the primary settlement layer for institutional tokenization, evidenced by new fund launches from giants like BlackRock and JPMorgan in May. Analysts draw parallels between Ethereum's current stage and the internet in the mid-1990s, suggesting its open, neutral network is poised to become the foundational infrastructure for global finance, outcompeting closed, private alternatives.

Foresight News26m ago

Ethereum Q1 2026 Review: On-chain Activity Hits New Highs, Tokenized Assets Lead the Industry

Foresight News26m ago

Trading

Spot
Futures

Hot Articles

Discussions

Welcome to the HTX Community. Here, you can stay informed about the latest platform developments and gain access to professional market insights. Users' opinions on the price of AI (AI) are presented below.

活动图片