Interview with CoreWeave Co-founders: AI Demand Seems to 'Intensify' Every Day

marsbitXuất bản vào 2026-06-19Cập nhật gần nhất vào 2026-06-19

Tóm tắt

An Interview with CoreWeave Executives: AI Demand Seems to 'Intensify' Every Day In an interview, CoreWeave executives highlight a structural shift in AI infrastructure demand. While GPU availability remains crucial, the primary bottlenecks are evolving to include powered data center shells, skilled labor (like electricians), and complex supply chain execution. They note that AI demand, particularly for agentic AI and reasoning models, continues to intensify daily, accelerating since Q1 2024. This demand is driving a need for more balanced infrastructure. CoreWeave is redesigning data centers to allocate more space for storage and CPUs alongside GPUs, with significant interest in Nvidia's upcoming Vera CPUs. The company, serving top AI labs and hyperscalers, emphasizes its client-driven model, building precisely to customer specifications. CoreWeave attributes its competitive edge to proven execution, performance, and a mature platform for AI deployment. Pricing is structured to pass component cost increases (e.g., for HBM memory) to customers, protecting margins. Looking ahead, they anticipate Vera Rubin platform deployments to begin meaningfully in late 2025, with a major ramp throughout 2027, mirroring the Blackwell (GB) series rollout pattern. The competition is shifting from merely acquiring chips to holistic engineering and delivery capability.

Original Title: An Interview with CoreWeave Executives: AI Demand Seems to 'Intensify' Every Day

Original Author: Tae Kim

Original Compiler: Peggy, BlockBeats

Editor's Note: This interview provides a window into the AI computing power cycle: demand has not cooled off due to the last round of GPU buying frenzy, but is instead being further driven by agents, inference, and enterprise-level AI applications.

This article interviews CoreWeave Co-founder and Chief Development Officer Brannin McBee, as well as Vice President of Corporate Development and Investor Relations Nick Robbins, discussing the current state of AI demand and the neocloud market. The core message from CoreWeave executives is straightforward—AI demand seems to intensify in new ways every day, and the real bottleneck is shifting from 'having GPUs' to more complex infrastructure issues: data center power shells, CPUs, storage, electricians, supply chain execution capabilities, and how much customers are willing to pay for the next generation of computing power.

CoreWeave's uniqueness lies in its position in the middle of the AI infrastructure chain: it serves top clients like OpenAI, Anthropic, Meta, Google, Microsoft, and Nvidia, while also directly sensing demand changes from research labs, enterprise clients, and hyperscale cloud providers. Therefore, what it sees is not just 'GPU shortages,' but the structural changes happening in AI workloads themselves. With the rise of agentic AI and reasoning models, computing power demand no longer revolves solely around GPUs; the importance of CPUs and storage is also increasing. The design of the next generation of data centers must reserve space for Vera CPUs, Vera Rubin servers, and more storage.

This also explains why the competition in AI infrastructure is shifting from mere chip procurement to more comprehensive engineering delivery capabilities. Whoever can acquire powered data centers, deploy servers, streamline supply chains, and optimize per-token costs faster will be closer to the core of this round of AI capital expenditure cycles. CoreWeave repeatedly emphasizes being 'customer-driven,' which actually reflects a bigger judgment: AI cloud providers are no longer just selling computing power, but are proactively reconstructing the next generation of AI factories based on the roadmaps of the most cutting-edge clients.

For investors and industry observers, the most noteworthy aspect of this interview is not any single data point, but the direction of change in AI infrastructure demand: GPUs are still important, but bottlenecks are spreading; Nvidia remains core, but CPUs, HBM, storage, and data center power supply capacity are becoming new variables; AI demand is still growing, but the future winner may depend on who can deliver complex infrastructure consistently, stably, and at scale.

Below is the original text:

CoreWeave is seen as an innovative early market leader in the neocloud (new cloud services) space.

It is the only cloud service provider to receive the highest-level 'Platinum Rating' from AI research firm SemiAnalysis. Founded in 2017, CoreWeave provides large-scale GPU computing power to startups and large enterprises.

Key Context recently interviewed CoreWeave Co-founder and Chief Development Officer Brannin McBee, and Vice President of Corporate Development and Investor Relations Nick Robbins, to discuss the current state of AI demand and the neocloud market.

Below are edited highlights from the conversation:

AI Demand Continues to Intensify

Tae: When did the demand wave for Agent AI begin to explode?

Brannin: We saw the real beginning in the fourth quarter of last year. At that time, we were having engineering-level conversations with clients about the products they expected to bring to market in the first quarter of this year.

This has always been a very important perspective for us when looking at customer demand. We have a deeply interconnected engineering relationship with our clients. It is this relationship that allows us to see trends early, rather than reacting passively after changes occur.

If we look from a product perspective in the AI market, I would say the first quarter was a moment of a huge inflection point for inference and AI consumption, and this acceleration is still ongoing.

Tae: What is the current state of AI demand? Compared to a few months ago, has there been absolutely no sign of slowing down in recent weeks?

Nick: It seems to intensify in new ways every day.

Tae: Talk about the rising trend in CPU demand relative to GPUs within the Agent AI wave. Will you deploy rows of Vera CPU racks next to Nvidia GPU servers?

Brannin: CoreWeave has been running CPUs since 2023. We've always had a full cloud product suite. So the question isn't whether we're just starting to add CPUs, but rather, what do customers actually need? Is that demand rising in relative terms? The answer is, very clearly, yes.

As agent and inference capabilities truly emerge within models, storage demand is also rising relative to previous generations. I believe this trend will continue.

Nick: To your question, the answer is yes. You will absolutely see a significant amount of Vera CPUs deployed alongside a significant amount of Vera Rubin servers. Last year, we fundamentally redesigned our base data center layout to allocate more space for storage and more CPUs to be deployed alongside GPUs.

We did this because we are in a very unique position within the entire ecosystem. We are the only independent cloud provider serving all the most advanced technology users. No other independent AI cloud provider can say that Anthropic, OpenAI, Meta, Google, Microsoft, Nvidia, etc., are all its customers.

This creates a beneficial flywheel, or positive feedback loop, for our business: we understand where customers are taking the technology and plan accordingly.

The Bottleneck Is No Longer Just GPUs

Tae: Will you primarily use Nvidia Vera CPUs in the future?

Nick: It depends on the specific workload. We operate in a customer-driven manner. We do expect to be an early and significant adopter of Vera CPUs, which we have disclosed. Currently, our fleet is actually predominantly AMD, but over time, this may change based on customer demand. Customer interest in Vera CPUs is very strong.

Brannin: This is also a good reminder to talk about how our contracts work. As you know, over 98% of our revenue is contract-driven. We are not guessing what kind of infrastructure our customers want. Customers tell us very explicitly what configurations they need. Everything is customer-driven. It is the customer defining what we build.

Tae: Talk about the competitive landscape. How are you entering the market and competing against neoclouds like SpaceX, Nebius, Oracle, and hyperscale cloud providers like Azure, AWS, Google?

Brannin: Regarding differentiation, I prefer to look at it from a third-party validation standpoint. Nine of the top ten AI labs globally, excluding China, use our platform. SemiAnalysis consistently rates us alone at the highest level for performance. I don't think we get the GPU allocations we do because of a personal friendship with Jensen.

It speaks to the deep confidence suppliers have in our execution track record and engineering capabilities, believing we can best demonstrate their product capabilities globally.

Nick: We are able to win hyperscale cloud provider customers because we are exceptionally good at execution. We can build these systems at incredible speed, and they run exceptionally well. We win research lab customers because we offer the strongest-performing versions of the technology and the best per-token efficiency.

We win enterprise customers because the infrastructure truly runs well, and we've built a superior, best-in-class orchestration layer, which is also recognized by the Platinum Rating and others.

But increasingly important, among AI cloud providers, we have built the most mature layer of capabilities covering inference and development tools, helping businesses actually put AI into production.

This means we are building and delivering products that ultimately help businesses with relatively lower technical maturity transform data into models, and then into agents that can run internally, while we can cross-sell CoreWeave cloud services in this process.

Tae: What are the current bottlenecks? Is it data center shells with power, GPUs, or electricians?

Brannin: It's powered shells, meaning data center shells with power availability. More precisely, the components inside these shells. You specifically mentioned electricians, and that's absolutely correct. This is a complex area.

But importantly, we already have 49 such sites live and operational. We are not pinning our hopes on one or two sites. We've done it 49 times.

This is a very deep execution track record.

It also means we have accumulated extensive knowledge on how to handle supply chain issues, which suppliers in this supply chain are suitable to partner with, and which are not.

Editor's Note: Powered shells refer to the data center building itself, excluding the actual computing server hardware.

Tae: What can you share about the cost and shortage of HBM memory? How are you coping? Do customers have to bear the cost increase?

Nick: The answer is yes. Our business model is designed to lock in the price we charge customers for GPUs—and more broadly, servers, whose price obviously includes HBM costs—at the same time we sign GPU purchase orders and determine our own costs.

This is how we isolate the impact of day-to-day price fluctuations.

If our component costs rise in the next transaction, we reflect that cost into the price we believe we can charge customers, thereby protecting our margins. We are well-protected in passing these costs on to customers. This is something we watch very closely.

Currently, acquiring components is not the biggest bottleneck. The biggest bottleneck is the powered shell. But at some point in the future, this answer may shift back and forth.

Tae: How do you expect the deployment ramp for Vera Rubin to unfold? What will the second half of this year look like?

Nick: We are clearly the first company globally to have powered on and fully validated VR, or Vera Rubin, cabinets. We did the same with GB200, GB300 last year. I expect VR to start appearing later this year.

I expect a truly large-scale, very strong deployment ramp throughout 2027. This rhythm is similar to GB: GB started appearing in 2025, but the truly massive ramp actually spanned the entirety of 2026. That is to say, a fair amount was deployed by the end of last year, but this year is the year of truly massive GB deployment.

I expect VR to follow a very similar rhythm over the next 12 to 18 months.

Câu hỏi Liên quan

QAccording to the CoreWeave executives, what is the current state of AI demand and how has it changed?

AThe executives state that AI demand seems to 'intensify' every day in new ways. It has not slowed down and continues to accelerate, with a significant inflection point occurring in the first quarter for inference and AI consumption.

QWhat role are CPUs playing in the AI workload landscape according to the interview, and what is CoreWeave's approach to CPU deployment?

AAs agentic AI and reasoning models rise, the relative importance and demand for CPUs and storage are increasing. CoreWeave has been running CPUs since 2023. The company redesigned its fundamental data center approach last year to make space for more CPU and storage alongside GPUs. Its deployment is entirely client-driven, with significant interest in NVIDIA's Vera CPU, though its current fleet is primarily AMD.

QWhat is identified as the primary bottleneck for AI infrastructure scaling in the interview, and why?

AThe primary bottleneck is identified as 'powered shells'—data center buildings with power and cooling, excluding the computing hardware. This is more critical than components like GPUs, HBM, or even electricians, because securing and building out these physical facilities is a complex supply chain and execution challenge, even though CoreWeave already has 49 such sites operational.

QHow does CoreWeave's business model protect its margins from price fluctuations in components like HBM memory?

ACoreWeave's model locks in the GPU price it charges clients at the same time it signs its own purchase orders for the GPUs/servers (which include the cost of HBM). If component costs rise for future purchases, they reflect that increased cost in the price they can charge new clients, thereby protecting their profit margins from day-to-day price volatility.

QWhat is CoreWeave's expected timeline for the deployment ramp of NVIDIA's upcoming Vera Rubin (VR) platform?

ACoreWeave expects Vera Rubin to start appearing later this year. They anticipate the truly massive, strong deployment ramp will occur throughout 2027, following a similar cadence to the GB200/GB300 platforms. GB started appearing in 2025, but the major ramp happened throughout 2026. They expect VR to follow a similar 12-18 month ramp trajectory.

Nội dung Liên quan

Làm Nhạc Trong Thị Trường Gấu: Một Cuộc Thử Nghiệm Sinh Tồn Của Một Ban Nhạc Bitcoin

Vào thời kỳ thị trường crypto suy thoái, ban nhạc Orange Pill Jam vẫn tiếp tục sáng tác và thu âm, thực hiện một thử nghiệm sống còn chống lại thuật toán. Ban nhạc này, chịu ảnh hưởng từ văn hóa Bitcoin, tạo ra âm nhạc đề cập đến chủ quyền tài chính, quyền riêng tư và sự tha hóa của các thể chế hiện đại. Họ chấp nhận thanh toán bằng Bitcoin và hoạt động theo mô hình Copyleft, cho phép chia sẻ và phối lại nhạc tự do. Ban nhạc hình thành sau một buổi biểu diễn ngẫu hứng của ca sĩ chính Mermaid tại hội nghị Plan B Forum ở Lugano năm 2022. Cô cùng nhà sản xuất/nhạc công Michi và người đồng sáng lập Martino phát triển âm nhạc từ những phác thảo ban đầu. Quy trình sáng tác kết hợp giữa ca từ đầy ý nghĩa của Mermaid và khả năng biến ý tưởng thành nhịp điệu cảm xúc của Michi. Các bài hát như "Cypherpunks' Manifesto" và "Free Fire" truyền tải thông điệp mạnh mẽ về quyền riêng tư, bẫy "miễn phí" và cảm giác được chấp nhận. Trước sự trỗi dậy của AI trong sáng tạo âm nhạc, họ tin rằng máy móc không thể thay thế được sự sáng tạo thực sự bắt nguồn từ ý định và bản năng của con người. Dù đối mặt với thách thức về doanh thu trong thị trường giá xuống và khó khăn trong việc tiếp cận khán giả trên các nền tảng phát nhạc hàng loạt, Orange Pill Jam vẫn kiên trì. Họ đại diện cho niềm tin vào Bitcoin vượt trên giá cả và vào giá trị không thể thay thế của sự sáng tạo cụ thể, không thể sao chép của con người trong thời đại kỹ thuật số.

marsbit12 phút trước

Làm Nhạc Trong Thị Trường Gấu: Một Cuộc Thử Nghiệm Sinh Tồn Của Một Ban Nhạc Bitcoin

marsbit12 phút trước

Sự ra đi liên tiếp của nhân tài AI tại Google: Áp lực thử nghiệm hay dự báo "tử chiến"?

Các tài năng AI hàng đầu của Google như Noam Shazeer (đồng tác giả Transformer), John Jumper (AlphaFold) và Daniel De Freitas (Character.AI) lần lượt rời đi để gia nhập OpenAI và Anthropic, gây ra những lo ngại về làn sóng chảy máu chất xám. Tuy nhiên, góc nhìn khác cho rằng đây là cuộc chiến tranh giành nhân tài điển hình trước thềm IPO của các đối thủ, và việc họ nhắm vào Google chứng tỏ đây vẫn là nguồn nhân lực AI dồi dào bậc nhất. Thay vì một "cáo phó", đây là bài kiểm tra áp lực dành cho Google. Công ty không chỉ cạnh tranh ở mặt mô hình (Gemini) mà còn có lợi thế toàn diện: cơ sở hạ tầng (TPU, Google Cloud), hệ sinh thái sản phẩm với hàng tỷ người dùng (Search, YouTube, Android), và nguồn thu ổn định. Đáng chú ý, Google còn là đối tác cung cấp hạ tầng đám mây cho chính các đối thủ như Anthropic và OpenAI, vừa cạnh tranh vừa hợp tác. Dù phải đối mặt với thách thức chuyển đổi từ kinh doanh tìm kiếm cốt lõi và sự nhanh nhẹn của các startup, Google với nguồn lực khổng lồ, khả năng tích hợp AI sâu rộng vào các sản phẩm hiện có và tham vọng dài hạn trong AI cho khoa học (như AlphaFold) vẫn được xem là một trong số ít công ty có khả năng vượt qua giai đoạn chuyển đổi này và định hình tương lai AI.

marsbit18 phút trước

Sự ra đi liên tiếp của nhân tài AI tại Google: Áp lực thử nghiệm hay dự báo "tử chiến"?

marsbit18 phút trước

Mô Hình Đảo Chiều XRP Hình Thành Xung Quanh Mẫu Harmonic và Vùng Hỗ Trợ Chính

XRP đang ở trong một khu vực kỹ thuật quan trọng có thể định hình bước di chuyển lớn tiếp theo. Nhà phân tích The_Alchemist_Trader_ chỉ ra rằng XRPUSD đang giao dịch tại một vùng hỗ trợ tới hạn được củng cố bởi nhiều lớp hội tụ, bao gồm mức thoái lui Fibonacci 0.618 và điểm kiểm soát của phạm vi giao dịch hiện tại. Sự hội tụ này làm tăng tầm quan trọng của vùng giá này, biến nó thành chiến trường giữa người mua giá thấp và người bán. Luận thuyết biểu đồ cho thấy XRP có thể đang cố gắng xây dựng nền tảng cho một mô hình đảo chiều rộng hơn dựa trên cấu trúc Harmonic. Các thiết lập này cung cấp cho nhà giao dịch cách thức có cấu trúc để xác định các điểm xoay chiều tiềm năng, khu vực mất hiệu lực và mục tiêu đo lường. Câu hỏi then chốt là liệu vùng hỗ trợ này sẽ tạo ra một phản ứng dứt khoát hay chỉ đơn giản làm chậm đà giảm. Để xác nhận luận điểm đảo chiều, những người ủng hộ XRP cần thấy giá hồi phục mạnh từ vùng hội tụ, vượt qua được kháng cự gần đó, duy trì khối lượng và tránh tái kiểm tra vùng hỗ trợ ngay lập tức. Nếu không, thị trường có thể coi đây là một đợt bật tái phục hồi thất bại khác. Phân tích này nên được xem như một thiết lập để theo dõi hơn là một dự đoán. Vùng hỗ trợ rõ ràng, sự hội tụ đáng chú ý và cấu trúc đảo chiều tiềm năng rất đáng quan sát, nhưng thị trường vẫn cần xác nhận nó bằng hành động giá.

bitcoinist1 giờ trước

Mô Hình Đảo Chiều XRP Hình Thành Xung Quanh Mẫu Harmonic và Vùng Hỗ Trợ Chính

bitcoinist1 giờ trước

Ý tưởng về stablecoin thuật toán của Vitalik: Góc nhìn từ quyền chọn giải mã cơ chế và thách thức

Tác giả phân tích đề xuất về stablecoin thuật toán của Vitalik từ góc nhìn quyền chọn. Ý tưởng cốt lõi là chia 1 ETH thành hai phần: phần "Ổn định" (P) đảm bảo giá trị đến một mức giá thực hiện nhất định, hoạt động như một quyền chọn mua bán được bảo đảm (covered call); và phần "Tăng giá" (N) nhận toàn bộ lợi nhuận nếu giá vượt trên mức đó. Cơ chế này không cần nợ, ký quỹ hay thanh lý. Tuy nhiên, thách thức lớn nằm ở việc duy trì tính ổn định. Tài sản P cần liên tục gia hạn các quyền chọn mua có giá thực hiện sâu trong vùng có lãi, dẫn đến rủi ro trượt giá khi gia hạn, bị tận dụng giao dịch và thiếu thanh khoản. Vấn đề then chốt là ai sẽ liên tục nắm giữ phần N - một vị thế mua ETH có đòn bẩy không chịu phí funding và rủi ro thanh lý - để hệ thống mở rộng. Bài viết so sánh với kinh nghiệm từ Rysk, nơi quyền chọn đã thành công khi trở thành module cơ sở cho các sản phẩm như công cụ tạo thu nhập thay vì một sản phẩm giao dịch độc lập. Tác giả kết luận rằng tương lai của quyền chọn trong DeFi không phải là cạnh tranh với hợp đồng vĩnh cửu, mà là đóng vai trò là động cơ định giá và phân bổ rủi ro nền tảng cho thế hệ sản phẩm tài chính phức tạp tiếp theo như stablecoin, sản phẩm cấu trúc hoặc chỉ số.

marsbit1 giờ trước

Ý tưởng về stablecoin thuật toán của Vitalik: Góc nhìn từ quyền chọn giải mã cơ chế và thách thức

marsbit1 giờ trước

SpaceX, AI và XRP: Tại sao lần chuyển dịch tài sản tiếp theo có thể khác biệt?

Bài viết khám phá sự hội tụ tiềm năng của ba lĩnh vực công nghệ - Cơ sở hạ tầng vũ trụ (đại diện bởi SpaceX), Trí tuệ nhân tạo (AI) và mạng lưới thanh toán blockchain (như XRP) - như là động lực chính cho một đợt chuyển dịch tài sản và chu kỳ đầu tư cơ sở hạ tầng mới. Nội dung chính cho rằng, khi các động lực tăng trưởng truyền thống chậm lại, vốn toàn cầu đang tìm kiếm các hệ thống nền tảng mới. Các dự án cơ sở hạ tầng không gian, mạng lưới vệ tinh, trung tâm dữ liệu và năng lực tính toán AI sẽ thúc đẩy nhu cầu lớn về hàng hóa cơ bản. Đồng thời, các hoạt động kinh tế mới nổi này sẽ cần một lớp giải quyết thanh toán toàn cầu hiệu quả, nơi các tài sản kỹ thuật số tập trung vào thanh toán và khả năng tương tác như XRP có thể đóng vai trò then chốt. Bài viết nhấn mạnh sự chuyển dịch từ một "câu chuyện đầu cơ" sang một "câu chuyện cơ sở hạ tầng" trong thị trường tài sản kỹ thuật số, nơi giá trị sẽ ngày càng được xác định bởi tiện ích thực tế như khối lượng giao dịch, hoạt động thanh toán và sự tích hợp với AI (ví dụ: các tác nhân AI thực hiện giao dịch tự động). Sự rõ ràng về mặt quy định được coi là yếu tố quan trọng để thúc đẩy việc áp dụng rộng rãi hơn bởi các tổ chức. Tóm lại, tác giả đưa ra viễn cảnh về một nền kinh tế tương lai, nơi các hệ thống cơ sở hạ tầng vật chất (vũ trụ, AI), nhu cầu nguyên vật liệu và lớp tài chính kỹ thuật số (blockchain) hội tụ, tạo ra cơ hội đầu tư sớm cho những ai nhận ra xu hướng này.

marsbit3 giờ trước

SpaceX, AI và XRP: Tại sao lần chuyển dịch tài sản tiếp theo có thể khác biệt?