GPT Designs GPT

marsbitОпубликовано 2026-06-25Обновлено 2026-06-25

Введение

OpenAI has unveiled its first custom AI chip, Jalapeño, a move signaling a strategic shift beyond being a mere model company. While many see it as a challenge to NVIDIA, its core aim is to control the entire intelligent production pipeline—from models and chips to data centers and energy. The key driver is the evolving competitive landscape: model advantages are shrinking, while the computational gap in areas like cost-per-token, system throughput, and energy efficiency is becoming the true long-term barrier. Jalapeño is primarily an inference chip, targeting the massive and growing "inference tax"—the daily operational cost of generating tokens for services like ChatGPT and APIs. By designing its own hardware optimized for its specific workloads and future product roadmaps (even using AI to aid the chip design process), OpenAI aims to drastically reduce token generation costs and improve system efficiency. This creates a potential flywheel: better models help design better chips, which lower costs for running next-generation models, supporting more users and products, which in turn provides more data to refine future chips. The strategy mirrors Apple’s integrated approach, building a closed loop where hardware, software, and applications are co-optimized. In the long term, OpenAI is not trying to become the next NVIDIA (a supplier of "shovels" to all AI companies) but to own and operate the entire "mine"—selling the end product of intelligence itself. This move marks OpenA...

OpenAI is finally making chips.

When many people saw this news, their first reaction was: Nvidia is in trouble.

But what I see is precisely the opposite.

The most significant meaning of the first chip, Jalapeño, is not that it's coming directly for Nvidia.

This is the first time OpenAI has publicly admitted it's not satisfied with being just a model company.

What it wants to control is the entire process of producing intelligence.

From models, to chips. From data centers, to energy. From training, to inference. From producing Tokens, to selling Tokens.

Jalapeño appears to be a chip on the surface, but it's actually more like a roadmap.

OpenAI has finally laid its ambition on the table.

I. The Model Gap is Shrinking, the Compute Gap is Widening

Since the explosion of large models, almost all attention in the AI industry has been on the models.

The industry was shocked by GPT-4, then Claude caught up, Gemini caught up, DeepSeek delivered high cost-performance, Meta pushed open source. Every release, everyone focuses on the same set of things: parameters, leaderboards, coding ability, math ability, long context, multimodal capabilities.

Models are, of course, important. But a change has already occurred: the window of model leadership is getting shorter. Today, a model is just released, and within months, the open-source community, competitors, and cloud providers catch up. Performance gaps still exist, but they are increasingly difficult to constitute a long-term moat on their own.

The things that truly create differentiation are moving to a deeper, more foundational level. Compute supply, inference costs, system throughput, networking capabilities, data center construction, energy acquisition. These aren't as flashy as model launches, nor do they go viral immediately. But they determine whether an AI company can run long-term.

Jensen Huang recently said something: Nvidia systems might not have the lowest purchase price, but they can generate the lowest-cost Tokens, the highest Token throughput, and ultimately bring the highest revenue.

Huang's statement was direct. The industry has long complained that Nvidia is expensive. Huang didn't argue about the purchase price; he reframed the problem in another dimension: don't look at how much the machines cost to buy, look at the production cost per Token.

This is the new ledger for the AI era. Servers and GPUs are not the ultimate unit; the Token is.

OpenAI happens to be at the very center of this problem.

ChatGPT handles a massive volume of requests daily, Codex consumes even more inference steps, and in the future, there are Agents, video generation, robotics, long reasoning chains. The more useful the model, the greater the Token consumption. The more successful the product, the thicker the inference bill.

The brutal part is here: the more users OpenAI has, the more money Nvidia makes. The stronger OpenAI's products, the heavier the underlying compute tax.

If every Token has to pass through an external hardware platform paying a toll, it's hard for OpenAI to have a complete moat. It can have the strongest model, a super entry point, a developer ecosystem. But the core production cost is always in someone else's hands.

This is the essence of Jalapeño. OpenAI has started building its own Token factory.

II. GPT Begins Designing GPT

The most underestimated detail about the Jalapeño chip is the nine-month tape-out.

Traditional high-performance ASIC projects typically have cycles of 18 to 36 months. Advanced processes are even more troublesome—architecture, verification, physical implementation, packaging, software stack, debugging—any hiccup can rapidly escalate costs. OpenAI and Broadcom compressed the cycle to nine months.

This cannot be understood as the chip industry suddenly becoming simple. OpenAI did not spontaneously grow a semiconductor supply chain. Broadcom has deep experience in custom chips and network infrastructure; Celestica handles boards, racks, and systems engineering.

What OpenAI truly contributed is something scarcer: it knows how future models will run.

Many chip companies building AI accelerators face the challenge of guessing the workload. Model architectures change, inference methods change, service patterns change. Once a chip is taped out, it's not as easy to roll back in the physical world as it is in the software world.

OpenAI doesn't have to rely entirely on guesswork. Running ChatGPT, Codex, and APIs daily, it knows which kernels are used most, which memory transfers are most wasteful, which network bottlenecks most affect cluster efficiency, which latencies directly hurt product experience. It also knows how future Agent products will consume inference resources.

This experience was once just backend engineering knowledge; now it's being written into the chip architecture.

A crucial statement in OpenAI's official press release: OpenAI used its own models to accelerate parts of the design and optimization process. It also said that models provided to users are helping improve the infrastructure that will run future models.

GPT has started participating in designing the machines for the next generation of GPT.

For decades, the chip chain was: first design the chip, the chip runs the software, the software runs the AI. Now, the chain is turning back: AI helps humans design chips, which then run the next generation of AI.

Once this loop is established, nine months might just be the beginning. The future could be six months, three months, or even more frequent iterations.

The chip industry had its own rhythm, the model industry had its own rhythm. The former was slow, the latter fast. Jalapeño is pulling these two rhythms together.

If this step succeeds, OpenAI's flywheel will become formidable. Better models help design better chips, better chips lower the running cost of the next model generation, lower costs support more users and products, more users and products generate more real workload data, which in turn defines the next generation of chips.

This is the cycle OpenAI truly wants.

III. Cutting the Inference Tax, Controlling Cash Flow

Jalapeño is not a training chip; it targets large language model inference. This is key.

Training is like building an aircraft carrier. A huge one-time investment, requiring extremely strong general-purpose capability, and constant adaptation to new models, architectures, and experiments. The training market still heavily depends on Nvidia—not just the GPUs, but the entire platform: CUDA, networking, systems, software libraries, developer ecosystem.

Inference is more like a fleet of taxis. Running daily, hourly, by the minute. Every time a user asks a question, an API responds, an Agent takes a step forward, inference happens. It cares more about low latency, low cost, high throughput, high utilization.

Training burns big money in phases; inference burns daily cash flow.

This is also the most painful problem for AI companies as they enter the commercialization stage. GPT training is expensive once, but inference happens every day. The Agent era will further amplify this problem—one task may involve dozens or even hundreds of model calls. Long context, chain-of-thought reasoning, multimodal generation, code execution—all continue to push Token consumption higher.

Jalapeño is precisely targeting this inference tax. It's more like OpenAI's own TPU. Google, Amazon, Meta, Microsoft have all taken similar routes—as long as the workload is sufficiently large, custom ASICs make economic sense for high cost-effectiveness.

OpenAI now meets these conditions. Real requests, a product roadmap, a model team, industry partners like Broadcom, and immense cost pressure.

Jalapeño doesn't need to be sold externally to prove its value. As long as it makes ChatGPT answers cheaper, makes Codex run faster, and makes API margins higher, it's meaningful.

OpenAI also mentioned that Jalapeño will reduce data transfers, balance compute, memory, and network resources, bringing actual utilization closer to theoretical peaks. Compute is expensive often because it's not fully utilized—GPUs waiting for networks, memory transfers slowing down computation, poor scheduling causing idle time—all waste eventually turns into electricity bills and capital expenditure.

The purchase price is only the first layer; system efficiency is the final account.

IV. OpenAI is Looking More and More Like Apple

Many interpret Jalapeño as OpenAI challenging Nvidia, but I think OpenAI doesn't want to become the next Nvidia; it's more like emulating Apple.

Apple's greatest strength has never been any single point. The iPhone is strong, iOS is strong, the A-series and M-series chips are strong, the App Store is strong. But the truly difficult thing to compete against is how all these things are placed within the same closed loop.

Chips are optimized for the system, the system is optimized for applications, and the application experience in turn defines the next generation of chips. This closed loop allows Apple to deliver experiences under the same battery, same size, and same thermal constraints that others find hard to replicate.

OpenAI is building something similar. The model is the intelligence kernel, ChatGPT is the super entry point, Codex is the development tool, API is the ecosystem distribution layer, Jalapeño is the custom chip, and data centers are the AI factories.

Over the past two years, OpenAI CEO Altman has repeatedly discussed chips, energy, nuclear fusion, data centers. Looking back now, he might not have been chasing trends at all; he has stopped planning OpenAI in the way an AI startup would.

If Nvidia sells shovels, then OpenAI wants to own the mine.

Nvidia wants to be the factory equipment supplier for all AI companies, selling GPUs, networking, systems, software ecosystems, AI factory solutions—its ideal customers are every company that needs to produce Tokens.

OpenAI wants to build a factory for itself, selling not the equipment, but the final, generated intelligence.

In the short term, OpenAI still depends on Nvidia. Training and general-purpose computing still require the GPU platform, and Jalapeño likely won't cover all workloads quickly. It will probably first enter OpenAI's most certain, largest-scale, highest-optimization-return inference scenarios.

In the long term, cracks have appeared. When model companies start having their own chip roadmaps, Nvidia's customers are no longer just customers. They also become another type of player in the AI infrastructure landscape.

Words Beyond the Layout

Over the past two decades, the most important asset on the internet was traffic. Whoever controlled the users, controlled the value.

Today, new rules are emerging in the AI era.

Models are becoming more like traffic, while compute is becoming more like land.

Models will iterate, products will change, leaderboards will keep refreshing. But those factories that produce intelligence—chips, networks, data centers, energy—will increasingly concentrate in the hands of a few players.

GPT designing GPT looks like just another tape-out.

But what it truly announces is this:

OpenAI is no longer satisfied with being the smartest company; it wants to be the company that controls the production of intelligence.

This article is from WeChat public account:Layout Beyond, author: Huahua

This article is from WeChat public account:Layout Beyond, author: Huahua, title image from: AI-generated

Трендовые криптовалюты

CitreaCTR

wrapped stUSDTWSTUSDT

Velodrome FinanceVELODROME

BrevisBREV

ZRX（0X）ZRX

PancakeSwapCAKE

Связанные с этим вопросы

QWhat is the most significant implication of OpenAI's announcement of the Jalapeño chip, according to the article?

AThe article argues that the most significant implication is not a direct challenge to Nvidia, but rather OpenAI's first public acknowledgment that it is no longer satisfied being just a model company. It wants to control the entire process of intelligence production, from models and chips to data centers and energy.

QAccording to the article, what is the new fundamental 'unit' of account in the AI era, and why is this important for OpenAI?

AThe article states that the Token has become the new fundamental 'unit' of account in the AI era, moving beyond server and GPU costs. This is crucial for OpenAI because its successful products like ChatGPT consume vast amounts of tokens daily. The cost of producing each token is the core determinant of its long-term economic moat and competitiveness.

QHow does the article explain the significance of Jalapeño's reported 9-month time-to-market for chip design?

AThe article highlights that a 9-month design cycle for a high-performance ASIC is remarkably fast compared to the traditional 18-36 months. This speed is attributed not just to partners like Broadcom, but to OpenAI's unique, real-world knowledge of model workloads from running its services. Crucially, it mentions OpenAI used its own models to accelerate parts of the design, signaling the beginning of a closed loop where AI helps design the hardware for the next generation of AI.

QThe article states Jalapeño is primarily an inference chip. Why is focusing on inference strategically important for OpenAI at this stage?

AFocusing on inference is strategically important because while training is a massive but periodic capital expense, inference is a continuous, daily cash flow drain. As OpenAI's products scale and new use cases like Agents emerge, token consumption for inference grows massively. By targeting the 'inference tax' with a custom ASIC, OpenAI aims to directly reduce the ongoing operational cost of its services, improving margins and controlling its financial fundamentals.

QWhat company does the article compare OpenAI's long-term strategy to, and what is the core similarity?

AThe article compares OpenAI's long-term strategy to Apple's. The core similarity is the ambition to build a tightly integrated, closed-loop ecosystem. For Apple, this loop includes chip (A/M-series), operating system (iOS), hardware (iPhone), and app store. For OpenAI, the envisioned loop includes model (GPT), application/entry point (ChatGPT/API), developer tools (Codex), and now its own infrastructure layer (Jalapeño chip, data centers). This integration allows for deep optimization that is difficult for competitors to replicate.

Похожее

Грант Кардон увеличил свои холдинги биткоина до 2700 BTC – Почему сейчас?

Кардона Кэпитал, компания Гранта Кардона, увеличила свои биткоин-холдинги до примерно 2700 BTC (стоимостью около $159 млн), купив актив по средней цене $59 000 на фоне падения рынка. Эта покупка контрастирует с действиями крупнейшего корпоративного держателя, MicroStrategy, который впервые утвердил план продажи до $1,25 млрд биткоинов и уже начал распродажу. Направление задают и спотовые биткоин-ETF США, зафиксировавшие в июне рекордный отток средств примерно в $4,06 млрд. Несмотря на массовую продажу и слабые настроения, технический анализ указывает на возможное дно цены биткоина. На недельном графике цена достигла нижней полосы Боллинджера (зеленая линия), которая неоднократно выступала в качестве поддержки и предшествовала восстановлению.

ambcrypto15 мин. назад

Грант Кардон увеличил свои холдинги биткоина до 2700 BTC – Почему сейчас?

ambcrypto15 мин. назад

Чем останется биткойн в эпоху ИИ?

Недавнее падение биткойна ниже 60 000 долларов вновь поднимает вопрос о его ценности в эпоху ИИ. Автор рассматривает ИИ и биткойн как две стороны одной медали. ИИ радикально снизил стоимость создания контента (текстов, изображений, видео) почти до нуля, что привело к потоку информации, где подлинное и сфабрикованное становится все труднее отличить. В результате истинную ценность приобретает не сам контент, а возможность его **верификации** — подтверждения подлинности фактов, активов, записей. Здесь и проявляется суть биткойна. Его часто критикуют за огромное энергопотребление, которое, в отличие от ИИ, кажется непродуктивным. Однако автор предлагает другую точку зрения: если ИИ сжигает энергию для **создания** (генерирования контента и возможностей), то биткойн сжигает её для **верификации**. Его децентрализованная сеть, основанная на криптографии и консенсусе, создает неизменяемый и самостоятельно проверяемый реестр транзакций. Энергия тратится на то, чтобы сделать подделку истории или мошенническую транзакцию астрономически дорогой и практически невозможной без захвата всей сети. Проводя историческую параллель, автор сравнивает ИИ с печатным станком Гутенберга, который резко удешевил распространение знаний, а биткойн/блокчейн — с двойной бухгалтерией, которая снизила затраты на доверие в коммерции. Таким образом, ИИ и блокчейн не конкурируют, а дополняют друг друга в новой цифровой реальности: один отвечает за безграничное **создание**, другой — за надежное **доказательство** и проверку. Биткойн, в этой логике, — это не просто машина для создания монет, а «машина для создания верифицируемости». В мире, где ИИ может сгенерировать что угодно, конечной ценностью может стать не количество контента, а наличие независимо проверяемых фактов и активов. Будущее биткойна остается неопределенным, но его основная функция — обеспечение доверия без доверия — приобретает новую актуальность в эпоху повсеместных глубоких подделок.

marsbit20 мин. назад

marsbit20 мин. назад

В эпоху ИИ, что остаётся у биткоина?

Автор: Sevclub, Seven Research В эпоху искусственного интеллекта, когда генерация текстов, изображений и видео стала дешёвой и быстрой, подлинность информации становится всё более сомнительной. ИИ снижает стоимость производства контента почти до нуля, что приводит к переизбытку и смешению правды и лжи. В этих условиях ключевой ценностью становится возможность верификации — подтверждения истинности. В этом контексте можно по-новому взглянуть на Биткоин, который часто критикуют за высокое энергопотребление. Его суть не в вере, а в криптографической проверке. Биткоин тратит энергию не на вычисления, как ИИ, а на обеспечение "неизменяемости", повышая стоимость фальсификации истории транзакций. Это делает его своего рода машиной по производству "верифицируемости". Проводя параллель с эпохой Возрождения, можно сказать, что ИИ — это новая "печатная пресса", радикально снижающая стоимость создания. Тогда как блокчейн (и Биткоин как его первое воплощение) может стать аналогом "двойной бухгалтерии", снижающим стоимость проверки и установления доверия в цифровом мире. Они не конкурируют, а дополняют друг друга: ИИ генерирует, блокчейн доказывает и верифицирует. Таким образом, в эпоху, когда ИИ может создать что угодно, истинным дефицитом становится не сам контент, а возможность независимой проверки фактов. Биткоин представляет собой попытку создать основу для такой верифицируемости цифровых активов и записей.

链捕手28 мин. назад

链捕手28 мин. назад

Маркировка Cardano как "призрачной цепи" опровергнута? Почему 34 dApps ADA не раскрывают полной картины

Термин «ghost chain» («цепь-призрак») относится к блокчейну с минимальной активностью и развитием. Хотя Cardano (ADA) обвиняют в этом из-за малого количества dApps (34 против 442 у Solana и 1564 у Ethereum) и значительно более низких показателей транзакций и пользователей, статья объясняет это архитектурными особенностями. Cardano использует модель EUTXO и механизмы батчинга (объединения транзакций), которые повышают детерминизм и безопасность, но при этом статистика «недооценивает» реальную активность в сети. При этом разработка на Cardano остается интенсивной. Автор приходит к выводу, что, несмотря на разрыв в метриках с другими ведущими блокчейнами (Ethereum, Solana, Tron), лишь одно это не является достаточным основанием для ярлыка «ghost chain», так как Cardano занимает свою нишу, делая акцент на научно обоснованный подход, безопасность и соответствие требованиям институциональных клиентов.

ambcrypto1 ч. назад

Маркировка Cardano как "призрачной цепи" опровергнута? Почему 34 dApps ADA не раскрывают полной картины

ambcrypto1 ч. назад

Запас Ethereum у Bitmine достиг $9,8 млрд: «Лучшие годы для криптовалют еще впереди»

Криптокомпания Bitmine Immersion Technologies увеличила свои запасы Ethereum (ETH) на 27 084 монеты за последнюю неделю. Теперь в её казне находится 5 700 040 ETH, что составляет 4,7% от общего предложения Ethereum и оценивается примерно в 9,01 млрд долларов по цене 1569 долларов за монету. Это произошло на фоне падения цены ETH и оттока средств из ETF-фондов Ethereum в июне. Несмотря на слабые рыночные условия и критику в адрес аналогичной стратегии накопления биткоинов компанией MicroStrategy, Bitmine продолжает агрессивно покупать ETH. Председатель Bitmine Том Ли считает, что текущая волатильность отчасти связана с «оконной отделкой» перед концом квартала, и выражает уверенность в будущем крипторынка. Компания подчеркивает свою устойчивость, отмечая ежегодный доход от стейкинга в размере около 211 млн долларов, наличие 555 млн долларов денежных средств и ликвидных ценных бумаг, а также включение в индекс Russell 1000. Ли заявил, что лучшие годы для криптовалют ещё впереди, и ожидает, что токенизация и прогресс в области искусственного интеллекта подстегнут спрос на блокчейн и децентрализованные криптоактивы.

ambcrypto2 ч. назад

Запас Ethereum у Bitmine достиг $9,8 млрд: «Лучшие годы для криптовалют еще впереди»

ambcrypto2 ч. назад

Торговля

Спот

Обсуждения

Добро пожаловать в Сообщество HTX. Здесь вы сможете быть в курсе последних новостей о развитии платформы и получить доступ к профессиональной аналитической информации о рынке. Мнения пользователей о цене на PEOPLE (PEOPLE) представлены ниже.