Google Officially Declares War

链捕手Published on 2026-05-21Last updated on 2026-05-21

Abstract

Google Declares War with AI-First I/O 2026 At its 2026 I/O developer conference, Google launched an aggressive, multi-pronged offensive, embedding AI across its ecosystem and challenging rivals on performance and price. The event showcased three major releases: Gemini 3.5 Flash, the video-centric Gemini Omni Flash, and the system-level AI assistant Spark. Gemini 3.5 Flash, despite being a smaller "Flash" model, outperforms its Pro counterpart in key benchmarks like mathematical reasoning (GSM8K) and coding (SWE-bench). Google attributes this to "extreme knowledge distillation" from a larger teacher model and a novel, highly granular MoE (Mixture of Experts) architecture with 256 experts, achieving sub-65ms response times. The native multi-modal model, Gemini Omni Flash, offers real-time video understanding with 120ms latency, enabling applications like preventing a cup from overfilling. The new Spark assistant gains deep Android system integration, allowing it to automate complex multi-app workflows based on voice commands. Complementing these, Google unveiled lightweight AI glasses featuring Micro-OLED displays and on-device Gemini chips for instant, offline translation and scene analysis. CEO Sundar Pichai announced Gemini has reached 900 million monthly active users, leveraged through integration into Chrome, Android, and Workspace. Google also slashed prices dramatically: the Gemini 3.5 Flash API is priced at a fraction of competitor rates. This price war is enabled ...

The 2026 Google I/O developer conference left only one impression: arrogance.

Not only did they cram AI agents seamlessly into all core traffic portals like search, browsers, phones, and smart glasses like force-feeding a duck, but they also threw out three killer bombs in a row: Gemini 3.5 Flash, the video model Omni, and the brand-new AI assistant Spark.

After flexing its muscles, Sundar Pichai even boasted by announcing that Gemini's monthly active users had surpassed 900 million, and simultaneously announced a significant price reduction.

The message couldn't be clearer:I'm stronger and cheaper than you.

If that's not a declaration of war, what is?

01

Undoubtedly, the most dazzling highlight of the conference was the debut of Gemini 3.5 Flash.

Normally, "Pro" represents the core strength, while "Flash" represents lightweight and speed.

In terms of model parameters, 3.5 Flash is indeed smaller than 3.1 Pro, yet its performance surpassed the latter in almost all reasoning and coding benchmarks:

In the GSM8K test for complex mathematical reasoning, 3.5 Flash scored 95.8%, surpassing 3.1 Pro's 93.2%; in the full version of the SWE-bench for code generation capability, 3.5 Flash achieved a solve rate of 38.4%, far exceeding 3.1 Pro's 32.1%......

Why?

According to the "Gemini 3.5 Technical Report" released by DeepMind, there are two key core technologies.

Extreme Knowledge Distillation: Google did not simply rely on brute-force compute to train Flash this time. Instead, it used a never-before-disclosed "Gemini 3.5 Ultra" as a teacher model for dimensional reduction distillation of Flash.

According to a tweet analysis by Jeff Dean, Chief Scientist at DeepMind, the proportion of fine-tuning on high-quality reasoning chain datasets for 3.5 Flash increased by 400% compared to the previous generation.

This means it inherited the "logical brain" of a super-large model, rather than a memorized "knowledge base."

A Novel MoE Architecture (Mixture of Experts): Inside 3.5 Flash, Google employed more fine-grained expert networks.

Traditional MoE might only have 8 or 16 experts, activating only 1-2 per task, sufficient for supporting trillion-parameter scale models.

However, according to an analysis from a16z's 2026 AI Infrastructure Investment Memo, 3.5 Flash employs 256 micro-experts, activating the most efficient 4 of them during each inference.

This is why it can cover an extremely vast multimodal feature space while maintaining an extremely low number of activated parameters.

In the TTFT (Time to First Token) metric, 3.5 Flash has reached below 65 milliseconds.

A human blink takes 100-150 milliseconds.

In short, when it operates as an agent, from a human physiological perspective, there is virtually no perceptible delay.

For developers requiring frequent tool calls, multi-round reflection, and extremely low latency, this is the perfect super-agent foundation.

Only with such extreme engineering optimization can "device-side deployment" dominance be established in a fiercely competitive environment.

First, the native multimodal Gemini Omni Flash.

Omni means all-around, targeting the earlier GPT-4o. Just from the name, you can feel the intensity of the rivalry.

At least in terms of performance, Gemini Omni Flash is far more deserving of the "o" character than GPT-4o.

Early models like Sora or Gemini 1.5 were essentially patchwork, converting speech to text and then text to vision.

But this released Omni features true native end-to-end multimodal alignment. It natively understands temporal coherence and physical laws within videos, and latency dropped from the industry average of 400-600 milliseconds to 120 milliseconds.

An example from the keynote: a user wearing a camera pours water; as the cup nearly fills, Omni can say "Stop stop stop!" 0.5 seconds before the water overflows.

This kind of real-time inference about the physical state of the real world seems simple but is profoundly significant:AI has officially evolved from a chatbot on a screen into a real-world auxiliary tool.

Although still in its early stages.

Second, the intelligent assistant Spark.

According to a The Verge interview with an Android engineering VP, Spark has been granted native API control at the system layer of Android 17.

In short, the complex workflows that previously required you to open many apps can now be handled without lifting a finger. Just tell Spark what you need, and it can handle everything for you—even sending messages in your tone, sorting emails, summarizing schedules, tracking webpage updates, identifying hidden charges on bills, batch-processing documents, and so on......

In other words, with an AI assistant in the future, we might hardly need apps anymore; any complex operation is simplified into a single command.

Third, smart glasses.

Why glasses again?

At least in Google's view, seamless access to vision and hearing is the ultimate host for multimodal large models.

These glasses appear without any fancy aesthetics, focusing entirely on utility:

4-gram Micro-OLED full-color waveguide lenses with a light transmittance as high as 85%;

Equipped with a self-developed lightweight Gemini edge-side chip, local inference latency ≤12ms, capable of real-time translation, image recognition, and scene analysis without an internet connection;

Natively integrated with the Spark agent, syncing with phone and cloud data to deliver personalized services like schedule reminders, real-time translation, and environmental alerts.

In short, it's about bypassing the phone screen and embedding the agent into the human first-person perspective through glasses.

There's simply too much content. Google seems to have dumped all its trump cards at once, proclaiming a truth to the market:

An algorithm without an entry point is nothing.

The era of chasing model parameters and benchmark scores is over. Pure model providers no longer have a moat. The future is a four-dimensional space battle of "device + cloud + ecosystem + hardware."

Cramming AI into its suite is reshaping the entire internet's traffic distribution logic: from "users actively search/click" to "AI agents actively distributing services."

For the vast majority of developers and small-to-medium enterprises, this is excellent news because the underlying compute and models become extremely cheap, allowing everyone to focus on application-layer innovation.

But other competitors right now probably just want to curse out loud.

02

When Sundar Pichai casually announced on stage that "Gemini's monthly active users have officially surpassed 900 million," it caused quite a stir in the audience.

900 million—more than all the MAUs of its US competitors combined.

How was this achieved?

The answer is simple and brutal: Forced integration.

Google doesn't need to spend on advertising for user acquisition like independent AI companies. It just needs to add an icon next to Chrome's address bar, integrate a shortcut in the bottom navigation bar of 3 billion Android phones, push a full update within Google Workspace......

The customer acquisition cost is basically zero.

More crucially, in the coming period, the gaze of 900 million active users as they look at products with smart glasses, the logical adjustments made when using Spark to handle tasks, and the interactions with the Omni visual model will generate a massive amount of high-quality, multimodal real-world feedback data, all of which will become nourishment for Gemini 4.

This forms an extremely robust barrier:The better the model -> the more people use it -> the more data generated -> the better the model becomes.

To rapidly strengthen this loop, Google directly declared a price war on all competitors: The AI Ultra subscription was slashed from $249.99/month to $99.9/month.

The input price for 1 million tokens for 3.5 Flash was driven down to $0.02, and the output price for 1 million tokens is $0.08.

What kind of magical price is this?

For comparison, the industry average prices for similar-tier models are $0.15-0.2 and $0.6-1, respectively.

Sundar Pichai calculated: top customers process about 1 trillion tokens per day. Shifting 80% of the workload to Gemini 3.5 Flash for a year can save over $1 billion.

Why dare to sell AI at cabbage prices?

The biggest reliance is: vertically integrated computing infrastructure.

Including giants like OpenAI, Anthropic, despite their apparent success, are essentially "compute tenants," needing to buy computing power from Microsoft, Amazon, who in turn pay Jensen Huang (Nvidia).

Google has its own TPUs, coupled with the incredibly efficient sparse activation of 3.5 Flash's MoE architecture, compressing compute costs to the extreme.

It can leverage its heavy-asset advantage to deliver a dimensional blow to pure algorithm companies.

The logic is clear.

Foundation models are rapidly becoming commoditized. Like water and electricity, have you ever seen a water utility company making obscene profits?

Google isn't afraid of the model itself not being profitable, because it can make money back through search ads, cloud services, and fees from the Android ecosystem.

But for pure model API sellers like OpenAI, Anthropic, Cohere, Mistral, this is not feasible.

Investors probably want to press Sam Altman's head and ask:"Google's API price is only one-tenth of yours, and its performance is better. Tell me, how does your business model work?"

Competitive landscapes across multiple industries will thus enter an accelerated reshuffling period.

AI vendors, needless to say, must quickly find cheaper compute sources or venture into chipmaking themselves.

Next is Apple, still developing behind closed doors.

The combination of smart glasses + the Omni video model + Spark's native system-level control undoubtedly already threatens the iPhone.

According to Macquarie's "Consumer Electronics Trend Forecast Report": Within the next three years, the proportion of time spent on screenless interaction based on vision/voice is expected to jump from the current 8% to 35%.

If users get accustomed to using glasses and voice for daily work and entertainment, screen time will inevitably be significantly reduced.

If Apple cannot counter with sufficiently impressive wearable devices (Vision Pro is too heavy and expensive, destined to be a toy for a minority), its monopoly on mobile internet entry points will face an unprecedented challenge.

This is not an iteration; it's a revolution.

Google, with its three blades—technology, traffic, and price—has issued a declaration of war to all its rivals.

Is there anyone still laughing at its corporate bureaucracy now?

Trending Cryptos

CitreaCTR

wrapped stUSDTWSTUSDT

Velodrome FinanceVELODROME

China Launches Production of Chip Printing Machines: Why Experts Aren't Quick to Applaud

China has reportedly begun production of immersion DUV lithography machines, critical for manufacturing modern semiconductors. According to reports citing anonymous sources, the state-owned Shanghai Aishengna Electronic Technology Group plans to produce a limited number of these machines in 2026 and 2027, with initial deliveries intended for major Chinese chipmakers. However, industry experts and analysts express significant skepticism. They note the new equipment is far from matching the performance, reliability, and commercial viability of systems from the global leader, ASML. Key challenges identified include the difficulty of transitioning from limited production to high-volume manufacturing with consistent precision and yield over thousands of cycles. Analysts also caution that previous announcements of Chinese progress in this field have often failed to meet expectations. While the development is notable, experts suggest a more immediate risk to ASML could come from potential U.S. export restrictions rather than Chinese competition. The market consensus is that ASML's dominance in the lithography equipment segment remains secure for the foreseeable future. The true test for the Chinese machine will be whether it can achieve meaningful, scaled production by 2027 as claimed.

cryptonews.ru1h ago

China Launches Production of Chip Printing Machines: Why Experts Aren't Quick to Applaud

cryptonews.ru1h ago

The Most Important Decision by the Fed Tomorrow! What Are the Expectations? It May Not Be as Final as Previous Ones!

The Federal Reserve (Fed) is set to announce its latest interest rate decision tomorrow. Markets widely expect the Fed to hold rates steady, with an 80% probability of no change and a 19% chance of a 25-basis-point hike. The Fed has kept its benchmark rate at 3.50–3.75% since December. Recent energy price increases and hawkish comments from some Fed officials have raised the possibility of a hike. However, softer-than-expected inflation data and a lull in US-Iran tensions suggest the bar for a rate increase this week might be higher than markets anticipate. Economists note that if the Fed hikes after a long pause, it likely wouldn't be a one-off move. Historically, when the Fed begins a rate cycle, it continues in the same direction for several meetings. A potential hike could signal further increases ahead. Yet, officials are believed to be unprepared to commit to a series of hikes now. Former St. Louis Fed President James Bullard echoed this, stating the Fed typically avoids single, isolated rate changes and that policymakers must decide if they are ready to start a hiking series, which he doubts they are for this meeting.

cryptonews.ru1h ago

The Most Important Decision by the Fed Tomorrow! What Are the Expectations? It May Not Be as Final as Previous Ones!

cryptonews.ru1h ago

Visa outlines stablecoin strategy during Q3 earnings call

Visa's fiscal third-quarter revenue grew 14% year-over-year to $11.6 billion, supported by strong growth in payments volume, cross-border transactions, and processed transactions. During its earnings call, the company detailed its expanding stablecoin strategy, emphasizing investments across the entire stablecoin ecosystem, including blockchain infrastructure, issuance, wallets, and applications. Visa highlighted progress in issuance and application layers, and announced its participation in the OpenStandard consortium, which plans to issue the OpenUSD stablecoin for global money movement. The Visa stablecoin platform is designed to allow partners to settle with Visa using stablecoins, provide onchain wallet services, and facilitate conversions between fiat currency and stablecoins, starting with OpenUSD. The platform will also integrate with Pismo to support financial institutions' tokenized deposits, with plans to add more infrastructure providers. Visa further described stablecoins and artificial intelligence as complementary long-term growth technologies, suggesting that while stablecoins reshape the backend of commerce, AI is transforming the frontend.

cointelegraph1h ago

Visa outlines stablecoin strategy during Q3 earnings call

cointelegraph1h ago

3% Cashback, 6% Annual Interest, and a Debit Card: Elon Musk Launches X Money in the US

On July 27, 2026, the social media platform X (formerly Twitter) launched its integrated financial service, X Money, initially available only to US-based Premium and Premium+ subscribers. Founder Elon Musk emphasized the launch's significance for the platform. US residents over 18 with a verified X account can use the service. X Payments LLC is not a bank; user funds are held in FDIC-insured accounts at Cross River Bank, with insurance potentially extended up to $10 million through a cash sweep program across partner banks. The core of the service is the X Card, a virtual or physical metal Visa debit card. Key features include: 3% cashback on most purchases (with some exclusions), free instant peer-to-peer payments, worldwide ATM fee reimbursement, no foreign transaction fees, and support for Apple/Google Pay. The service offers interest on account balances, up to 6.00% APY. Premium+ users get this rate immediately, while Premium users must meet a qualifying direct deposit requirement. Additional features include early direct deposit, wire transfers, bill pay, and paper check ordering. Security features include passkey login, customizable transaction limits, and Visa's Zero Liability Policy for fraud. The public launch followed a limited beta test with select Premium+ users in late June 2026. The launch marks X's continued evolution from a social network into a broader digital platform combining communication with everyday financial operations.

cryptonews.ru2h ago

3% Cashback, 6% Annual Interest, and a Debit Card: Elon Musk Launches X Money in the US

cryptonews.ru2h ago

$134 Million in Bitcoin Long Positions 'Evaporated' as BTC Price Dropped Below $63K Ahead of Critical Fed Meeting

Bitcoin's price tumbled below $63,000 on Tuesday, leading to a sharp spike in leveraged long position liquidations totaling $134 million and bringing total BTC-related liquidations over $156 million. The drop, which saw BTC briefly hit a daily low near $62,684, extended its monthly losses to 4%. Market experts are pointing to a pivotal week ahead, highlighted by the upcoming Federal Open Market Committee (FOMC) meeting, core PCE inflation data, and major earnings reports from companies like Apple and Amazon. According to analysts, the tone from Fed Chair Kevin Warsh's press conference will be crucial. A dovish Fed stance coupled with strong AI capex forecasts and positive PCE data could propel Bitcoin toward $68,000-$70,000 in August. Conversely, hawkish signals alongside disappointing corporate earnings could quickly bring the $58,000-$60,000 range back into view.

cryptonews.ru2h ago

$134 Million in Bitcoin Long Positions 'Evaporated' as BTC Price Dropped Below $63K Ahead of Critical Fed Meeting

cryptonews.ru2h ago

Trading

Spot

Hot Articles

How to Buy WAR

Welcome to HTX.com! We've made purchasing WAR (WAR) simple and convenient. Follow our step-by-step guide to embark on your crypto journey.Step 1: Create Your HTX AccountUse your email or phone number to sign up for a free account on HTX. Experience a hassle-free registration journey and unlock all features.Get My AccountStep 2: Go to Buy Crypto and Choose Your Payment MethodCredit/Debit Card: Use your Visa or Mastercard to buy WAR (WAR) instantly.Balance: Use funds from your HTX account balance to trade seamlessly.Third Parties: We've added popular payment methods such as Google Pay and Apple Pay to enhance convenience.P2P: Trade directly with other users on HTX.Over-the-Counter (OTC): We offer tailor-made services and competitive exchange rates for traders.Step 3: Store Your WAR (WAR)After purchasing your WAR (WAR), store it in your HTX account. Alternatively, you can send it elsewhere via blockchain transfer or use it to trade other cryptocurrencies.Step 4: Trade WAR (WAR)Easily trade WAR (WAR) on HTX's spot market. Simply access your account, select your trading pair, execute your trades, and monitor in real-time. We offer a user-friendly experience for both beginners and seasoned traders.

2.6k Total ViewsPublished 2024.03.29Updated 2026.06.02

Discussions

Welcome to the HTX Community. Here, you can stay informed about the latest platform developments and gain access to professional market insights. Users' opinions on the price of WAR (WAR) are presented below.

Hot Categories

Others212 news items

Google Officially Declares War

Abstract

01

02

Trending Cryptos

Related Questions

Related Reads

China Launches Production of Chip Printing Machines: Why Experts Aren't Quick to Applaud

The Most Important Decision by the Fed Tomorrow! What Are the Expectations? It May Not Be as Final as Previous Ones!

Visa outlines stablecoin strategy during Q3 earnings call

3% Cashback, 6% Annual Interest, and a Debit Card: Elon Musk Launches X Money in the US

$134 Million in Bitcoin Long Positions 'Evaporated' as BTC Price Dropped Below $63K Ahead of Critical Fed Meeting

Trading

Hot Articles

How to Buy WAR

Discussions

Top Questions

Hot Categories

Hot Tags