"Borrowing a Chicken to Lay Eggs"? Apple Secures Google Gemini License: Crafting the Strongest On-Device AI Through Model "Distillation"

marsbitОпубліковано о 2026-03-26Востаннє оновлено о 2026-03-26

Анотація

Apple has reportedly secured full access to Google's Gemini model, not for simple API integration, but to perform "model distillation"—a technique that allows Apple to train smaller, more efficient on-device AI models by learning from Gemini’s advanced reasoning processes. This approach enables Apple to develop localized AI systems that operate entirely on-device, improving speed and safeguarding user privacy. The distilled models are expected to power a significantly upgraded Siri in iOS 27, enabling deeper interaction, complex question-answering, document understanding, and practical task execution—all without requiring cloud connectivity. While leveraging Google’s model in the short term, Apple continues to develop its own foundational AI models internally, adopting a dual strategy of near-term collaboration and long-term independence. This move represents Apple’s effort to balance cutting-edge AI performance with its strict privacy and efficiency standards, potentially reshaping the landscape of on-device artificial intelligence.

To completely rid Siri of the "artificial stupidity" label, Apple has chosen a technical shortcut of "leveraging strength through borrowing."

According to the latest reports, Apple has obtained full access to the Google Gemini model. The core of this collaboration is not merely about API integration, but rather permits Apple to utilize Gemini's powerful computing capabilities for "model distillation," thereby tailoring smaller, smarter, localized AI models for its devices.

Technical "Distillation": Packing a Cloud Brain into the iPhone

The so-called "distillation" technology can be understood as having a "genius teacher" personally instruct an "average student":

Extracting the Essence: Apple queries the main Gemini model and obtains its detailed reasoning processes, allowing its self-developed smaller model to learn its internal computation methods.

Reducing Costs and Increasing Efficiency: After training, Apple's smaller model can achieve performance close to that of large cloud models with minimal computing power requirements.

Localized Security: These models can run on-device without needing an internet connection, enhancing response speed while perfectly upholding Apple's pride in user privacy.

See You in iOS 27: The New Generation Siri Receives a "Dimensionality Reduction Strike"

Leveraging this collaborative framework, the new Siri expected to launch with iOS 27 will possess completely revamped capabilities:

Deep Interaction: Able to answer complex questions, provide emotional support, and even tell stories to users.

All-in-One Assistant: Equipped with practical operational abilities like scanning and understanding documents, summarizing information, and booking travel with one click.

Highly Customized: Apple has the right to edit and adjust Gemini to ensure its response style highly aligns with the Apple user experience.

Walking on Two Legs: Apple Never Gave Up "Going It Alone"

Although currently relying on Google's support for "intelligence," Apple has not put all its eggs in one basket.

Its internal Apple Foundation Models team is still pushing forward with full force on independent, self-developed AI models. This indicates Apple is adopting a strategy of "short-term borrowing, long-term self-reliance," quickly capturing the market using third-party mature technology while building its own core foundational model system.

From cloud "distillation" to local operation, Apple is attempting to find that perfect balance between privacy, efficiency, and cutting-edge AI capability through this unique approach. When your iPhone possesses an offline "Gemini-level" brain, the AI competition in mobile devices will truly have entered the second half.

Трендові криптовалюти

Пов'язані питання

QWhat is the core of the collaboration between Apple and Google Gemini?

AApple has obtained full access to Google's Gemini model, allowing it to use 'model distillation' to create smaller, smarter on-device AI models.

QHow does the 'distillation' technology work in this context?

AIt involves Apple using the full Gemini model to generate detailed reasoning processes, which its smaller models then learn from, enabling high performance with low computational demands.

QWhat are the expected capabilities of the new Siri in iOS27?

AIt is expected to handle complex queries, provide emotional support, tell stories, scan and understand documents, summarize information, and perform tasks like booking travel, all while running locally on the device.

QWhy is Apple pursuing this collaboration instead of relying solely on its own AI development?

AApple is adopting a 'short-term borrowing, long-term self-development' strategy, using third-party technology to quickly enter the market while continuing to develop its own core foundation models internally.

QWhat are the main benefits of running AI models locally on devices like the iPhone?

ALocal operation improves response speed, reduces the need for cloud connectivity, and enhances user privacy by keeping data on the device.

Пов'язані матеріали

CPU Makes a Comeback to the Table, A $170 Billion "Power Seizure" Drama Begins

A new era is dawning for the server CPU (Central Processing Unit), driven by the shift from AI model training to large-scale reasoning and the rise of Agentic AI. This article explores how the CPU is reclaiming a central role in the AI data center. For years, the focus has been on the GPU (Graphics Processing Unit) for AI training. However, as AI moves to the inference and Agent phase—where tasks involve complex, multi-step reasoning, tool calls, and data management—the workload balance is flipping. Studies show CPUs now handle over 70% of the workload in Agentic AI, up from 10-30% in training. This is because Agent tasks generate massive intermediate data (KV Cache) that exceeds GPU memory, forcing it to be offloaded to the CPU's larger, more scalable memory pools. This increased importance is translating into market changes. Major players are taking note: NVIDIA launched its first standalone CPU line, Vera, based on ARM architecture and optimized for Agent performance. AMD doubled its server CPU market forecast to over $1200 billion by 2030. Analyst reports project the total server CPU market could reach $1700 billion by 2030, with AI-driven demand being a primary driver. Furthermore, the classic ratio of CPUs to GPUs in AI servers is rapidly changing, converging from 1:8 toward 1:1 for Agent deployments. This surge in demand has led to a rare industry-wide price increase of 10-15% for server CPUs from Intel and AMD, breaking a decade-long trend of "more performance for the same price." Demand is bifurcating into high-core-count CPUs for in-rack GPU support and moderate-core CPUs for standalone Agent task orchestration. In China, this global trend presents an opportunity for domestic CPU manufacturers like Hygon (海光信息) and Huawei Kunpeng, who are bolstered by both growing AI infrastructure needs and national policies promoting technological self-reliance ("xin chuang"). The maturity of their software ecosystems is also accelerating, evidenced by faster adaptation to new AI models. In conclusion, the narrative is shifting from a GPU-centric view to one where CPU-GPU synergy is critical. The CPU is no longer a peripheral component but a performance-defining bottleneck and a key growth driver in the AI hardware stack, opening a massive new market estimated in the hundreds of billions of dollars.

marsbit3 год тому

CPU Makes a Comeback to the Table, A $170 Billion "Power Seizure" Drama Begins

marsbit3 год тому

TechFlow Intelligence: AMD AI Director Publicly Criticizes Claude Code for "Becoming Dumber and Lazier", Trump Claims Full Ceasefire in Hormuz But Strait Still Has 80 Unexploded Mines

TechFlow Intelligence Report: This daily digest covers key developments in AI, crypto, hardware, and geopolitics. In AI, SK Telecom faces US export control scrutiny over its partnership with Anthropic, while a Gemini user reports being misled in a scam scenario, sparking safety debates. China's Z.AI launches the GLM-5.2 model, rivaling Claude Opus without NVIDIA chips. In crypto, Bithumb lists ReProtocol, and Upbit delists KernelDAO. On the hardware front, MIT researchers build a custom OS to study chips, ASML denies US claims its advanced lithography machines are in China, and Amazon considers selling its in-house AI chips. Apple's future A21 Pro chip may use TSMC's latest N2P process. Major tech issues include 10,000 GitHub repositories distributing malware and Apple patching a critical eavesdropping flaw in Beats earbuds. US stocks rise, led by semiconductors, with Intel surging 10.6%, while SpaceX falls 3.5%. Geopolitically, despite a US-Iran deal, the Strait of Hormuz remains risky with ~80 uncleared mines, stalling 80M barrels of oil on standby tankers. Iran postpones Switzerland talks, and Trump calls the agreement an "unconditional surrender." The report highlights a contrast: temporary geopolitical calm versus the ongoing, fundamental restructuring of tech supply chains and chip independence.

marsbit3 год тому

TechFlow Intelligence: AMD AI Director Publicly Criticizes Claude Code for "Becoming Dumber and Lazier", Trump Claims Full Ceasefire in Hormuz But Strait Still Has 80 Unexploded Mines

marsbit3 год тому

Торгівля

Спот
Ф'ючерси

Популярні статті

Як купити T

Ласкаво просимо до HTX.com! Ми зробили покупку Threshold Network Token (T) простою та зручною. Дотримуйтесь нашої покрокової інструкції, щоб розпочати свою криптовалютну подорож.Крок 1: Створіть обліковий запис на HTXВикористовуйте свою електронну пошту або номер телефону, щоб зареєструвати обліковий запис на HTX безплатно. Пройдіть безпроблемну реєстрацію й отримайте доступ до всіх функцій.ЗареєструватисьКрок 2: Перейдіть до розділу Купити крипту і виберіть спосіб оплатиКредитна/дебетова картка: використовуйте вашу картку Visa або Mastercard, щоб миттєво купити Threshold Network Token (T).Баланс: використовуйте кошти з балансу вашого рахунку HTX для безперешкодної торгівлі.Треті особи: ми додали популярні способи оплати, такі як Google Pay та Apple Pay, щоб підвищити зручність.P2P: Торгуйте безпосередньо з іншими користувачами на HTX.Позабіржова торгівля (OTC): ми пропонуємо індивідуальні послуги та конкурентні обмінні курси для трейдерів.Крок 3: Зберігайте свої Threshold Network Token (T)Після придбання Threshold Network Token (T) збережіть його у своєму обліковому записі на HTX. Крім того, ви можете відправити його в інше місце за допомогою блокчейн-переказу або використовувати його для торгівлі іншими криптовалютами.Крок 4: Торгівля Threshold Network Token (T)Легко торгуйте Threshold Network Token (T) на спотовому ринку HTX. Просто увійдіть до свого облікового запису, виберіть торгову пару, укладайте угоди та спостерігайте за ними в режимі реального часу. Ми пропонуємо зручний досвід як для початківців, так і для досвідчених трейдерів.

458 переглядів усьогоОпубліковано 2024.12.10Оновлено 2026.06.02

Як купити T

Обговорення

Ласкаво просимо до спільноти HTX. Тут ви можете бути в курсі останніх подій розвитку платформи та отримати доступ до професійної ринкової інформації. Нижче представлені думки користувачів щодо ціни T (T).

活动图片