MilikMilik

Apple Abandons Homegrown AI Servers for Google Cloud and Nvidia

Apple Abandons Homegrown AI Servers for Google Cloud and Nvidia
Interest|High-Quality Software

What Apple’s New Siri Shift Actually Means

Apple’s latest Siri overhaul is a strategic pivot where the company abandons its own AI server infrastructure and instead runs complex Siri requests on Google Cloud infrastructure powered by Nvidia Blackwell chips, reshaping how Apple Intelligence balances on-device processing, cloud performance, and user privacy. The redesigned Siri, expected alongside iOS 27 around September 2026, is described as a full rebuild that turns the assistant into a modern AI system rather than an incremental update. Apple’s plan is to split workloads: routine tasks like timers or settings stay on Apple Silicon, while demanding jobs—such as document summaries or multi‑step actions across apps—move to the cloud. This hybrid model underpins the new Apple Siri redesign, promising more capable conversational AI, but it also exposes the gap between Apple’s earlier promise of Apple Intelligence running only on its own chips and the realities of large‑scale AI server infrastructure.

From Homegrown AI Servers to Google’s Faster Path

Behind the scenes, Apple reportedly spent years building proprietary AI servers for Siri and Private Cloud Compute, only to hit a wall when scaling to state‑of‑the‑art language models. According to The Tech Portal, Apple is now paying for access to a customized Google Gemini model with about 1.2 trillion parameters, far larger than its estimated 150‑billion‑parameter in‑house cloud models. AppleInsider reports that Apple tried running a version of Gemini under its own Private Cloud Compute stack but found it too slow to be usable, pushing the company to seek a faster route through Google Cloud infrastructure instead. This move looks like a rare admission that Apple could not optimize the entire stack alone at the required speed. Model distillation will allow Apple’s smaller models to learn from Gemini’s outputs over time, but for now, the brains of the new Siri largely sit outside Apple’s own data centers.

Why Nvidia Blackwell and Google Cloud Power the New Siri

To make the new Siri responsive and scalable, Apple is tying its AI server infrastructure to Nvidia Blackwell B200 GPUs running inside Google Cloud data centers. These chips are built for large, complex AI workloads, making them suitable for a 1.2‑trillion‑parameter Gemini model handling global Siri traffic. AppleInsider notes that this setup includes enabling Nvidia’s confidential computing features, which encrypt data while it is being processed to limit exposure even inside the cloud. That arrangement is meant to approximate Apple’s Private Cloud Compute guarantees, where only user prompts travel to the data center and are not stored for training. The reported purchase of Nvidia servers and reliance on third‑party hardware marks an unusual step for Apple, which historically controlled its full stack. Here, speed and capability appear to have outweighed the habit of strict vertical integration.

Apple Intelligence Privacy: Promise vs. Practice

Apple has heavily promoted Apple Intelligence privacy, emphasizing that many features run on-device and that cloud requests should stay on Apple‑controlled servers. AppleInsider recalls Craig Federighi’s 2024 statement that using only Apple servers was “essential for privacy and security.” The shift to Google Cloud and Nvidia Blackwell chips complicates that story. Apple aims to maintain its promises with several measures: confidential computing on Nvidia hardware, encrypted in‑processing data, and rules that prevent prompts from being retained for training. Still, users may question how “on‑device” their data remains when complex Siri tasks depend on Google’s infrastructure and models. The Apple Siri redesign will need clear messaging on when requests leave the device, what is logged, and how long data persists. The success of Apple Intelligence now hinges as much on transparent privacy explanations as on technical safeguards in the cloud.

What to Expect When the Redesigned Siri Launches

When the revamped Siri arrives around September 2026 with iOS 27, users can expect a more conversational assistant that behaves closer to modern AI chat systems. The new Siri should keep context across messages, support follow‑up questions, and tap personal data—from Mail, Messages, Calendar, Photos, and Notes—to deliver richer, more helpful answers. Multi‑step actions, like drafting an email, attaching a file, and scheduling a meeting in one request, will be handled by a mix of on‑device models and cloud‑based Gemini processing. Performance will lean on Google Cloud infrastructure and Nvidia Blackwell chips for heavy lifting, while simpler commands stay local to Apple Silicon. This hybrid Apple Intelligence design tries to combine speed, capability, and privacy. How well Apple explains and enforces its privacy safeguards will decide whether users see this as an upgrade or a compromise.

Milik earns a commission when you shop through our links, at no extra cost to you. Editorial content is independently selected by our team.

You May Also Like

Comments
Say something...
No comments yet. Be the first to share your thoughts!