MilikMilik

Apple Ditches Its Own AI Servers for Siri as Google and Nvidia Step In

Apple Ditches Its Own AI Servers for Siri as Google and Nvidia Step In
Interest|High-Quality Software

What Apple’s Siri Redesign and AI Infrastructure Shift Means

Apple’s Siri redesign and AI infrastructure strategy shift refers to the company’s move from relying primarily on its own on-device and in-house cloud systems to a hybrid model that runs Apple Intelligence across Apple Silicon, Google Cloud, and Nvidia’s Blackwell AI chips. This change centers on a rebuilt Siri, expected around September 2026 alongside iOS 27, that aims to evolve from a simple voice assistant into a modern AI system comparable to current large language model tools. The core redesign keeps basic tasks such as alarms and device controls on-device, while complex requests move to cloud processing. That cloud layer will no longer live only on Apple’s own servers, but will depend on Google’s Gemini AI models and data centers powered by Nvidia GPUs. For a company famous for vertical integration, this marks a major strategic reversal with direct implications for privacy, performance, and competitive positioning.

From Apple-Built AI Servers to Google Cloud and Nvidia Blackwell

Apple originally tried to run Siri’s next generation entirely on its own AI servers, using Private Cloud Compute to keep processing under tight control. According to AppleInsider, Apple attempted to run a version of Google’s Gemini model under this system, but the setup “was too slow to be usable.” The Information, cited by The Tech Portal, reports that Apple has now turned to Google’s Gemini models, paying around USD 1 billion (approx. RM4.6 billion) per year for access to a customized 1.2 trillion‑parameter system, far larger than Apple’s estimated 150 billion‑parameter in-house models. These larger models will run on Google Cloud infrastructure equipped with Nvidia Blackwell B200 GPUs, designed for large-scale AI workloads. In practice, this means Siri’s most demanding tasks—long document summaries, multi-step app workflows, and complex questions—will execute on third-party hardware, while Apple continues to refine smaller models through techniques like model distillation.

Apple Intelligence: A Hybrid of On-Device and Third-Party AI

The new Siri sits at the center of Apple Intelligence, which now spans both Apple Silicon and third-party processors. Apple still highlights on-device processing for simple, latency-sensitive actions: setting timers, toggling settings, or pulling quick facts can run without an internet connection. For richer interactions, Siri will tap cloud-based Gemini models, turning the assistant into a more conversational system that maintains context over multiple prompts. The Tech Portal notes that Siri will reach deeper into apps such as Mail, Messages, Calendar, Photos, and Notes to answer more personal, context-aware queries and automate multi-step workflows like drafting emails or managing tasks. Model distillation lets Apple’s smaller models “learn” from Gemini’s responses, closing some of the capability gap over time. This hybrid approach balances Apple’s desire for tight hardware-software integration with the performance and scale advantages of specialized cloud AI infrastructure.

Private Cloud Computing, Privacy Promises, and New Risks

Apple’s move to Google Cloud Nvidia infrastructure raises immediate questions about private cloud computing and Siri privacy. Previously, Craig Federighi said it was “essential for privacy and security” that cloud-based Apple Intelligence ran only on Apple servers. That stance has softened under the Google Gemini partnership, which AppleInsider reports includes Google Cloud running Nvidia Blackwell B200 chips. Apple plans to enable confidential compute on these GPUs, so data stays encrypted even during processing. Today, Apple’s Private Cloud Compute limits what user prompts are sent to the cloud and prevents them from being retained for training. Apple says these rules continue to apply, even as third-party hardware enters the path. Still, routing sensitive Siri requests through external infrastructure introduces new trust dependencies and potential attack surfaces. The challenge for Apple is to prove that its privacy guarantees can survive this more complex, shared AI stack without eroding user confidence.

Why Apple’s Strategy Shift Matters for the Wider AI Industry

Apple’s decision to abandon its own AI servers for Siri and rely on Google Cloud Nvidia hardware reflects a larger industry trend: even the most vertically integrated companies are adopting specialized AI hardware and cloud providers to reach competitive performance. Reports of Apple acquiring Nvidia NVL72 servers underscore how central Blackwell-class GPUs have become for large-scale models. At the same time, this move reshapes Apple’s identity as a company that traditionally controlled the entire stack, from chips to services. Competitors will see validation for partnering on AI infrastructure, while regulators and privacy advocates gain a new case study in how personal assistant data flows through multi-party systems. For users, the redesigned Siri promises more capable, conversational experiences, but also a shift in the privacy contract. Apple Intelligence is no longer an all-Apple affair; it is a hybrid ecosystem where control, responsibility, and trust are shared.

Milik earns a commission when you shop through our links, at no extra cost to you. Editorial content is independently selected by our team.

You May Also Like

Comments
Say something...
No comments yet. Be the first to share your thoughts!