What Apple Intelligence Is and How Its Architecture Has Changed
Apple Intelligence is Apple’s AI system that combines on-device AI models with cloud-based Apple Foundation Models to power features like a revamped Siri and personalized assistance, while claiming a privacy-first design that limits how user data is processed, stored, and shared across devices and servers. Initially, Apple framed Apple Intelligence as running only on Apple Silicon, reinforcing the idea that on-device AI models were the default for privacy. That story has now shifted. Apple still runs many tasks locally, but complex requests are routed to larger cloud models, including a 20‑billion‑parameter on-device model and even larger Apple Foundation Model (AFM) variants in the cloud. This hybrid cloud AI approach underpins the new Siri, which can manage multi-step, context-aware tasks. The key question for users is no longer whether data stays on the device, but when and how it moves to Apple’s servers and beyond.

From On-Device Only to Hybrid Cloud AI
Apple’s messaging has moved from “on-device only” toward a more honest description of a layered system orchestrator that routes each request. Routine or sensitive tasks—like interacting with your messages or calendar—are intended to stay on-device, using smaller, efficient models tailored for Apple Silicon. When Siri or Apple Intelligence faces a complex query, the request can escalate to a cloud-based AFM. According to Apple’s WWDC presentations, this system orchestrator is “key to the privacy architecture of our entire system,” deciding whether a request can be served locally or must go to the cloud. The 20‑billion‑parameter on-device model pushes the limits of local processing, but it still cannot match frontier-scale models. That gap drives the need for AFM Cloud and AFM 3 Cloud Pro, which handle more demanding tasks while Apple tries to keep data exposure controlled and time-limited.
Google and Nvidia Inside Apple’s ‘Private’ Cloud
Apple now confirms that its AFM Cloud Pro runs inside Google Cloud on Nvidia GPUs, including Nvidia’s Blackwell B200 chips. This means Apple Intelligence relies on an Nvidia Google partnership at the infrastructure level, even as Apple insists that user data remains shielded from those partners. Apple distills its AFM 3 Pro from a licensed Google Gemini model, then pre-trains and post-trains it within its own Apple Foundation Model Cloud. Apple’s Private Cloud Compute (PCC) framework extends to Google Cloud with “NVIDIA Confidential Computing with NVIDIA GPUs, Intel CPUs with TDX, and Google’s Titan chip.” In theory, confidential compute keeps data encrypted even while processed on these chips. Apple further states that it maintains a cryptographically verifiable, append-only ledger for all Google Cloud hardware in the PCC fleet, and that AFM 3 Cloud and ADM 3 Cloud (Image) run on Apple’s own servers alongside AFM 3 Cloud Pro.

Why Apple Talks Up Nvidia and Downplays Google
Apple’s public story highlights Nvidia while soft-pedaling Google’s role. In marketing and technical clarifications, Apple emphasizes that AFM 3 Pro is “its own creation,” even while acknowledging it was distilled from a 1.2‑trillion‑parameter Gemini model licensed from Google. This framing allows Apple to claim independence at the model level, despite depending on Google’s cloud infrastructure for AFM Cloud Pro. At the same time, Apple’s executives praise Nvidia’s latest GPUs as the hardware foundation that makes fast, private cloud computing possible for the new Siri and Apple Intelligence. By stressing Nvidia’s confidential computing features, Apple can reassure users that even when data leaves the device, the hardware providers cannot see it. The result is a carefully curated narrative: Apple ingenuity at the top, Nvidia as the trusted engine, and Google kept mostly in the background, even though its technology and cloud are central to the system.

What Users Should Know About Apple Intelligence Privacy
For users, the core Apple Intelligence privacy question is when data leaves the device and who touches it. Apple says many requests stay local, but anything that needs AFM Cloud or AFM 3 Cloud Pro will go to Apple’s Private Cloud Compute, which now spans both Apple data centers and Google Cloud with Nvidia GPUs. Apple claims that initial parsing happens in isolated processes, inference software is recycled quickly, and keys sit in separate confidential VMs. Apple also promises “the industry’s most comprehensive transparency guarantees,” including public research tooling and research-mode access to live PCC nodes via the Apple Security Bounty Program. Still, the hybrid cloud AI design means your Siri and Apple Intelligence interactions can cross into third-party infrastructure. To use these features knowingly, users should watch for settings that explain when cloud processing is used, how Apple Intelligence privacy controls work, and which tasks remain fully on-device.






