RTX Spark superchip and unified memory for Windows

What RTX Spark Is and Why It Matters for Windows PCs

NVIDIA RTX Spark is an ARM-based Windows processor that combines a 20-core Grace CPU, a Blackwell RTX GPU, and up to 128GB of unified memory into a single superchip to deliver 1 petaflop AI performance and enable powerful local AI agents without depending on the cloud. Positioned as an “Apple Silicon moment” for Windows, RTX Spark moves beyond separate CPU and GPU chips by using a shared memory pool and a high-speed NVLink-C2C interconnect. That design removes many of the data transfer delays that limit traditional PCs when they run large AI models or high-resolution creative workloads. NVIDIA claims RTX Spark systems can run 120-billion-parameter models locally and support context windows up to one million tokens, while still pushing modern games at 1440p above 100 frames per second with ray tracing and DLSS 4.5.

Inside the RTX Spark Superchip: Grace CPU, Blackwell RTX GPU, and NVLink-C2C

At the silicon level, the RTX Spark superchip centers on a custom 20-core Grace CPU built with MediaTek and a Blackwell RTX GPU featuring 6,144 CUDA cores and fifth-generation Tensor Cores. The components share a single TSMC 3nm package and communicate over NVIDIA’s NVLink-C2C, a chip-to-chip interconnect designed to move data faster than a traditional PCIe link. According to NVIDIA, this setup delivers up to one petaFLOP of AI compute using FP4 precision, putting workstation-class AI performance into slim laptops and compact desktops. An integrated NPU augments the GPU for lower-power inference tasks and background AI services. Because RTX Spark also carries the full RTX software stack—CUDA, TensorRT, DLSS 4.5, Reflex, G-SYNC, and ray tracing—developers can treat it like a unified platform instead of juggling separate desktop and mobile GPU configurations.

RTX Spark Superchip Brings Petaflop AI and Unified Memory to Windows

Unified Memory Architecture: NVIDIA’s Answer to Apple Silicon

The unified memory architecture is what makes RTX Spark more than a faster GPU. Up to 128GB of LPDDR5X memory is shared between the Grace CPU and Blackwell RTX GPU, so both processors see the same address space without copies across a bus. This mirrors Apple Silicon’s approach, where CPU and GPU share a large unified memory pool, but RTX Spark is tuned for the Windows ecosystem and NVIDIA’s CUDA-centric workflows. By tying unified memory to NVLink-C2C and high AI throughput, RTX Spark can render 90GB 3D scenes, edit 12K 4:2:2 video, and generate 4K AI video without the usual CPU–GPU data bottlenecks. For AI workloads, unified memory means local large language models up to 120 billion parameters can live in one coherent space, improving both latency and throughput for complex prompts and long context windows.

Local AI Agents on Windows: From Privacy to Performance

RTX Spark’s design is explicitly aimed at local AI agents running on a primary Windows PC instead of in the cloud. NVIDIA says RTX Spark systems can power agents that handle everything from code generation and document analysis to creative assistance, all while using local models when privacy or latency is critical. On the software side, NVIDIA and Microsoft are adding new Windows security primitives and an OpenShell runtime that lets users control which files and sensors AI agents can access, route requests to local models first, and mask personal data before any cloud call. This focus on privacy-aware local AI parallels efforts on Apple hardware, but with a Windows-first stack including CUDA, TensorRT, and OpenShell-friendly frameworks such as Hermes Agent and OpenClaw. The result is a PC that treats AI agents as first-class, always-on workloads rather than optional add-ons.

ARM-Based Windows Strategy and the Road Ahead

RTX Spark also signals a broader shift toward ARM-based Windows processors. By pairing an ARM Grace CPU with a Blackwell RTX GPU and promising full Windows 11 compatibility, NVIDIA is stepping directly into territory occupied by Apple Silicon and Snapdragon X. NVIDIA says it is working with Microsoft, Adobe, and others to "guarantee" that all apps, including games, will run on RTX Spark devices, while Adobe is rearchitecting Photoshop and Premiere specifically for the platform. Early RTX Spark systems from ASUS, Dell, HP, Lenovo, MSI, and Microsoft Surface will be thin 14–16 inch laptops and compact desktops arriving in Q3, with more models from Acer and Gigabyte later. If compatibility holds and performance matches NVIDIA’s claims, RTX Spark could push Windows laptops toward unified memory architectures as the default and make ARM-based Windows PCs a serious option for AI developers, creators, and gamers.