RTX Spark superchip for powerful Windows AI PCs

What RTX Spark Is and Why It Matters

RTX Spark is an AI-focused “superchip” platform for Windows PCs that combines a Grace CPU and a Blackwell RTX GPU to deliver high-performance local AI processing, enabling large language models, creative workloads, and gaming tasks to run directly on the device instead of relying on cloud data centers. Designed by NVIDIA with Microsoft and co-developed with MediaTek, the RTX Spark superchip is built around the GB10 Grace Blackwell architecture and targets premium Windows AI PCs for developers, creators, and power users. It offers up to 1 petaflop of AI compute along with 128GB of unified memory, making it suitable for running massive language models and complex AI agents on a laptop or compact desktop. According to NVIDIA, RTX Spark can run models with up to 120 billion parameters and 1 million-token contexts locally, which pushes it into workstation-grade territory for many users.

Inside the Grace Blackwell RTX Spark Superchip

At the hardware level, the RTX Spark superchip pairs a 20-core NVIDIA Grace CPU with an NVIDIA Blackwell RTX GPU that includes 6,144 CUDA cores and 5th-generation Tensor Cores using FP4 precision. An NVLink-C2C interconnect links CPU and GPU for fast data exchange, which is important when models and datasets are too large for typical consumer hardware. RTX Spark is positioned around GB10, a Grace Blackwell package initially introduced for desktop AI systems, now adapted for premium Windows AI PC designs. This configuration allows up to 1 petaflop of AI performance, moving it beyond the lightweight neural processing units found in many early AI laptops. The chip is also designed to support demanding graphics and gaming, with NVIDIA claiming it can run AA titles at 1440p with ray tracing, DLSS 4.5 Ray Reconstruction, and over 100 FPS while still delivering strong AI performance.

Unified Memory and Local AI Processing on Windows

One of RTX Spark’s defining features is its unified memory design, offering up to 128GB of shared memory between the Grace CPU and Blackwell GPU. Instead of copying data back and forth between separate pools, both processors work from the same large memory space, which allows bigger models and datasets to stay resident and reduces bottlenecks. Microsoft is tuning Windows for this architecture with workload profile scheduling, Prism emulation for older applications, and unified-memory optimization on RTX Spark systems. This means a Windows AI PC built on RTX Spark can keep local inference workloads on-device while still running legacy Windows software. For developers and creators, unified memory can make 200-billion-parameter model targets and heavy media workloads more practical locally, turning a high-end notebook into a kind of portable AI workstation rather than a thin client for cloud services.

Windows AI PC Integration, Agents, and Privacy

RTX Spark is closely integrated with Microsoft’s Windows agent ecosystem, providing a foundation for native AI assistants and agent-style applications that run locally. NVIDIA’s OpenShell runtime is designed to give these agents secure, policy-controlled execution with privacy and containment, so sensitive data does not have to leave the device. This supports advanced agents such as Hermes Agent and OpenClaw, which can operate on private documents, creative assets, and application workflows without constant cloud access. Windows ties into the hardware with new security primitives and scheduling tools that balance traditional apps with large local models. For many users, this changes the AI experience from cloud-dependent chatbots to persistent, context-aware agents embedded in the operating system and apps, using the Grace Blackwell GPU and unified memory to keep response times low and reduce exposure of personal data to remote servers.

When You Can Buy RTX Spark Systems and Who They Are For

NVIDIA and Microsoft plan to bring RTX Spark to premium Windows AI PCs starting in fall 2026, with laptops and desktops from Microsoft Surface, ASUS, Dell, HP, Lenovo, MSI, and later Acer and GIGABYTE. These devices are expected to include slim laptops as thin as 14mm and under 3 pounds, as well as compact desktops and higher-end builds. Early GB10-based systems are projected to cost between USD 3,000 and USD 4,000 (approx. RM13,800–RM18,400), which places them at the high end of the consumer market and targets developers, creative professionals, and serious gamers first. Over time, broader OEM support will show whether RTX Spark becomes a mainstream Windows AI PC tier or stays a specialist option. For anyone who values low-latency AI features, strong privacy, and powerful local models, RTX Spark marks a clear shift away from cloud-first AI toward on-device supercomputing.