RTX Spark Superchip Brings Petaflop AI to Windows PCs

What RTX Spark Is and Why It Matters for Windows AI PCs

RTX Spark is NVIDIA’s first Windows-focused superchip for the emerging Windows AI PC category, combining a Blackwell RTX GPU, a 20-core Grace CPU, and up to one petaflop of AI performance to run large local AI models, personal AI agents, and advanced graphics workloads without relying on cloud data centers. Announced at GTC Taipei with Microsoft as a core partner, RTX Spark brings NVIDIA’s CUDA, RTX, DLSS, and TensorRT stack into a single package designed for slim laptops and compact desktops. Jensen Huang describes it as “the new PC. The personal AI computer,” highlighting a shift from launching apps to asking an AI-driven system to do the work instead. For Windows users, the promise is clear: local AI processing with petaflop performance, lower latency, and more privacy-friendly AI agents running directly on their own hardware.

NVIDIA RTX Spark Superchip Lands on Windows PCs with One Petaflop of Local AI Power

Inside the RTX Spark Superchip: Architecture and Petaflop Performance

At the heart of the RTX Spark superchip is a pairing of an NVIDIA Blackwell RTX GPU and a 20-core NVIDIA Grace CPU, linked by the high-speed NVLink-C2C interconnect. The GPU includes 6,144 CUDA cores and fifth-generation Tensor Cores with FP4 precision, tuned for high-throughput AI inference rather than only traditional graphics. The Grace CPU, co-developed with MediaTek and based on Arm, is built for efficiency and connectivity in portable Windows AI PCs. This design supports up to 128GB of unified memory, so AI models and applications see a single large memory pool instead of separate CPU and GPU memory blocks. According to NVIDIA’s announcements, the platform delivers up to one petaflop of AI compute, bringing what used to be workstation-class or data-center-class performance into consumer-oriented Windows systems.

NVIDIA AI Agents on Windows: OpenShell, Security, and Local Models

RTX Spark is built around NVIDIA AI agents, not only gaming or creative workloads. Together with Microsoft, NVIDIA is introducing new Windows security primitives and the NVIDIA OpenShell runtime to run AI agents locally with clear policy controls. Users can define what agents are allowed to access, route queries to local AI models when privacy is important, and mask sensitive data before any cloud call is made. This local-first approach aims to reduce latency and improve privacy by keeping more work on-device. NVIDIA says RTX Spark can run language models with up to 120 billion parameters and context windows of up to 1 million tokens locally, a scale that previously demanded cloud infrastructure. Projects such as Hermes Agent and OpenClaw are already targeting this stack, building native Windows agent applications that live directly on the user’s PC instead of in a browser tab.

Competing in the Windows AI PC Market Against AMD and the Cloud

By bringing a petaflop-class RTX Spark superchip to Windows AI PCs, NVIDIA is moving directly into the same on-device AI space pursued by AMD’s Ryzen AI Max and other neural processing hardware. Where many current AI PCs add a small NPU alongside a conventional CPU-GPU pair, RTX Spark fuses CPU and GPU into a unified superchip aimed at heavier AI agent workloads and local model inference. The NVIDIA–Microsoft partnership positions Spark as a reference platform for Windows-native AI agents, tying it tightly into Windows’ new agent ecosystem and security model. For users, the important shift is that an AI-capable PC no longer needs constant cloud access for powerful assistants, code copilots, or creative tools. As more software targets OpenShell and local inference, RTX Spark systems could reduce dependence on remote GPUs and subscription services for everyday AI tasks.

What One Petaflop of Local AI Processing Enables for Everyday Users

For Windows PC owners, one petaflop of local AI processing means AI agents and models can feel immediate and always available, even offline. RTX Spark-class systems can run large language models with long contexts locally, so personal assistants can remember more of a user’s history, projects, and files without sending them to remote servers. Creative users gain enough headroom to edit 12K 4:2:2 video, render 90GB-plus 3D scenes, or generate 4K AI video directly on their desktop or laptop. Gamers still get high-end RTX features, including ray tracing, DLSS and Reflex, plus support for DLSS 4.5 Ray Reconstruction and RTX Video with 4x Frame Generation. Satya Nadella has described RTX Spark as “a real breakthrough” toward “unmetered intelligence to every home and every desk with Windows,” capturing how local AI compute may become a standard expectation, not a premium add-on.