RTX Spark superchip brings petaflop AI to PCs

What RTX Spark Is and Why It Matters

RTX Spark is NVIDIA’s first superchip designed specifically for Windows PCs, combining a Blackwell-generation GPU and Grace CPU into a single package that delivers up to one petaflop of AI performance for local AI agents, gaming and creative work. It is built to turn a traditional PC into what NVIDIA calls a “personal AI computer”, where users interact through natural language and intelligent assistants instead of manually launching and managing apps. With up to 128GB of unified memory, RTX Spark can host large language models and complex media workloads directly on the device. This pushes AI from remote data centres down to laptops and compact desktops, promising faster responses, better privacy and offline capability. In practical terms, RTX Spark aims to make on-device AI processing as central to the PC as CPUs and GPUs once were.

RTX Spark Superchip Brings Petaflop-Scale AI to Everyday PCs

Inside the RTX Spark Superchip Architecture

At the heart of the RTX Spark superchip is a tight integration of GPU and CPU tuned for AI-heavy tasks. The Blackwell RTX GPU brings 6,144 CUDA cores and fifth-generation Tensor Cores with FP4 precision, connected via NVLink-C2C to a 20-core Grace CPU co-designed with MediaTek on an Arm-based architecture. According to NVIDIA, the platform “delivers up to one petaflop of AI performance and supports up to 128GB of unified memory,” allowing local execution of 120-billion-parameter language models with context windows reaching 1 million tokens. Unified memory means AI agents, 3D scenes and video timelines share a single pool instead of copying data between CPU and GPU. This superchip design consolidates what used to be separate components into a cohesive AI-first processor for Windows PCs, improving efficiency and enabling thin laptops with all-day battery life as well as compact desktops.

Local AI Agents on Windows: From Apps to Autonomous Workflows

RTX Spark is built expressly for local AI agents on Windows, shifting everyday computing from point-and-click apps to conversations and task automation. NVIDIA and Microsoft are adding a Windows layer for native agents that covers identity, containment and policy, while NVIDIA OpenShell provides a runtime so agents can run safely on-device. This framework lets users define which files, apps or system capabilities agents may access, and route queries between local and cloud models depending on privacy rules. Agent developers such as Hermes Agent and OpenClaw are adopting this stack to enable cross-app workflows, semantic search over local files, and agent-triggered actions inside standard Windows software. The result is a move away from cloud-dependent assistants toward on-device AI processing, where personal agents can keep working with sensitive data, even offline, without sending everything to remote servers.

Gaming and Creative Workloads Meet On-Device AI

Beyond local AI agents, RTX Spark targets gamers and creators who need dense compute in a portable machine. NVIDIA says RTX Spark PCs can run AAA games at 1440p above 100 frames per second with ray tracing, DLSS and Reflex enabled, echoing the company’s push to keep high-end gaming alongside AI features. For creators, the superchip supports 90GB-plus 3D scenes, 12K 4:2:2 video editing, and 4K AI video generation, all handled locally through unified memory and Blackwell decoding. Adobe is rearchitecting Photoshop and Premiere to take advantage of RTX Spark, claiming up to 2x faster AI and effects performance across editing workflows. Other tools like Blender, Blackmagic software and ComfyUI are being optimised too. This convergence means the same hardware that runs petaflop-scale AI can also accelerate ray-traced graphics and heavy editing timelines without round-tripping to the cloud.

A Shift in How Consumers Access and Control AI

RTX Spark marks a broader change in consumer AI accessibility: powerful models and agents move from distant data centres to personal devices. Instead of depending on streaming responses from cloud services, users can host sizeable models locally, with latency measured in device cycles rather than network hops. NVIDIA frames this as a reinvention of the PC, where “for forty years, you launched apps… With RTX Spark and Microsoft Windows, you ask — and the PC does the work.” On-device AI processing aligns with rising concerns about data privacy and service reliability, since sensitive documents or creative projects never need to leave the machine for many tasks. At the same time, the superchip keeps gaming and traditional applications first-class citizens. For consumers, RTX Spark signals a future where AI assistants, content creation and gaming all live side by side on the same Windows PC, under the user’s direct control.