What RTX Spark Is and Why NVIDIA Calls This the Age of Agents
RTX Spark is NVIDIA’s new AI PC platform that combines a powerful GPU, CPU, and agent-ready software stack so personal computers can run on-device AI agents that understand context, call tools, and complete multi-step tasks with minimal user input. At Computex, Jensen Huang framed this as a shift from traditional applications to an agentic computing platform, where the main “user” of software is an AI agent rather than a person clicking menus. Instead of launching discrete apps, you give goals: draft a report, edit footage, or summarize a day’s work across services. A large language model sits inside a “harness” that connects to tools and distributed infrastructure, turning the PC into an active assistant. RTX Spark is the consumer face of this strategy, bridging NVIDIA’s data center systems and everyday Windows machines while keeping most interaction grounded in familiar PC form factors.

From Apps to AI Agents: Rebuilding PC Architecture
Jensen Huang describes an agent as a blend of model, harness, tools, and runtime, with the model doing the reasoning and the harness wiring everything together. This framing pulls the PC away from the old pattern of application code in an app window and toward an AI agent PC architecture that runs continuous workflows. The RTX Spark AI PC is built to keep that agent close to the user, on-device, instead of relying only on remote inference in the cloud. RTX hardware and CUDA libraries become skills the agent can call, while Windows remains the familiar shell. According to PCMag, the company’s argument is that “the PC stops being a tool you operate and becomes an assistant that operates tools on your behalf.” It is still early; many of these behaviors are roadmap-level rather than shipping features, but the underlying silicon and software stack are now aligned around agents.

Inside RTX Spark: Vera CPU, Nemotron Ultra, and Unified Silicon
Under the hood, RTX Spark draws directly from NVIDIA’s data center work, blending CPU, GPU, memory, and models into a single on-device AI computing platform. In the broader roadmap, the Vera CPU is described as explicitly designed for AI agents, with 88 custom Olympus ARM cores and LPDDR5X memory tuned for low latency and fast core-to-core communication. On the model side, NVIDIA Nemotron Ultra (also referred to as Nemotron 3 Ultra) is a large mixture-of-experts system built for agentic use, using a hybrid state space architecture to keep inference fast and affordable while matching high-end open models. In client form, the RTX Spark superchip pairs a Blackwell GPU with a Grace CPU in a 70‑billion‑transistor SoC, delivering up to 1 petaflop of FP4 AI performance and unified memory up to 128GB, which lets agents move across tasks without the usual GPU-to-CPU overhead.
How RTX Spark Changes Everyday PC Use
On the surface, RTX Spark laptops and desktops look like premium Windows machines. Their real shift is how users interact with them. Instead of treating each application as a separate island, the RTX Spark AI PC is designed so an on-device agent can coordinate across apps, files, and online services. You might describe a project, and the agent drafts documents, pulls data, edits media, and schedules tasks, calling traditional tools only when needed. Unified memory and high FP4 performance mean language models and other AI workloads can run locally, preserving responsiveness even when networks are slow. NVIDIA says RTX Spark performance is “in the same class as an RTX 5070 laptop GPU,” but the more important change is workload behavior: the SoC’s tight CPU–GPU coupling favors continuous, mixed AI tasks over short, bursty graphics spikes. The PC’s value shifts from raw frames per second to sustained, agent-led productivity.
Beyond the PC: A Shared Agentic Pattern for Physical AI
RTX Spark is only one node in a wider move toward physical AI built on the same agentic pattern. In the data center, Vera Rubin systems tie together dozens of Vera CPUs and Rubin GPUs at pod scale, forming what NVIDIA describes as AI supercomputers for running fleets of agents. The company’s DSX blueprint turns these into full “AI factories,” coordinating power, cooling, and networking around agent workloads. The key idea is that cars, robots, data centers, and PCs all share a common agent design: a model-driven brain, a harness wired to tools, and specialized hardware to keep latency and cost in check. That means an assistant on an RTX Spark AI PC can, in theory, speak the same language as agents in a warehouse robot or a cloud service. The PC becomes the personal endpoint in a broader agentic computing platform that spans from home desks to industrial racks.






