NVIDIA Microsoft Agentic AI Stack for Developers

What the Unified Agentic AI Stack Is and Why It Matters

The NVIDIA–Microsoft unified agentic AI stack is a cross-platform architecture that connects Windows RTX devices, local infrastructure and Azure cloud services so developers can build, govern and scale AI agents with consistent runtimes, tooling and data access across edge, on‑premises and cloud environments. Announced at Microsoft Build in a keynote conversation between Jensen Huang and Satya Nadella, the NVIDIA Microsoft partnership is about more than faster GPUs or new laptops. Its goal is to make Windows a managed endpoint for personal and enterprise agents, long‑running reasoning workloads and hybrid AI infrastructure. The stack combines fast hardware, secure runtimes such as NVIDIA OpenShell, and an accelerated data layer powered by Microsoft Fabric. For AI agent development teams, this unified approach promises a single path from prototype to production that spans RTX Spark PCs, DGX Station for Windows, Azure Local, Microsoft Foundry and beyond.

Inside NVIDIA and Microsoft’s Unified Agentic AI Stack

Windows RTX AI: From Personal Agents to Deskside Supercomputers

On the client side, Windows RTX AI takes shape through RTX Spark systems and DGX Station for Windows. RTX Spark laptops and small desktops are the first Windows PCs purpose‑built for personal agents, offering 1 petaflop of AI performance, up to 128 GB of unified memory and all‑day battery life while keeping full AI and graphics performance unplugged. Systems are expected from Microsoft Surface, ASUS, Dell, HP, Lenovo and MSI. For enterprise AI agent development, DGX Station for Windows brings a deskside AI supercomputer based on the NVIDIA GB300 Grace Blackwell Ultra Desktop Superchip, with up to 748 GB of coherent memory and 20 petaflops of FP4 performance to run models with up to 1 trillion parameters locally. According to NVIDIA, both RTX Spark and DGX Station for Windows run the OpenShell secure‑by‑design runtime for autonomous agents.

Secure Runtimes and Local Control: OpenShell, Azure Local and Hybrid Agents

Agentic AI depends on more than raw compute; it needs predictable security and governance. The NVIDIA Microsoft partnership extends NVIDIA OpenShell, a secure runtime for autonomous agents, across Windows and Azure. On Windows, OpenShell runs on top of Microsoft Execution Containers, a policy‑driven layer that controls what an agent can access at runtime, which is key when agents can call tools, access data or perform system actions. This same agentic AI stack is designed to reach into Azure Local and other on‑premises deployments, so organizations can run sensitive workloads close to their data while keeping a common execution model with the cloud. For developers, that means the same AI agent development patterns can span personal Windows machines, local RTX‑powered servers and Azure AI deployment targets, while security and access rules follow agents wherever they run.

Foundry, Nemotron and Fabric: Building Cloud-Scale Agentic Systems

In the cloud, the stack centers on Microsoft Foundry and Microsoft Fabric, both accelerated by NVIDIA GPUs. Foundry hosts NVIDIA, Anthropic and OpenAI models, plus Hermes special agents, so enterprises can compose systems of agents on Azure with built‑in identity and governance. NVIDIA Nemotron 3 Ultra, a new open frontier reasoning model for long‑running agents in coding, research and enterprise workflows, is available on Foundry managed compute, alongside Nemotron 3.5 ASR and Nemotron 3.5 Content Safety. NVIDIA’s Agent Toolkit and NemoClaw blueprints give developers an open‑source path to production agents that can call CUDA‑X skills such as cuDF, cuOpt, AI‑Q and NeMo. On the data side, Microsoft Fabric Data Warehouse now includes NVIDIA accelerated computing, with Microsoft reporting SQL execution up to 6x faster than a CPU‑powered baseline for high‑concurrency workloads, helping agents query data at interactive speeds.

What This Unified Stack Means for AI Agent Developers

For developers, the unified agentic AI stack means AI agent development no longer has to split between isolated local prototypes and separate cloud architectures. You can start by building a personal assistant on an RTX Spark Windows laptop, then scale that same agent pattern to DGX Station for Windows to handle frontier‑scale models, and finally deploy orchestrated special agents on Microsoft Foundry running Nemotron and other models. The same OpenShell runtime and execution‑container model help keep behavior consistent across environments. With NVIDIA‑accelerated Microsoft Fabric, agents can continuously query enterprise data warehouses without bottlenecks, while Azure Local offers a bridge for low‑latency, data‑sensitive workloads. In practice, this stack enables end‑to‑end workflows: design agents locally, test them against realistic data, then push them to Azure AI deployment targets where they run alongside managed models, identity, and governance controls.