NVIDIA Microsoft Partnership for Agentic AI Deployment

What a Unified Agentic AI Stack Means for Developers

A unified stack for agentic AI deployment is an integrated set of hardware, models, runtimes and data services that lets developers design, run and scale autonomous AI agents consistently across personal devices, on‑premises infrastructure and cloud platforms without rewriting applications for each environment. NVIDIA and Microsoft are now pushing this idea from concept to reality. Announced during Microsoft Build with NVIDIA CEO Jensen Huang joining Microsoft CEO Satya Nadella’s keynote, the expanded partnership ties Windows, Azure Local infrastructure, Azure cloud services and development tools into one coordinated platform. The goal is to turn Windows PCs into managed endpoints for local agents while giving enterprises a matching stack in the data center and the cloud. That unified approach aims to cut fragmentation and reduce the integration effort that has slowed edge to cloud AI projects in many organizations.

NVIDIA and Microsoft Unite a Single Stack for Agentic AI

RTX Spark PCs: Personal Agentic AI on Windows Devices

On the client side, RTX Spark PCs are the entry point for edge to cloud AI in this stack. These Windows laptops and small desktops are built to run personal agents locally, with 1 petaflop of AI performance and up to 128GB of unified memory for on-device inference and long-running reasoning. Systems are expected from Microsoft Surface, ASUS, Dell, HP, Lenovo and MSI, with a Surface RTX Spark Dev Box aimed at developers who need 128GB of unified memory and a 100W thermal envelope for local model and agent workloads. According to NVIDIA’s Build announcement, RTX Spark systems bring “over 30 years of NVIDIA innovation, including CUDA, RTX, DLSS and TensorRT” into a Windows form factor, so developers can prototype and test agentic AI applications on the same kind of devices end users will carry.

DGX Station for Windows and Azure Local: From Desk to Data Center

For enterprise teams, the same NVIDIA Microsoft partnership extends agentic AI deployment from deskside hardware into Azure Local infrastructure. DGX Station for Windows introduces a deskside AI supercomputer based on the NVIDIA GB300 Grace Blackwell Ultra Desktop Superchip, with up to 748GB of coherent memory and 20 petaflops of FP4 performance, enough to run models of up to 1 trillion parameters locally. It supports Windows enterprise management and Linux AI toolchains through Windows Subsystem for Linux, so IT teams can manage it like any other Windows endpoint. In parallel, Azure Local and Foundry Local bring NVIDIA RTX PRO 6000 Blackwell Server Edition GPUs and Nemotron models into on-premises environments. Together, these systems let enterprises keep sensitive workloads on site while using the same agent architectures and tools that run in Azure cloud regions.

Foundry, Fabric and Open Models: Scaling Agents in the Cloud

In the cloud, Microsoft Foundry and Microsoft Fabric form the data and model backbone of the unified stack. Foundry hosts NVIDIA open models, Anthropic Claude, OpenAI models and Hermes special agents in the Foundry Agent Service, giving enterprises managed identity and governance for multi-model agent systems on Azure. NVIDIA Nemotron 3 Ultra, designed for long-running reasoning across coding, research and enterprise workflows, joins Nemotron 3.5 ASR and Nemotron 3.5 Content Safety so developers can mix frontier and local models by cost and quality. On the data side, Microsoft Fabric Data Warehouse now includes NVIDIA accelerated computing; internal benchmarks report SQL execution up to 6x faster than CPU baselines and up to 7x faster than three other leading cloud data warehouse providers, helping agents query and reason over live enterprise data without hitting performance bottlenecks.

OpenShell, GitHub Copilot and a Less Fragmented Future

Security and runtime control are central to making agentic AI deployable at scale. OpenShell, NVIDIA’s secure runtime for autonomous agents, is being brought to Windows on top of Microsoft Execution Containers, enforcing policies on what an agent can access at runtime across RTX Spark PCs and DGX Station for Windows. OpenShell also appears in GitHub Copilot, giving developers a consistent environment for building and testing production-grade agents. Tooling such as NVIDIA Agent Toolkit and NemoClaw blueprints on Foundry, plus CUDA‑X libraries like cuDF, cuOpt, AI‑Q and NeMo as domain-specific skills, help teams build agents that can reason, plan and act against real workloads. By tying Windows devices, Azure Local infrastructure, Foundry, Fabric and Copilot into one stack, the NVIDIA Microsoft partnership aims to replace today’s patchwork of tools with a single path from prototype to enterprise deployment.

NVIDIA and Microsoft Unite a Single Stack for Agentic AI

What a Unified Agentic AI Stack Means for Developers

RTX Spark PCs: Personal Agentic AI on Windows Devices

DGX Station for Windows and Azure Local: From Desk to Data Center

Foundry, Fabric and Open Models: Scaling Agents in the Cloud

OpenShell, GitHub Copilot and a Less Fragmented Future

You May Also Like