AI agent infrastructure for production-ready agents

From impressive demos to production-ready agents

Microsoft Foundry is an AI agent infrastructure platform that packages runtime, tools, memory, grounding, models, observability, and governance so enterprises can move agents from experimental demos to reliable, production-ready agents without rebuilding their stack from scratch. The recent Build announcements respond to a clear gap: agent demos are plentiful, but systems that survive real traffic, messy data, and compliance audits are rare. Microsoft frames Foundry as an "AI app and agent factory" where teams can build, ground, and govern agents with shared policy and observability. Instead of every team stitching together its own queues, storage, evaluation, and monitoring, Foundry adds a common platform layer. That platform is designed to support autonomous workflow systems and multi-agent orchestration on Azure, so enterprises can focus on business logic rather than plumbing and still meet reliability expectations for large-scale enterprise AI deployment.

Microsoft Foundry Turns AI Agents into Production Systems

A managed runtime that avoids rewrites

At the center of Foundry’s push toward production-ready agents is Foundry Agent Service, a managed runtime where each agent session runs in its own sandbox with dedicated compute, memory, and durable filesystem access. Hosted agents expose a stateful Responses API plus an invocations protocol for custom request and response formats, which lets teams keep existing orchestration while adopting the platform. According to Nick Brady, Foundry brings "runtime, tools, memory, grounding, models, observability, and governance" together instead of shipping a single new model endpoint. The same runtime supports long-running agents with durable state and routines, now in public preview, so teams can schedule agents for overnight ticket triage or daily reporting. Because Foundry supports frameworks such as Microsoft Agent Framework, GitHub Copilot SDK, LangGraph, and others without rewrites, it reduces deployment friction that has slowed enterprise AI deployment beyond proof-of-concept builds.

Toolboxes, memory, and knowledge as platform functions

Foundry treats tools, memory, and knowledge retrieval as shared platform functions rather than one-off code inside each agent. Toolboxes, now in public preview, give agents a single managed endpoint for tools, skills, Model Context Protocol clients, and enterprise integrations, so teams register tools once and let Foundry handle authentication, lifecycle, and governance. Skills are versioned in a project-scoped catalog and exposed as MCP resources, while tool search helps pick a small, relevant set per task instead of flooding the model. Memory in Foundry Agent Service now spans procedural, user, and session memory, all managed centrally with controls for retention and inspection. Microsoft reports that procedural memory, which helps agents learn how to do the work across runs, yields "7 to 14 percent absolute success rate gains at near baseline cost" on early Tau benchmarks, which directly supports more reliable autonomous workflow systems.

Governance, monitoring, and multi-agent orchestration

Governance and observability are where Foundry begins to look less like a model playground and more like an agent governance platform. Microsoft’s ASSERT framework turns written policies into measurable evaluations, generating targeted scenarios to catch safety and quality problems before agents reach production. This approach moves beyond static benchmarks toward policy-driven testing across frameworks such as LangChain, CrewAI, LightLLM, and OpenAI. Earlier releases introduced multi-agent orchestration, Agent to Agent APIs, and shared observability across agents, which underpin complex enterprise workflows where several agents cooperate. Toolboxes connect directly to Microsoft IQ services, including Work IQ, Fabric IQ, and Fabric’s data agent, so agents can query enterprise data and live web sources through a unified retrieval endpoint instead of custom connectors. Together, these capabilities make agent behavior inspectable, auditable, and more predictable under load—key hurdles for scalable enterprise AI deployment.

Shifting the AI battle to infrastructure and reliability

The Foundry announcements signal a shift in the AI platform competition from raw capability toward production-grade infrastructure. Where model quality once dominated buying decisions, enterprises now ask how quickly they can move from lab prototypes to governed, production-ready agents that align with security and compliance policies. Foundry’s tight integration with Azure services, Microsoft 365, and Microsoft Teams distribution channels means agents can be deployed directly into employee workflows with identity, permissions, and policy applied automatically. For organizations, the value lies in having a consistent runtime, shared tooling, and clear governance patterns rather than rebuilding these foundations for each autonomous workflow system. By focusing on reliability, governability, and multi-agent orchestration, Microsoft positions Foundry as a central AI agent infrastructure layer, betting that the next competitive edge comes from who can run AI agents safely at scale, not who has a slightly better base model.