Microsoft bets on open source AI to cut Copilot costs

Microsoft’s New Cost Focus: Swapping Frontier Models for Open Source

Microsoft’s shift toward open source AI models for Copilot Cowork is a strategy to replace costly proprietary systems like Claude and GPT with self-hosted alternatives that offer comparable performance while sharply reducing AI infrastructure spending for long-running, agentic workflows. Copilot Cowork, now generally available to Microsoft 365 Copilot customers, powers complex task chains across Outlook, Teams, and Excel, but that depth drives heavy compute usage and higher bills under usage-based pricing. As companies are billed in metered “Copilot Credits”, Microsoft has a clear incentive to make each unit of computation cheaper. That means asking where Anthropic and OpenAI models are essential, and where lighter, cost effective AI can do the job. The aim is not to abandon premium frontier APIs, but to reserve them for high-value tasks while routing routine work to cheaper, self hosted models on Azure.

Microsoft Turns to Open Source AI Models to Cut Copilot Costs

DeepSeek V4 and the Rise of Self-Hosted, Open Models

Reports indicate that Microsoft is exploring a fine-tuned, self-hosted version of DeepSeek V4 as a lower-cost option alongside Anthropic and OpenAI models for Copilot Cowork. DeepSeek V4 is one of several open source AI models being evaluated, and early results show strong performance at much lower prices, sometimes a fraction of the cost per token of frontier systems. Any DeepSeek-based option would run entirely on Azure, keeping customer data inside Microsoft’s existing security, compliance, and data residency framework. That framing matters for enterprises wary of external dependencies yet keen on cost effective AI. By fine-tuning and operating DeepSeek V4 and similar self hosted models in its own cloud, Microsoft can absorb more of the stack, trim AI infrastructure spending, and offer customers a model selection that aligns cost with the complexity and sensitivity of each task.

A Modular Copilot: Swapping Models Without Rewriting the System

Behind the scenes, Microsoft is redesigning Copilot Cowork around a clear separation between the orchestration harness and the model layer. According to TestingCatalog, this architecture lets Microsoft swap underlying open and open-weight models depending on task type, latency needs, and cost, while keeping the agentic workflow engine stable for users. In practice, the system could send demanding reasoning tasks to frontier APIs, delegate routine summarisation or formatting to cheaper self-hosted models, and, over time, move simple workloads to small local models as they mature. This modular approach gives enterprises more predictable behaviour and governance while letting Microsoft experiment aggressively with different open source AI models. It also turns model choice into a tunable business control: customers gain flexible pricing and quality options without needing to re-architect how staff work with Copilot Cowork every time the model mix changes.

Usage-Based Pricing and the Pressure to Control AI Spend

Microsoft’s move to usage-based pricing for Copilot Cowork, billed through Copilot Credits, highlights how agentic AI strains traditional flat-fee models. Long-running workflows that sort emails, prepare meetings, and compile reports can generate large volumes of tokens, making per-task economics central to adoption. Enterprises now see, in clearer terms, how AI infrastructure spending grows with real use, and many are pushing back against open-ended exposure to premium model costs. Offering open source and self hosted models as alternatives gives Microsoft a way to keep powerful features while easing that pressure. Companies may choose cheaper models for standard workloads and reserve high-end systems for specialised cases. This tiered approach lets AI teams match model choice to business value, which could be decisive for turning Copilot Cowork from a pilot project into a standard part of everyday work.

How Open Source Adoption Could Reshape the AI Vendor Landscape

Microsoft’s evaluation of open and open-weight models for Copilot Cowork also has strategic implications for the wider AI market. If DeepSeek V4 and similar open source AI models prove strong enough to power key features, internal teams building Microsoft’s MAI models face sharper benchmarks, and external proprietary providers must compete on price and performance, not exclusivity. TestingCatalog notes that open-model developments are still under active testing, not yet in production, but the direction is clear: a world where enterprises can swap models with minimal disruption reduces lock-in for any single vendor. As more workloads shift to cost effective AI and self hosted models, proprietary players like Anthropic and OpenAI may need to differentiate with clear quality gains, specialised capabilities, or friendlier pricing. For enterprises, this growing competition should translate into better value and more control over how AI powers their core workflows.