On-device AI in Microsoft Edge for private browsing

What on-device AI in Microsoft Edge means for privacy

On-device AI in Microsoft Edge is the ability to run language, translation, and speech models directly on a user’s computer, so the browser processes data locally instead of sending it to remote cloud servers for inference. This shifts AI in the browser from being network-dependent to being a privacy-first AI option where sensitive text, prompts, and documents can stay on the machine. Microsoft is building this model-first approach around the Phi-4-mini model and its Prompt and Writing Assistance APIs, introduced at Build 2025, and is now extending it with new on-device AI browser features. The core idea is that Edge can host and manage small, efficient AI models that power writing help, local language processing, and speech recognition. Users keep more control over their data, while developers gain direct access to these Microsoft Edge AI models through JavaScript APIs, without relying on external AI services.

From Phi-4-mini to Aion: AI models that run on mainstream hardware

The Phi-4-mini model gave Edge a capable 4B-parameter language engine for the Prompt and Writing Assistance APIs, but its hardware requirements limited which PCs could run it well. To widen support, Microsoft is previewing the Aion-1.0-Instruct small language model in Edge’s Canary and Dev channels, starting with version 150.0.4070. Aion is smaller, faster, and more efficient than Phi-4-mini, and it can run on less capable GPUs and even through CPU inference when no GPU is available. According to Microsoft’s Edge developer team, Aion’s goal is to “expand support to significantly more devices” while keeping strong quality for typical web use cases. This makes the on-device AI browser story more credible: instead of being a feature reserved for high-end hardware, Edge can test how compact models behave across a broad base of everyday PCs before Aion’s planned open-source release on Hugging Face.

Local language processing: Detector, Translator, and speech APIs

Beyond general language models, Microsoft is adding task-specific local language processing to Edge. Version 148 introduces Language Detector and Translator APIs that run on-device and can be called from JavaScript by websites and browser extensions. These models can identify the language of user text and translate between language pairs, with support for more than 145 languages. Because the work happens locally, developers gain privacy-first AI translation with zero network dependency and no per-request translation costs. Microsoft notes that the Translator API can stream translated output as it is generated, which fits live interfaces such as chat or document editors. Edge is also experimenting with on-device speech recognition wired through the Web Speech API in Canary and Dev channels. Together, these APIs turn Edge into a platform where language detection, translation, and speech-to-text can run under the browser’s control without relying on external cloud providers.

Why local AI is a win for privacy, security, and user control

Running AI models directly in Microsoft Edge changes the privacy profile of browser assistants. When prompts, documents, and spoken input stay on the device, there is no need to send raw content to a remote server for analysis. That reduces exposure to interception, misrouting, or data retention outside the user’s control. Local processing also helps with latency and reliability, because responses do not depend on an always-on internet connection. For security-conscious organizations, on-device AI can reduce the number of external AI endpoints that must be vetted. At the same time, developers gain predictable APIs: they can call the Prompt API, Language Detector, Translator, or Web Speech interfaces knowing the browser manages model downloads, storage, and sandboxing. The tradeoff is that developers must handle capability checks and performance differences, since not every device will load the same model at the same speed or with the same resources.

A new developer landscape for AI-powered web apps

Edge’s on-device AI browser architecture gives web developers a new way to add intelligence to sites and extensions without shipping their own models or depending on external AI endpoints. With the Prompt and Writing Assistance APIs, they can build writing aids, summarizers, and chat-like helpers powered by the Phi-4-mini model today and Aion-1.0-Instruct in preview. The Language Detector and Translator APIs add local language processing for localization, multilingual interfaces, and assistive tools. Experimental on-device speech recognition extends this into voice-first experiences. Microsoft is testing Aion in Canary and Dev channels to see whether a compact language model can be delivered reliably to a wide range of consumer-grade devices. This mirrors browser competitors experimenting with their own small models and shows that AI in the browser is becoming about model management, privacy-first AI execution, and offline support, not only about connecting to large cloud-based systems.