MilikMilik

JetBrains and Microsoft Race to Build AI Coding Tools That Stay Local

JetBrains and Microsoft Race to Build AI Coding Tools That Stay Local
Interest|High-Quality Software

What Self-Hosted Coding AI Means for Developers

Self-hosted coding AI is the practice of running large language models for software development on infrastructure you control, rather than relying on third-party cloud APIs, giving teams tighter control over data privacy, latency, and operational behavior at the cost of managing their own compute resources. This shift sits at the intersection of open source coding models, local language models, and traditional cloud services. Developers want coding AI alternatives that can handle code completion, refactoring, and multi-step reasoning without sending proprietary repositories to external providers. At the same time, teams are balancing cost, performance, and compliance demands from security and legal stakeholders. As new models arrive, the choice is no longer between a single cloud giant and nothing; it is increasingly between fully hosted APIs, self-hosted AI development stacks, and hybrid setups that combine both.

JetBrains Mellum2: Open-Source Coding AI That Does Not Phone Home

JetBrains’ Mellum2 is an open source, 12B-parameter coding model designed for the infrastructure layer of agentic AI systems, from routing and retrieval pipelines to sub-agent tasks. The model’s Mixture-of-Experts design activates only 2.5B parameters per token, making it behave more like a smaller local language model while still offering higher capacity when needed. According to JetBrains’ technical report, “Mellum2 scores 78.4% in its thinking variant on EvalPlus, ahead of Qwen3.5-9B at 71.8% and Seed-Coder-8B at 73.8%.” Mellum2 ships with base, instruct, and thinking checkpoints under Apache 2.0, giving enterprises the option to keep coding workflows entirely inside their own networks. That makes it one of the most visible open source coding models built specifically for self-hosted AI development, targeting scenarios where sending code to Anthropic, OpenAI, or other external APIs is not acceptable.

Microsoft MAI-Thinking-1 and MAI-Code-1: Frontier Power, Platform Lock-In

Microsoft’s new MAI-Thinking-1 is its first large language model that the company says matches the performance of Anthropic’s Claude Opus 4.6, signaling a push to own more of its AI stack rather than rely solely on OpenAI. The model underpins Copilot, Bing, PowerPoint, and Azure Speech, and will be available in Foundry for developers. Alongside it, Microsoft introduced MAI-Code-1-Flash, a “vibe coding” model that converts written descriptions into application and website source code. Mustafa Suleyman described this as “a new era of AI that you control on your terms,” but that control is tied to Microsoft’s cloud. These models are powerful coding AI alternatives, yet they remain cloud-first, API-centric services. For teams focused on self-hosted AI development, they offer performance and integration but not the full operational ownership Mellum2 provides.

JetBrains and Microsoft Race to Build AI Coding Tools That Stay Local

Privacy, Control, and Performance: The Core Tradeoffs

The emerging split between open source coding models and proprietary cloud LLMs centers on tradeoffs in privacy, control, and performance. Self-hosted AI development with tools like Mellum2 lets enterprises keep source code and telemetry inside their own infrastructure, meet stricter compliance requirements, and fine-tune models for internal stacks. The cost is managing GPUs, scaling inference, and accepting that focal models trained mainly on code may trail broader models in general reasoning. Cloud-based systems such as MAI-Thinking-1 and MAI-Code-1-Flash offer cutting-edge performance, tight integration with productivity suites, and less operational overhead, but they depend on external APIs and platform terms. Developers are increasingly asking for coding AI alternatives that do not phone home, whether for latency, cost predictability, or legal constraints. The likely outcome is a hybrid future where local language models handle high-frequency, sensitive tasks and frontier cloud models cover broader, occasional workloads.

Milik earns a commission when you shop through our links, at no extra cost to you. Editorial content is independently selected by our team.

You May Also Like

Comments
Say something...
No comments yet. Be the first to share your thoughts!