Vulnerability detection AI: Mythos’ power and false positives

What Mythos Is: A New Kind of Vulnerability Detection AI

Mythos is a frontier vulnerability detection AI that scans live code and infrastructure to identify high-risk security flaws, chaining multi-step exploit paths in ways earlier tools could not, while still generating false positives that security teams must triage before patching. In Anthropic’s Project Glasswing tests, Mythos searched infrastructure code and core software stacks, surfacing more than 10,000 high-risk or critical vulnerabilities in under a month. Partners reported dramatic gains in code vulnerability scanning speed: Cloudflare alone found over 2,000 bugs in its core infrastructure, with 400 rated critical or high risk, and Mozilla identified 271 security bugs in a new Firefox release, roughly ten times more than with earlier AI systems. These numbers position Mythos security model output as both a breakthrough and a practical challenge, since sheer volume does not equal usable, verified fixes.

Project Glasswing’s Numbers: Scale, Speed, and the Human Bottleneck

Glasswing’s early results show Mythos scanning infrastructure code and open source projects at unusual scale. According to Anthropic, the model has examined more than 1,000 open source projects and flagged 6,202 bugs as high or critical severity, contributing to the broader total of over 10,000 serious findings across Glasswing testing. Independent evaluations add weight: the UK AI Safety Institute observed Mythos performing an entire multi‑stage hack in sandbox conditions, while XBOW reported that the system outperformed other agents at locating hidden web vulnerabilities. Yet these gains expose a new constraint. Automated discovery now outpaces human response, as teams must confirm issues, design patches, run tests, and roll out safe updates. Anthropic notes that Mythos Preview alone has led to 530 reported bugs, with only 75 patched so far, turning post-detection workflows into the new choke point for defenders.

False Positive Rates: When High Volume Becomes Noise

Mythos’ strength in code vulnerability scanning comes with a measurable false positive cost. Anthropic says it sent 28% of the model’s high or critical findings—1,752 bugs—to six independent security firms. Those firms reported a 9.4% false positive rate and validated 62.4% as genuinely high or critical. By industry standards, that false positive rate is acceptable, but Mythos’ scale changes the math: thousands of alerts mean hundreds of wrong leads and many more ambiguous edge cases. Cloudflare’s Chief Security Officer, Grant Bourzikas, warned that “hedged findings vastly outnumber the solid ones” and that this bias is “ruinous” for a triage queue. Because Mythos is probabilistic, repeated scans can yield different answers, so security teams face an investigation burden where every speculative “could in theory” issue consumes scarce analyst time, undermining the apparent efficiency gains.

Operational Reality: Balancing Detection Power and Investigation Overhead

For defenders, Mythos illustrates the tension between more detection and manageable workloads. On one hand, it uncovers high-stakes bugs such as the WolfSSL issue CVE-2026-5194, rated CVSS 9.1 and linked to certificate forgery risk, proving the model can find serious flaws that manual reviews might miss. On the other, every false or low‑value alert adds investigation overhead, raising the risk that teams drown in noise. Mythos is not a simple scanner: its ability to chain weaknesses and propose multi-step exploits pushes it closer to an automated security analyst. That power demands disciplined triage. Anthropic’s partnerships with the Open Source Security Foundation’s Alpha‑Omega project and vendors like Cisco show an emerging ecosystem focused on prioritization, workflow automation, and shared frameworks, so that vulnerability detection AI adds net value instead of overwhelming already stretched incident queues.

How Mythos Compares to Other Security LLMs

Compared with earlier security-focused LLMs, Mythos stands out on both capability and risk. Independent testers like XBOW found it “by far superior” at finding obscure web exploits, and Glasswing participants report about ten times more bugs than older AI tools in some environments. Cloudflare and Mozilla’s results suggest Mythos security model outputs are richer and more varied, ranging from straightforward misconfigurations to complex chains involving multiple components. Yet the model’s hallucinations and hedged predictions underline its limits as a drop-in replacement for human expertise. Unlike traditional static analyzers, Mythos can generate multi-step exploit reasoning but can also overstate risk or misinterpret edge cases. For now, the model looks most valuable as a high-powered assistant: it amplifies skilled security teams, accelerates code vulnerability scanning, and reshapes bug discovery workflows, while still needing strong human review and guardrails to keep false positive rates tolerable.