AI vulnerability detection: Mythos’s promise and limits

What Mythos AI and Project Glasswing Reveal About Modern Security

AI vulnerability detection refers to using machine learning models to scan software and infrastructure for security flaws, prioritise likely exploits, and help defenders address bugs faster than traditional manual review or rule-based tools. Anthropic’s Mythos AI model, tested through Project Glasswing, is a clear example of this shift. In less than a month, Mythos surfaced more than 10,000 high-risk or critical vulnerabilities across core software, according to Anthropic’s early results. Partners reported dramatic gains: one infrastructure provider saw over 2,000 bugs flagged, with 400 rated high or critical, while a browser maker identified 271 security bugs in a single release. These results show how automated code scanning can scale far beyond human capacity, but they also expose a new problem: the security bottleneck is less about finding vulnerabilities and more about verifying, prioritising, and patching them in a controlled way.

An Impressive Vulnerability Haul with Hidden Costs

Anthropic says Mythos has scanned more than 1,000 open source projects and classified 6,202 bugs as high or critical severity, making it one of the most visible examples of AI vulnerability detection applied to live codebases. This sits alongside partner findings that push the total bug count from Glasswing testing above 10,000 serious issues. Such scale highlights how automated code scanning can uncover latent weaknesses in operating systems, browsers, and security libraries. However, volume alone does not translate to safer systems. Each flagged issue must be reproduced, understood, and ranked against existing workloads. Mythos can even chain multi-step exploits and supply proofs of concept, which helps security teams but also increases the depth of investigation needed per alert. The result is a flood of data that risks overwhelming teams if their triage processes and tooling are not ready for AI-driven discovery rates.

False Positives, Hallucinations, and the Trust Problem

Mythos’s raw detection power is tempered by concerns around false positives security teams cannot ignore. Anthropic passed 28% of its high or critical findings—1,752 bugs—to six independent security firms. Those reviewers reported a 9.4% false positive rate and confirmed 62.4% as genuinely high or critical issues. In relative terms that rate is acceptable, but at Mythos’s scale it still produces hundreds of wrong or overstated alerts. Cloudflare’s chief security officer warned that when you “ask a model to find bugs, it will find them, whether the code has any or not,” resulting in hedged findings that clog triage queues. Because Mythos remains a probabilistic model, repeating the same request can yield different answers, undermining consistent workflows. For teams, this means that AI vulnerability detection is only as useful as the processes that sit on top of it: clear severity criteria, human review, and disciplined suppression of noisy or unverifiable findings.

From Finding Bugs to Fixing Them: The New Bottleneck

Project Glasswing’s results show that the main constraint in security is shifting. Anthropic notes that Mythos transforms discovery from a slow hunt for zero days into an almost continuous stream of bugs. Yet patching remains bounded by human capacity to fix, test, and deploy updates without breaking production systems. Early data underscores this: out of 530 bugs disclosed to open source maintainers, only 75 have been patched and 65 have public advisories so far. That lag reveals how AI-driven discovery can strain an already overloaded ecosystem. Glasswing also exposed that multi-stage exploits, like the WolfSSL vulnerability CVE-2026-5194, demand careful remediation to avoid side effects. Anthropic therefore recommends shorter development cycles and more automated patch pipelines, but such changes require significant organisational discipline and investment in secure deployment practices.

Designing AI-Assisted Security Workflows that Scale

For organisations, the lesson from Mythos is not to abandon AI vulnerability detection but to adopt it with realistic expectations and stronger workflows. Mythos appears most useful when paired with structured triage: prioritising bugs with clear proofs of concept, reserving human analysts for complex chains, and downgrading findings couched in speculative language. Partners and foundations are already experimenting with this model. Anthropic is working with the Open Source Security Foundation’s Alpha-Omega project to help maintainers manage AI-generated reports, while companies such as Cisco are open-sourcing security frameworks tailored for AI-assisted analysis. Security teams implementing automated code scanning should plan for verification overhead, not treat AI findings as ground truth. The strategic goal is to turn raw model output into actionable security insights, where fewer, better-validated fixes reach production faster than attackers can exploit newly discovered weaknesses.