MilikMilik

AI Agents Are Rewriting How Fast Software Flaws Are Found

AI Agents Are Rewriting How Fast Software Flaws Are Found
Interest|High-Quality Software

What AI-Driven Vulnerability Discovery Means Now

AI vulnerability discovery is the use of autonomous software agents and machine-learning tools to inspect large codebases, generate proof-of-concept exploits, and detect security vulnerabilities, including latent zero-days, far faster and at far greater scale than human-led or traditional testing methods alone. In the last few months, these agents have shifted from research experiments to production tools that are changing how security teams work. They no longer only fuzz inputs or flag suspicious patterns; they now chain bugs into working exploits, triage findings, and in some cases even write patches. The result is a sharp rise in zero-day detection and automated threat detection. Vendors from database platforms to browsers are feeling pressure to ship fixes more often, rethink security patch cycles, and prepare for a future where attackers and defenders both have AI assistants tied directly into their development pipelines.

FFmpeg: 21 Zero-Days and Two-Decade-Old Bugs

FFmpeg, the media engine behind countless video tools and pipelines, became a test case for what autonomous AI agents can do. Security startup depthfirst pointed its agent at FFmpeg’s roughly 1.5 million lines of C and emerged with 21 confirmed zero-day vulnerabilities, each backed by a reproducible proof-of-concept input. Several FFmpeg vulnerabilities had been dormant for 15 to 20 years; one stack overflow in service-description-table code dates to 2003, meaning the flaw sat in production code for 23 years before AI found it. Most of these FFmpeg vulnerabilities are heap or stack overflows in parsers and demuxers, from the TS demuxer to the VP9 decoder. According to depthfirst, the scan cost around USD 1,000 (approx. RM4,600), showing how cheap and repeatable AI vulnerability discovery runs can be when focused on widely deployed open-source libraries.

AI Agents Are Rewriting How Fast Software Flaws Are Found

Chrome 149 and the New Volume of Security Fixes

Google’s Chrome 149 release shows the other side of the AI effect: scale. Chrome 149 shipped patches for 429 security bugs, the largest number in a single Chrome release so far, with over 100 rated critical or high severity. The worst of these, CVE-2026-10881 in the ANGLE graphics engine, is an out-of-bounds read and write that lets a crafted web page escape the sandbox and run code on the host. Google has not said these issues were directly found by AI, but it overhauled its bug bounty rules in April in response to a surge of AI-generated reports, asking for concise reproducers instead of long, auto-written narratives. The message is clear: automated threat detection is driving up report volume, while internal teams and external researchers use AI tools to probe complex subsystems faster than before.

Redis RCE: Autonomous AI Finds a Two-Year-Old Cloud-Scale Threat

In Redis, an autonomous AI tool named Xint Code uncovered CVE-2026-23479, a use-after-free bug in blocking-client code that enables remote code execution for authenticated users. The flaw appeared in Redis 7.2.0 after two separate commits and persisted unnoticed across all stable branches for over two years, despite multiple security reviews. The vulnerability sits in unblockClientOnKey(), which calls processCommandAndResetClient() and then continues using a client pointer that may already have been freed, creating a classic CWE-416 scenario. A published exploit chain leaks a heap address, frees a client, reuses the memory with a fake client structure, and abuses memory accounting to overwrite a function pointer in the Global Offset Table so strcasecmp() points to system(). Redis patched the bug on May 5 across five branches, underlining how fast patch cycles now need to move once an AI-discovered exploit is public and cloud-exposed.

Shorter Security Patch Cycles and What Teams Should Do

These cases show a pattern: AI is compressing the time between vulnerability introduction, zero-day detection, and public disclosure, which is forcing shorter security patch cycles. Chrome 149’s 429 fixes, depthfirst’s 21 FFmpeg zero-days, and Redis’s rapid response to CVE-2026-23479 all point to the same conclusion: the backlog of hidden flaws is being drained faster than ever. Defenders need to assume that any widely used codebase is now under continuous autonomous scrutiny. That means prioritizing auto-update mechanisms, treating dependency upgrades as security work, and cataloging every embedded copy of libraries like FFmpeg across containers, Python wheels, and appliances. For internet-facing services such as Redis, teams should pair timely patching with stronger defaults, including stricter ACLs and TLS, so that even if AI-driven automated threat detection surfaces a new bug, the blast radius is limited before patches land.

Milik earns a commission when you shop through our links, at no extra cost to you. Editorial content is independently selected by our team.

You May Also Like

Comments
Say something...
No comments yet. Be the first to share your thoughts!