Hackers Used AI to Build the First Zero-Day Exploit Caught in the...

🔥 What Happened

Google caught something historic last week: the first confirmed AI-written zero-day exploit being prepped for a mass cyberattack in the wild. A crime ring teamed up to build a Python script that bypasses two-factor authentication (2FA) on a popular open-source admin tool — and it was almost certainly generated by an LLM.

Google Threat Intelligence Group (GTIG) disclosed the discovery on May 11 in a major report on AI-enabled threats. The exploit was intercepted before deployment, and Google worked with the affected vendor to patch the vulnerability quietly. But the implications are a 10-alarm fire for anyone who thought "AI hacking" was still theoretical.

The smoking gun? The Python script came straight out of textbook-LLM-characteristics: hallucinated CVSS scores, educational docstrings, clean ANSI color classes, and detailed help menus that no seasoned hacker would waste time writing.

🧠 Why This Matters

This is the line-crossing moment security researchers have been dreading. The theory was always: "AI will eventually help hackers find zero-days." That "eventually" arrived.

Here's what's different from previous AI-Augmented hacking:

Not just code completion. The AI didn't just suggest snippets — it found a logic flaw that fuzzers, static analyzers, and human reviewers all missed.
Zero-day quality. This wasn't a known CVE repackaged. It was a novel vulnerability in a widely deployed admin tool.
Planned for mass exploitation. Google says the crime group was gearing up for exactly that — a "mass vulnerability exploitation operation" — not a targeted strike.
2FA bypass. If you manage servers through any web-based admin panel, you need to care. The exploit let attackers waltz past your second factor.

If an LLM can find and weaponize one of these, it can find and weaponize dozens. This is a step-change in the attacker-defender timeline.

📊 Deep Dive

GTIG's report (based on Mandiant incident response data, Gemini telemetry, and proactive research) lays out exactly what happened:

The exploit: A Python script designed to bypass 2FA on a "popular open-source, web-based system administration tool." Google declined to name the tool — likely to give stragglers time to patch.

The AI fingerprints:

Hallucinated CVSS score — the LLM invented a severity rating that doesn't correspond to any real CVE database entry
Abundant educational docstrings — a telltale sign of LLM-generated code, like comments explaining what each function does
Clean, structured formatting — textbook Pythonic style with detailed help menus that no experienced exploit developer would include
The `_C` ANSI color class — a common LLM pattern for handling colored terminal output

The vulnerability type: A "high-level semantic logic flaw." Translation: it wasn't a memory corruption bug or SQL injection. The developer hardcoded a trust assumption that looked functional but was strategically broken from a security standpoint. Traditional fuzzers and static analyzers are designed to catch memory bugs and input sanitization failures — they completely miss this class of error.

Why LLMs are uniquely dangerous here: Frontier models can read source code and understand *intent*. They spot contradictions between "we enforce 2FA here" and "but here's a hardcoded exception that bypasses it." This is contextual reasoning — something no automated scanner does well. A human code reviewer *could* catch it, but they'd need to know what to look for.

The crime ring: Multiple cybercrime groups partnered for this operation — a worrying sign of specialization and resource-sharing in the underground economy.

⚠️ The Catch

A few important caveats:

This exploit requires valid credentials. You can't bypass 2FA on a system you can't log into. The attacker still needs to phish or steal usernames and passwords first. 2FA didn't become useless — just slightly less effective against a sophisticated AI-augmented attacker.

Google says Gemini wasn't used. They're confident the AI model behind this wasn't their own. That's good for Google, but irrelevant for everyone else — the capability exists regardless of which model produced it.

The vendor patched. Because Google disclosed responsibly, the window of active exploitation was closed before mass deployment. No confirmed victims. The crisis was averted — this time.

Unnamed tool. We don't know which admin tool was vulnerable. If it's something you run on your infrastructure, you may have already been patched without ever knowing the threat existed.

Still — "averted" is very different from "solved." The next one might not be caught before it's weaponized.

🎯 What Happens Next

This isn't a one-off incident. GTIG's report reveals an entire ecosystem of AI-enabled threat activity:

State-backed actors are already doing this at scale. Chinese group UNC2814 used persona-driven jailbreaking to probe TP-Link firmware for pre-auth RCE vulnerabilities. North Korea's APT45 sent "thousands of repetitive prompts" recursively analyzing CVEs and validating exploits. They're treating LLMs as a force multiplier — the AI equivalent of a sweatshop for vulnerability research.

Russian groups are deploying AI-generated polymorphic malware. Malware families like CANFAIL and LONGSTREAM use LLM-generated decoy code — coherent but inert functions — to camouflage their real payload. Think of it as digital camouflage: the AI generates thousands of lines of fake admin logic to hide the three lines that exfiltrate your data.

Android malware is going autonomous. The PROMPTSPY backdoor uses Gemini APIs to read your screen, capture biometric gestures (PINs, unlock patterns), and replay them autonomously. It can even block the "Uninstall" button by serving an invisible overlay. This isn't a human driving this malware — it's a screen-reading AI agent.

AI models themselves are being hoarded. Threat actors are building automated pipelines to create thousands of premium-tier AI accounts, cycling through trials, and renting out access. This infrastructure lets them bypass usage limits and maintain operational security — all while making the model providers foot the bill.

Expect to see more AI-crafted zero-days in the coming months. The barrier to entry just cratered.

🧩 Bigger Picture

This story isn't really about one Python script. It's about the collapse of a traditional assumption in cybersecurity: that finding zero-days requires rare, expensive human expertise.

What LLMs change:

Speed. What took a top researcher weeks now takes an AI hours or minutes
Coverage. LLMs spot logic flaws that every existing tool misses
Volume. A thousand parallel prompts can analyze a thousand different code paths simultaneously
Accessibility. You don't need a PhD in binary exploitation anymore

The old model of "patch faster than attackers can find bugs" already wasn't working. AI collapses timelines further.

But there's a flip side. Google's own Big Sleep agent uses AI to autonomously hunt vulnerabilities. Their CodeMender system uses Gemini to automatically patch them. The same technology that lowers the barrier for attackers also supercharges defenders — it's an arms race, not a surrender.

The question is whether defensive AI gets enough deployment, fast enough, ahead of the wave that's clearly coming.

For now: patch your admin tools, assume AI is already probing your infrastructure, and maybe reconsider trusting anything that has "educational docstrings" in its exploit code.

> Sources: Google Threat Intelligence Group (May 11, 2026 report — "Adversaries Leverage AI for Vulnerability Exploitation"), CNBC, Bloomberg, NYT, The Hacker News, BleepingComputer, The Register, SecurityWeek

Hackers Used AI to Build the First Zero-Day Exploit Caught in the...

🔥 What Happened

🧠 Why This Matters

📊 Deep Dive

⚠️ The Catch

🎯 What Happens Next

🧩 Bigger Picture

Jen-Yu Teng

Enjoying this? There's a lot more where that came from.

More from

The Ultimate Guide to Developer Documentation

Gigabyte Just Dropped a WOLED Gaming Monitor and It Changes Everything

Cerebras Surges 68% in Record-Breaking IPO: What It Means for the AI...