system: OPERATIONAL
← back to all hacks
OFFENSIVE AI MEDIUM

Apple's May 2026 bulletin formally credits Claude on two macOS CVEs

On May 11, 2026, Apple's macOS Tahoe 26.5 advisory named Claude alongside its researchers on two CVEs — a kernel integer overflow and a WebKit use-after-free. AI-assisted vulnerability research is now in the official changelog.

2026-05-27 // 6 min affects: macos-tahoe-26.5, macos-sequoia-15.7.7, macos-sonoma-14.8.7, ios-18.7.9, ipados-18.7.9, safari-26.5

What is this?

On May 11, 2026, Apple published the security content of macOS Tahoe 26.5, iOS/iPadOS 18.7.9, macOS Sequoia 15.7.7, macOS Sonoma 14.8.7 and Safari 26.5. Two entries in that bulletin are unusual: their researcher credit names a model, not just a person.

Kernel — CVE-2026-28952: “An integer overflow was addressed with improved input validation. Calif.io in collaboration with Claude and Anthropic Research.”

WebKit — CVE-2026-28942 (Bugzilla 312180): “A use-after-free issue was addressed with improved memory management. Milad Nasr and Nicholas Carlini with Claude, Anthropic.”

Two CVEs in the same release cycle, both with Claude formally listed in the acknowledgements alongside the human researchers who used it. This is the line item we are covering here — not because the bugs themselves are exotic (an integer overflow and a UAF; the patches are described as input validation and memory-management fixes), but because the vendor changelog is now an artifact of AI-assisted vulnerability research. The kernel finding came out of Calif.io’s five-day exploit chain on Apple M5, built with help from Anthropic’s restricted Claude Mythos under Project Glasswing.

How it works

The “how” here is the workflow, not the exploit. No payloads are reproduced — the canonical references are Apple’s advisory text, the NVD entry for CVE-2026-28952, and Calif.io’s 55-page technical report delivered in person to Cupertino (9to5Mac coverage).

The shape of the collaboration, as described by the participants, looks like this:

Stage                        Human                        Model
---------------------------  ---------------------------  -----------------------------------
Surface mapping              Researcher picks target      Model summarises the attack surface
                             subsystem (kernel, WebKit)   from public source + symbols
Pattern recall               Researcher frames the bug    Model recognises the class (integer
                             class to look for            overflow, UAF) and points at
                                                          candidate code paths
Candidate triage             Researcher reviews hits      Model proposes minimal repros
Exploit chaining             Researcher composes the      Model proposes pivots and
                             primitive chain              auxiliary primitives
Report drafting              Researcher writes the        Model drafts sections, refines
                             write-up                     prose, cross-checks references
Vendor disclosure            Researcher delivers report   —
                             to vendor (in person, here)

Two things are worth noting about this division of labour. First, the bug classes are old — integer overflows and UAFs have been in every vulnerability taxonomy for thirty years. What the model contributed, per Calif.io’s own description, was speed of recognition in a target where the surface is large and the rewards for finding the right kernel path are concentrated. Second, the human in the loop is non-optional. The chain that turned the kernel finding into an actual exploit against Apple’s Memory Integrity Enforcement was assembled by experienced researchers; the model was a force multiplier on the discovery half, not an autonomous attacker.

The WebKit CVE-2026-28942 follows the same template, on a smaller scale. Milad Nasr and Nicholas Carlini — well-known names in adversarial ML research — used Claude to surface a use-after-free in WebKit’s process-crash handling and disclosed it to Apple, which addressed it in Safari 26.5 on May 13, 2026.

Why it matters

The bugs are patched. The story is downstream of that.

The first shift is provenance in the changelog. Vendor advisories have credited tools before (fuzzers, static analyzers, hardware probes), but always implicitly via the team that ran them. A line that reads “X with Claude, Anthropic” makes the model an explicit research contributor in a public, lawyer-reviewed document. For the next twelve months, expect more bulletins from large vendors to adopt the same form. If you maintain a CVE-tracking pipeline, your researcher-name parser is about to start seeing model names.

The second shift is time-to-finding compression. The Calif.io timeline — Bruce Dang spots the bugs on April 25, Dion Blazakis joins on April 27, a working exploit by May 1 — is five days from start to working chain for a class of bug that historically takes a senior team weeks. The relevant comparison is not “AI can do what humans do” but “the lead time between a class of bug becoming discoverable and the first credible exploit landing in a vendor’s queue is shrinking”. Patch SLAs for kernel and browser components need to absorb that, not just for Apple.

The third shift is access asymmetry. The model that did this work — Claude Mythos Preview — is not generally available; it is gated through Project Glasswing to roughly fifty vetted organisations. Apple is one of them. So is Mozilla, where the same model contributed to 271 latent vulnerabilities patched in Firefox 150. For defenders not in the program, the disclosure record is the only window into what frontier-tier offensive AI is actually capable of against your stack. Read it accordingly.

Defenses

There is nothing to “fix” here beyond installing the May 11 updates. The defensive playbook is about workflow, not patches.

  1. Patch and inventory. Confirm fleet-wide deployment of macOS Tahoe 26.5, macOS Sequoia 15.7.7, macOS Sonoma 14.8.7, iOS/iPadOS 18.7.9 and Safari 26.5. CVE-2026-28952 is a kernel integer overflow leading to unexpected system termination; CVE-2026-28942 is a WebKit UAF. Both are part of a 60+ CVE bundle in the May 11 release.

  2. Re-baseline your patch SLA. If your internal target for browser engine and kernel CVEs is still “30 days from disclosure”, that number was set when the rate of credible findings was lower. Bring kernel/browser SLAs in line with what AI-assisted vendor research is now producing — closer to 7-14 days on internet-facing assets, with an explicit out-of-cycle path for actively exploited bugs.

  3. Add model credits to your CVE pipeline schema. Most CVE ingestion code parses acknowledgements as free text. A normalised field for “AI tools cited” — Claude, Mythos, GPT-class, internal LLM — lets your team correlate which classes of bug are being surfaced by AI vs. classical methods, and where your own static-analysis stack is being out-recalled.

  4. Mirror the workflow on the defensive side. The Calif.io / Anthropic pattern (human direction, model pattern-matching, human chaining, human disclosure) is reproducible for blue-team uses: triaging your own code for unpatched analogues of disclosed CVEs, reviewing dependency upgrades for known bug classes, generating minimal repros from advisory text. None of this requires Mythos-class access; the same shape works with publicly available models on internal code.

  5. Threat-model “AI-assisted but unverified attacker”. Glasswing is a controlled-access program, but the underlying capability — pattern recall over a target’s source — is already partially available in public frontier models. Plan for the case where an unverified attacker assembles a comparable workflow against your stack, with weeks rather than days of lead time. The disclosure cadence you are now seeing in vendor changelogs is the defensive half of that distribution.

  6. Read the credits, not just the CVSS. A medium-severity entry credited to a model + researcher pair often signals a class of bug rather than a one-off. CVE-2026-28952 is “an integer overflow” — that pattern almost never lives alone in a codebase. Treat the next adjacent commits in the same subsystem as worth a re-review.

Status

ItemReferenceDateNotes
macOS Tahoe 26.5 advisoryApple2026-05-11Includes CVE-2026-28952, CVE-2026-28942
CVE-2026-28952 (Kernel)NVD2026-05-11Integer overflow, Calif.io with Claude, Anthropic Research
CVE-2026-28942 (WebKit)Apple / WebKit Bugzilla 3121802026-05-11 → 2026-05-13UAF, Milad Nasr & Nicholas Carlini with Claude, Anthropic
Calif.io technical write-up9to5Mac2026-05-145-day exploit chain on Apple M5 + MIE bypass
Project Glasswing contextLLM Hacking2026-05-26Restricted Mythos access for ~50 partners, Apple included
Companion releasesApple2026-05-11 → 2026-05-13iOS/iPadOS 18.7.9, macOS Sequoia 15.7.7, Sonoma 14.8.7, Safari 26.5

The right framing is not “an AI found Apple bugs” — that headline is already older than this advisory. It is “a vendor’s official security changelog now lists a model among the credited researchers, and the workflow that produced the finding is becoming reproducible at other large vendors”. The job for defenders is to stop being surprised when it happens to a product they ship.

Sources