AI Safety

Small models also found the vulnerabilities that Mythos found

LessWrong AI April 12, 2026

⚡Open-source models costing $0.11/M tokens found the same vulnerabilities as the $18B-valued Mythos agent.

Deep Dive

A viral analysis from the LessWrong forum reveals that small, open-weight AI models can match the vulnerability detection capabilities of Anthropic's high-profile Mythos cybersecurity agent. Researchers isolated the specific code vulnerabilities highlighted in Anthropic's Mythos announcement and ran them through eight different smaller models. The results were striking: every model, including one with only 3.6 billion active parameters costing just $0.11 per million tokens, successfully detected Mythos's flagship FreeBSD exploit. A slightly larger 5.1B-parameter open model also recovered the core chain of a 27-year-old OpenBSD bug. This suggests that the initial 'finding' stage of vulnerability research may be more accessible than previously thought.

However, the community discussion quickly contextualized these findings. Commenters, including security experts, pointed out a critical distinction: while small models can identify potential vulnerabilities, they lack Mythos's sophisticated ability to chain multiple vulnerabilities together and develop functional, weaponized exploits. The smaller models also demonstrated a significantly higher false-positive rate—one test showed a 2/3 false positive rate on patched code—meaning their output requires extensive manual verification by skilled security professionals. In essence, they can find the needle in the haystack but cannot effectively build the tool to use it, whereas Mythos aims to deliver a directly actionable result for attackers or defenders.

Key Points

All 8 tested small models, including a 3.6B-parameter model, detected the same FreeBSD exploit as Anthropic's Mythos.
Small models showed a high false-positive rate (e.g., 2/3 on patched code), requiring heavy manual review, unlike Mythos's polished output.
Experts emphasize the gap between detection and exploitation; small models cannot chain vulnerabilities or write working exploits like Mythos.

Why It Matters

This tempers the hype around frontier AI for security, showing core detection may be commoditized while advanced reasoning remains a premium capability.

Read Original Article

Small models also found the vulnerabilities that Mythos found

Why It Matters

Stay Ahead in AI