Founder Denis Shilov's universal jailbreak prompt bypassed safety filters of ChatGPT, Claude and others, leading to 1.4M views and invitations to bug bounty programs at Anthropic, OpenAI, and Hugging Face?

Founder Denis Shilov's universal jailbreak prompt bypassed safety filters of ChatGPT, Claude and others, leading to 1.4M views and invitations to bug bounty programs at Anthropic, OpenAI, and Hugging Face.

White Circle's platform monitors AI inputs/outputs in real time via a single API, detecting harmful content, prompt injection, model drift, and abusive users across 150 languages?

White Circle's platform monitors AI inputs/outputs in real time via a single API, detecting harmful content, prompt injection, model drift, and abusive users across 150 languages.

Raised $11M seed from industry leaders including Romain Huet (OpenAI), Dirk Kingma (Anthropic), François Chollet (Keras), and others; platform is SOC 2 Type I/II and HIPAA compliant?

Raised $11M seed from industry leaders including Romain Huet (OpenAI), Dirk Kingma (Anthropic), François Chollet (Keras), and others; platform is SOC 2 Type I/II and HIPAA compliant.

Viral Wire

White Circle raises $11M seed for AI control platform after viral safety exposé

White Circle (via Business Wire) May 12, 2026

⚡Founder who bypassed ChatGPT, Claude safety filters in 2024 now builds enterprise guardrails.

Deep Dive

Paris-based White Circle has secured $11M in seed funding from a star-studded roster of AI industry figures, including Romain Huet (OpenAI), Dirk Kingma (ex-OpenAI, now Anthropic), Guillaume Lample (Mistral), Thomas Wolf (Hugging Face), and François Chollet (Keras). The platform was founded by Denis Shilov, who made headlines in 2024 when he bypassed safety filters of ChatGPT, Claude, and other major models with a single universal jailbreak prompt—amassing 1.4M views and earning invitations to Anthropic's bug bounty program. The funding will accelerate product development and global expansion as 'vibe coding' enables anyone to ship AI products, increasing the risk of models breaking safety rails.

The White Circle platform provides a single API for real-time monitoring of AI inputs and outputs, catching harmful content, hallucinations, prompt injection attacks, model drift, and abusive users. Custom policies allow automated enforcement actions like rate limiting or banning bad actors. Specific use cases include detecting a fintech model leaking sensitive customer data, blocking an AI agent from executing malicious instructions (e.g., deleting files), and flagging rising user frustration when a model fails. The system supports 150 languages, is SOC 2 Type I/II certified, and HIPAA compliant. In May 2025, White Circle also published CircleGuardBench, a benchmark for testing AI moderation models under real-world conditions. Head of Design Elena Iumagulova emphasized that the platform makes AI behavior visible to any team, technical or not.

Key Points

Founder Denis Shilov's universal jailbreak prompt bypassed safety filters of ChatGPT, Claude and others, leading to 1.4M views and invitations to bug bounty programs at Anthropic, OpenAI, and Hugging Face.
White Circle's platform monitors AI inputs/outputs in real time via a single API, detecting harmful content, prompt injection, model drift, and abusive users across 150 languages.
Raised $11M seed from industry leaders including Romain Huet (OpenAI), Dirk Kingma (Anthropic), François Chollet (Keras), and others; platform is SOC 2 Type I/II and HIPAA compliant.

Why It Matters

As AI deployment accelerates, White Circle gives enterprises a single platform to ensure safety, compliance, and accountability at scale.

Read Original Article

White Circle raises $11M seed for AI control platform after viral safety exposé

Why It Matters

Related Articles

🚀 Stay Ahead in AI