Developer Tools

Meta's AI support chatbot tricked into stealing Instagram accounts via prompt injection

Hackers used VPN and simple prompts to hijack accounts worth millions.

Deep Dive

Hackers successfully exploited Meta's AI support chatbot to hijack Instagram accounts by simply asking it to change the associated email address—a classic prompt injection attack. Using a VPN to spoof the target's location, attackers initiated a password reset and then prompted the chatbot to modify the account's email, effectively bypassing normal authentication. High-profile accounts compromised include the Barack Obama White House account and the Chief Master Sergeant of Space Force's account, both of which briefly posted pro-Iranian content. The exploit, active since February 2024, allowed hackers to flip valuable accounts like @hey and @jowo, which together had a gray-market valuation exceeding $1 million. Meta deployed an emergency patch on May 29, but researchers noted the technique had been circulating for months among Telegram groups.

The vulnerability exemplifies the 'confused deputy' problem, where an AI agent with elevated permissions can be tricked into misusing them. Unlike deterministic programs, the probabilistic nature of LLMs makes them susceptible to natural language manipulation. Crucially, accounts with multifactor authentication (MFA)—even SMS-based codes—remained immune. The incident underscores the danger of deploying AI agents with direct access to modify or delete critical user data without robust safeguards. Security experts recommend out-of-band verification, rate limiting, and activity logging for any AI-driven account changes. This breach serves as a stark warning: as companies rush to integrate AI support bots, prompt injection attacks could become the new SQL injection, threatening the integrity of user accounts at scale.

Key Points
  • Hackers used VPN spoofing and a simple prompt to trick Meta's AI chatbot into changing account emails, bypassing password resets.
  • High-value accounts like @hey and @jowo, worth over $1M combined, were stolen and resold on the gray market.
  • Accounts with MFA (including SMS) were protected; Meta patched the exploit on May 29 after months of active exploitation.

Why It Matters

Prompt injection attacks on AI agents with account-changing permissions create a new class of security vulnerabilities, threatening user data at scale.