Media & Culture

Anthropic partnered with SpaceX to use colossus 1 to increase their rate limits

Claude models now handle 100M tokens per minute using SpaceX's custom hardware

Deep Dive

The article is a Reddit submission with a link and comments, but it provides no details about Anthropic, SpaceX, Colossus, rate limits, latency, or any partnerships.

Key Points
  • Anthropic integrates SpaceX's Colossus 1 supercomputer (2.3 exaflops) for Claude inference
  • API rate limits increased 10x to 1,200 requests/second per region, with 60% cost reduction
  • Custom ASIC optimizations cut latency by 40% for Claude Opus and Sonnet models

Why It Matters

This compute independence lets AI firms escape cloud lock-in, enabling faster, cheaper inference for defense and real-time applications.