Models & Releases

Unlocking large scale AI training networks with MRC (Multipath Reliable Connection)

OpenAI unveiled MRC, a new protocol that slashes network failures in massive supercomputing clusters.

Deep Dive

OpenAI released MRC (Multipath Reliable Connection), a new supercomputer networking protocol, through the Open Compute Project to boost resilience and performance in large-scale AI training clusters.

Key Points
  • MRC (Multipath Reliable Connection) is an open networking standard released via OCP for AI supercomputing.
  • Multipath routing avoids single points of failure, reducing packet loss and downtime in distributed training.
  • Improves GPU cluster utilization by up to 15-20% under high-congestion workloads.

Why It Matters

Faster, more reliable training for frontier AI models, cutting costs and enabling larger-scale experiments.