Media & Culture

Enterprise GPU fleets see 5% utilization as AI infrastructure costs soar to 41%

Billions spent on GPUs, yet average utilization is just 5% — and costs keep rising.

Deep Dive

After ChatGPT launched, companies rushed to buy massive GPU fleets as AI demand exploded and compute was scarce. But now, the article suggests efficiency depends on more than just utilization—factors like scheduling, inference efficiency, routing, governance, energy access, and operational management matter too. The irony: technology meant to improve human lives faces a huge infrastructure inefficiency, with most of the budget spent on figuring out how to allocate hardware.

Key Points
  • Enterprise GPU fleets run at an average of 5% utilization, despite massive capital expenditure.
  • AI inference and hardware ownership costs increased to 41% of total spend, up from 34%.
  • Key inefficiencies include scheduling, routing, governance, and energy management, not just hardware scarcity.

Why It Matters

For tech leaders, this signals a critical need to optimize AI infrastructure before scaling further, or risk wasted investment.