Open Source

Nvidia's Nemotron 3 Super is a bigger deal than you think

Nvidia's new model family offers 8B-, 40B-, and 405B-parameter models under a license that permits commercial use.

Deep Dive

Nvidia has unveiled the Nemotron 3 Super family, a significant move in the open-source AI landscape. The suite includes three models, Nemotron 3 8B, 40B, and the massive 405B, all trained on a dataset of 9 trillion tokens. Crucially, Nvidia is releasing the models under the Apache 2.0 license, which lets developers use, modify, and deploy them commercially without paying royalties, subject only to the license's attribution and notice requirements. This positions Nemotron as a direct, cost-effective competitor to proprietary models from companies like OpenAI and Anthropic.

Beyond the licensing, the models target demanding workloads. The 405B model is designed to rival leading closed-source models in reasoning, coding, and instruction following. Because the family spans three sizes, developers can match the model to the deployment, from edge devices (8B) to large-scale cloud inference (405B). The release also fits Nvidia's broader strategy of fueling the AI ecosystem: giving developers the software tools they need sustains demand for its industry-dominating GPU hardware.
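
To make the size trade-off concrete, here is a rough back-of-the-envelope sketch of how much memory the weights alone would need at common precisions. It assumes the parameter counts quoted above and the usual rule of thumb of parameters times bytes per parameter. The precision table and the weight_memory_gb helper are illustrative, not anything Nvidia has published, and real deployments also need room for the KV cache, activations, and runtime overhead.

```python
# Rough weight-only memory estimate: parameters x bytes per parameter.
# Ignores KV cache, activations, and runtime overhead, so real usage is higher.

# Assumed storage cost per parameter at common precisions (illustrative).
BYTES_PER_PARAM = {"fp16": 2.0, "int8": 1.0, "int4": 0.5}

# Parameter counts as quoted in the article, in billions.
MODEL_SIZES_B = {"8B": 8, "40B": 40, "405B": 405}


def weight_memory_gb(params_billion: float, precision: str) -> float:
    """Approximate weight memory in GB (1 GB = 1e9 bytes)."""
    total_bytes = params_billion * 1e9 * BYTES_PER_PARAM[precision]
    return total_bytes / 1e9


if __name__ == "__main__":
    for name, size in MODEL_SIZES_B.items():
        row = ", ".join(
            f"{p}: {weight_memory_gb(size, p):.1f} GB" for p in BYTES_PER_PARAM
        )
        print(f"{name} -> {row}")
```

By this estimate the 8B weights need about 16 GB at fp16, within reach of a single high-end GPU, while the 405B weights need about 810 GB at fp16 and so require a multi-GPU server even before any overhead is counted.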

Key Points
  • Three model sizes (8B, 40B, 405B parameters) trained on 9 trillion tokens.
  • Released under the Apache 2.0 license, which permits commercial use without licensing fees.
  • Designed to compete with top proprietary models on reasoning and coding tasks.

Why It Matters

The release broadens access to powerful AI, lowering costs for businesses and reducing their dependence on closed-source API providers.