Stability AI's Stable Audio 3.0 generates 6-minute songs with open weights
New models more than double track length; small and medium versions ship with open weights
Stability AI, known for Stable Diffusion, has unveiled Stable Audio 3.0, a new family of audio models that significantly expands its music generation capabilities. The lineup includes four models: small SFX (459M parameters), small (459M), medium (1.4B), and large (2.7B). The small models can generate up to two minutes of sound effects or music and are compact enough for on-device use. The medium and large models can produce full compositions of up to 6 minutes 20 seconds while maintaining musical structure and melodic tone, which is more than double the length offered by Stable Audio 2.0 in 2024. It is also roughly eight times the 47-second limit of the company's previous open release, Stable Audio Open.
Stability AI is releasing the small SFX, small, and medium models with open weights, allowing anyone to use and modify them freely. The large model, however, is accessible only via API or self-hosted paid services, and companies with more than $1 million in revenue must obtain an enterprise license. To address copyright concerns, the company has built these models on fully licensed data, including through partnerships with Warner Music Group and Universal Music Group. Stability AI has also hired Ethan Kaplan, former chief digital officer at Universal Audio and Fender, to lead its professional music product suite, signaling a move toward serving professional musicians. The launch positions Stability AI against competitors such as Google and ElevenLabs, while it aims to avoid the kind of legal challenges faced by Suno and Udio.
- Four models: small SFX (459M), small (459M), medium (1.4B), large (2.7B) – small models generate up to 2 min, medium/large up to 6 min 20 sec.
- Small, small SFX, and medium are open-weight; large only via API or self-hosting with revenue-based licensing.
- Built on licensed data from Warner Music Group and Universal Music Group; hired ex-Universal/Fender exec Ethan Kaplan for professional products.
Why It Matters
Open-weight long-form music generation gives creators and developers a powerful, licensed alternative to closed AI music tools.