SoproTTS v1.5: $100 voice cloning model runs 20x real-time on a CPU

r/MachineLearning February 13, 2026

⚡ This $100 side project could make professional voice cloning accessible to everyone.

Deep Dive

SoproTTS v1.5 is a new 135M-parameter text-to-speech model capable of zero-shot voice cloning. It was trained for only about $100 on a single GPU. The model runs at a Real-Time Factor (RTF) of 0.05, meaning it generates audio about 20 times faster than real time on a base MacBook M3 CPU, with about 250 ms of latency when streaming. The training code is promised for release soon, opening the door to wider experimentation.
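For readers unfamiliar with the metric, the relationship between Real-Time Factor and the "20x" figure is simple arithmetic. The sketch below is illustrative only: the function names are hypothetical and it does not use or represent any SoproTTS code. It assumes the common definition of RTF as synthesis time divided by the duration of the audio produced.

```python
def real_time_factor(synthesis_seconds: float, audio_seconds: float) -> float:
    """RTF = time spent generating / duration of the audio generated."""
    if audio_seconds <= 0:
        raise ValueError("audio_seconds must be positive")
    return synthesis_seconds / audio_seconds


def speedup_from_rtf(rtf: float) -> float:
    """How many times faster than real time a given RTF is (1 / RTF)."""
    if rtf <= 0:
        raise ValueError("rtf must be positive")
    return 1.0 / rtf


def synthesis_time(audio_seconds: float, rtf: float) -> float:
    """Estimated compute time needed to generate audio_seconds of speech."""
    return audio_seconds * rtf


if __name__ == "__main__":
    reported_rtf = 0.05  # figure quoted in the summary above
    print(f"Speedup: {speedup_from_rtf(reported_rtf):.1f}x real time")
    print(f"One minute of speech: ~{synthesis_time(60, reported_rtf):.1f} s of compute")
```

At the reported RTF of 0.05, a minute of speech would take roughly three seconds to generate. The 250 ms streaming latency is a separate measurement and cannot be derived from RTF alone.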

Why It Matters

It dramatically lowers the cost and hardware barriers to creating fast, high-quality synthetic voices.
