TC-BiMamba: Trans-Chunk bidirectionally within BiMamba for unified streaming and non-streaming ASR
This architecture unifies real-time and offline speech recognition in a single model.
Deep Dive
Researchers introduced TC-BiMamba, a new architecture for unified streaming and non-streaming automatic speech recognition (ASR). It uses a novel Trans-Chunk mechanism with dynamic chunk size training, enabling a single model to handle both offline decoding and low-latency streaming. The method achieves a 1.3x training speedup, cuts training memory by 50%, and matches or outperforms prior models such as U2++ and LC-BiMamba with a smaller model.
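The summary above does not spell out how dynamic chunk size training works, so here is a minimal, generic sketch of the idea as popularized by U2-style unified models: during training, each batch either sees the full utterance (non-streaming mode) or is partitioned into fixed-size chunks (streaming mode), so one set of weights learns both behaviors. All names, probabilities, and ranges below are illustrative assumptions, not details from the TC-BiMamba paper.

```python
import random

def sample_chunk_size(max_len, full_context_prob=0.3):
    # Hypothetical sampling policy: with some probability, train in
    # full-context (non-streaming) mode; otherwise pick a random chunk
    # size to simulate low-latency streaming. The 0.3 probability and
    # the max_len // 4 cap are illustrative, not from the paper.
    if random.random() < full_context_prob:
        return max_len
    return random.randint(1, max(1, max_len // 4))

def split_into_chunks(frames, chunk_size):
    # Partition a frame sequence into consecutive chunks. In a
    # chunk-based bidirectional model, the backward pass would run
    # only within each chunk, keeping streaming latency bounded.
    return [frames[i:i + chunk_size] for i in range(0, len(frames), chunk_size)]

frames = list(range(10))               # stand-in for acoustic frames
chunks = split_into_chunks(frames, 4)
print(chunks)                          # [[0, 1, 2, 3], [4, 5, 6, 7], [8, 9]]
```

Because the chunk size varies across batches (and occasionally spans the whole utterance), the same trained model can later be run either chunk-by-chunk for streaming or on the full input for offline decoding.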
Why It Matters
It dramatically cuts the cost and complexity of developing high-performance, versatile speech recognition systems for apps and devices.