Distributed FastMNMF uses block-diagonal spatial covariance matrices to run matrix inversions within each subarray, cutting computational cost?

Distributed FastMNMF uses block-diagonal spatial covariance matrices to run matrix inversions within each subarray, cutting computational cost.

In fixed-geometry simulations, the method ran faster than full-array FastMNMF and outperformed single-subarray FastMNMF in source-to-distortion ratio?

In fixed-geometry simulations, the method ran faster than full-array FastMNMF and outperformed single-subarray FastMNMF in source-to-distortion ratio.

It successfully separated five sources using only four-microphone subarrays, a scenario where single-subarray approaches fail?

It successfully separated five sources using only four-microphone subarrays, a scenario where single-subarray approaches fail.

Audio & Speech

Distributed FastMNMF slashes blind source separation compute by using block-diagonal covariances

arXiv eess.AS May 20, 2026

⚡Researchers achieve faster multichannel audio separation without losing accuracy across distributed mic arrays.

Deep Dive

Blind source separation (BSS) with distributed microphone arrays — such as arrays in smart rooms or sensor networks — typically requires either heavy computation (processing all microphones jointly) or poor performance (processing only one subarray). A new paper from Hirotaka Nishikori and colleagues at the University of Tokyo introduces distributed FastMNMF, a method that strikes a practical balance. The key innovation is enforcing a block-diagonal structure on the source spatial covariance matrices, which lets matrix inversions happen independently within each subarray. Meanwhile, the NMF-based source spectrogram model is shared across the entire array, aggregating activity information from all microphones while discarding inter-subarray covariance details.

In synchronized, noiseless simulations with fixed room and source geometry, distributed FastMNMF required less computation time than conventional full-array FastMNMF and achieved higher average source-to-distortion ratios than single-subarray FastMNMF. Critically, it worked in a five-source scenario where each four-microphone subarray was locally underdetermined — something a single-subarray approach cannot handle. The trade-off is that it ignores phase relationships between subarrays, but the results suggest this cost is acceptable for many practical applications. The paper is available on arXiv (2605.19388) and has implications for smart assistants, teleconferencing, and acoustic monitoring over large areas.

Key Points

Distributed FastMNMF uses block-diagonal spatial covariance matrices to run matrix inversions within each subarray, cutting computational cost.
In fixed-geometry simulations, the method ran faster than full-array FastMNMF and outperformed single-subarray FastMNMF in source-to-distortion ratio.
It successfully separated five sources using only four-microphone subarrays, a scenario where single-subarray approaches fail.

Why It Matters

Scales blind source separation to large distributed microphone arrays without exploding compute costs.

Read Original Article

Distributed FastMNMF slashes blind source separation compute by using block-diagonal covariances

Why It Matters

Related Articles

🚀 Stay Ahead in AI