New single-microphone localization technique beats multi-mic methods
One microphone, no array needed: new AI leverages room echoes to pinpoint sound sources...
A new technique from Matthew Maciejewski, published at IEEE ICASSP 2026, solves audio source localization using just one microphone. Traditional methods rely on multiple microphones to triangulate a source's position using time-of-arrival differences. This work flips the problem: it uses the reverberation tail—the late part of sound reflections that depends only on the room, not on the exact source position. That invariance provides a stable reference to compare signals.
The method applies Weighted Prediction Error (WPE) dereverberation to estimate the late tail, then uses a probabilistic framework to compute the likelihood that two audio clips came from the same location. Testing on speaker diarization tasks shows accuracy rivaling multi-microphone systems. This means future smart speakers, meeting transcribers, or hearing aids could locate speakers without a complex array, using only a single mic and clever signal processing.
- Only one microphone needed for localization, reducing hardware cost and complexity.
- Leverages WPE dereverberation to estimate room-invariant late-tail reverberation.
- Outperforms or matches multi-microphone baselines on speaker diarization in both synthetic and real rooms.
Why It Matters
Enables precise audio source localization with minimal hardware, lowering barriers for smart assistants and conferencing systems.