Learning-based acoustic source-microphone distance estimation using the coherent-to-diffuse power ratio

Brendel A, Kellermann W (2018)


Publication Language: English

Publication Type: Conference contribution

Publication year: 2018

Pages Range: 61-65

Event location: Calgary CA

ISBN: 978-1-5386-4658-8

DOI: 10.1109/icassp.2018.8462474

Abstract

We propose a method for estimating the distance between a sound source and a pair of recording microphones. The developed algorithm operates in the short-time Fourier transform domain and is based on estimates of the coherent-to-diffuse power ratio, which provides a measure for the amount of reverberation in each time-frequency bin. For a direct use of these estimates, precise knowledge on the room characteristics is necessary, which is in practice usually not available and hard to obtain. Therefore, we use a learning-based method, which adapts to the characteristics of the room in a training phase and estimates the source-microphone distance in a testing phase. The experiments comprise various setups with simulated and real data. It is shown that the proposed method generalizes well for different microphone positions and works robustly for different source signals, directions of arrival, reverberation times, and signal observation intervals. This leads to a high estimation accuracy at a low computational complexity with a small amount of training data.

Authors with CRIS profile

How to cite

APA:

Brendel, A., & Kellermann, W. (2018). Learning-based acoustic source-microphone distance estimation using the coherent-to-diffuse power ratio. In Proceedings of the IEEE International Conference on Acoustic, Speech and Signal Processing (ICASSP) (pp. 61-65). Calgary, CA.

MLA:

Brendel, Andreas, and Walter Kellermann. "Learning-based acoustic source-microphone distance estimation using the coherent-to-diffuse power ratio." Proceedings of the IEEE International Conference on Acoustic, Speech and Signal Processing (ICASSP), Calgary 2018. 61-65.

BibTeX: Download