Online blind reverberation time estimation using CRNNs

Deng S, Mack W, Habets E (2020)


Publication Type: Conference contribution

Publication year: 2020

Publisher: International Speech Communication Association

Book Volume: 2020-October

Page Range: 5061-5065

Conference Proceedings Title: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH

Event location: Shanghai, CHN

DOI: 10.21437/Interspeech.2020-2156

Abstract

The reverberation time, T60, is an important acoustic parameter in speech and acoustic signal processing. Often, the T60 is unknown and blind estimation from a single-channel measurement is required. State-of-the-art T60 estimation is achieved by a convolutional neural network (CNN) which maps a feature representation of the speech to the T60. The temporal input length of the CNN is fixed. Time-varying scenarios, e.g., robot audition, require continuous T60 estimation in an online fashion, which is computationally heavy using the CNN. We propose to use a convolutional recurrent neural network (CRNN) for blind T60 estimation as it combines the parametric efficiency of CNNs with the online estimation of recurrent neural networks and, in contrast to CNNs, can process time-sequences of variable length. We evaluated the proposed CRNN on the Acoustic Characterization of Environments Challenge dataset for different input lengths. Our proposed method outperforms the state-of-the-art CNN approach even for shorter inputs at the cost of more trainable parameters.
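
The online, variable-length processing described in the abstract can be illustrated with a minimal sketch. This is not the authors' architecture. The class name `TinyCRNN`, its layer sizes, the random untrained weights, the per-frame convolution along the frequency axis, and the plain tanh recurrence (standing in for a trained recurrent layer) are all assumptions made for illustration. The sketch only shows the structural point: a convolutional front end feeds a recurrent state that is updated once per incoming frame, so an estimate is available after every frame and the input may have any length.

```python
import math
import random


class TinyCRNN:
    """Toy, untrained CRNN-style estimator (hypothetical, illustrative only)."""

    def __init__(self, n_bins, kernel_size=3, hidden_size=4, seed=0):
        rng = random.Random(seed)
        # Random weights stand in for trained parameters (assumption).
        self.kernel = [rng.uniform(-0.5, 0.5) for _ in range(kernel_size)]
        n_conv = n_bins - kernel_size + 1
        self.w_in = [[rng.uniform(-0.5, 0.5) for _ in range(n_conv)]
                     for _ in range(hidden_size)]
        self.w_rec = [[rng.uniform(-0.5, 0.5) for _ in range(hidden_size)]
                      for _ in range(hidden_size)]
        self.w_out = [rng.uniform(-0.5, 0.5) for _ in range(hidden_size)]
        self.state = [0.0] * hidden_size

    def reset(self):
        """Clear the recurrent state before a new recording."""
        self.state = [0.0] * len(self.state)

    def _conv(self, frame):
        # 1-D convolution along frequency followed by ReLU.
        k = len(self.kernel)
        return [max(0.0, sum(self.kernel[j] * frame[i + j] for j in range(k)))
                for i in range(len(frame) - k + 1)]

    def step(self, frame):
        """Consume one feature frame and return the current scalar output."""
        feats = self._conv(frame)
        new_state = []
        for w_i, w_r in zip(self.w_in, self.w_rec):
            a = sum(w * f for w, f in zip(w_i, feats))
            a += sum(w * h for w, h in zip(w_r, self.state))
            new_state.append(math.tanh(a))
        self.state = new_state
        return sum(w * h for w, h in zip(self.w_out, self.state))

    def run(self, frames):
        """Process a whole sequence of any length, one output per frame."""
        self.reset()
        return [self.step(f) for f in frames]


# Random numbers stand in for spectral feature frames (not real audio).
rng = random.Random(1)
frames = [[rng.random() for _ in range(8)] for _ in range(20)]
model = TinyCRNN(n_bins=8)
estimates = model.run(frames)
```

Because the state is carried from frame to frame, calling `step` on frames as they arrive yields the same outputs as `run` on the complete sequence, whereas a fixed-length CNN would have to re-process a full window for every new estimate.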


How to cite

APA:

Deng, S., Mack, W., & Habets, E. (2020). Online blind reverberation time estimation using CRNNs. In Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH (pp. 5061-5065). Shanghai, CHN: International Speech Communication Association.

MLA:

Deng, Shuwen, Wolfgang Mack, and Emanuël Habets. "Online blind reverberation time estimation using CRNNs." Proceedings of the 21st Annual Conference of the International Speech Communication Association, INTERSPEECH 2020, Shanghai, CHN: International Speech Communication Association, 2020. 5061-5065.
