Model-based dereverberation in the logmelspec domain for robust distant-talking speech recognition

Conference contribution
(Conference Contribution)


Publication Details

Author(s): Sehr A, Maas R, Kellermann W
Publication year: 2010
Pages range: 4298-4301
ISBN: 978-1-4244-4295-9
ISSN: 1520-6149
Language: English


Abstract


The REMOS (REverberation MOdeling for Speech recognition) concept for reverberation-robust distant-talking speech recognition, introduced in [1] for melspectral features, is extended in this contribution to logarithmic melspectral (logmelspec) features. Based on a combined acoustic model consisting of a hidden Markov model network and a reverberation model, REMOS determines clean-speech and reverberation estimates during recognition by an inner optimization operation. A reformulation of this inner optimization problem for logmelspec features, allowing an efficient solution by nonlinear optimization algorithms, is derived in this paper so that an efficient implementation of REMOS for logmelspec features becomes possible. Connected digit recognition experiments show that the proposed REMOS implementation significantly outperforms reverberantly-trained HMMs in highly reverberant environments. ©2010 IEEE.


FAU Authors / FAU Editors

Kellermann, Walter Prof. Dr.-Ing.
Professur für Nachrichtentechnik
Maas, Roland
Lehrstuhl für Multimediakommunikation und Signalverarbeitung
Sehr, Armin Dr.-Ing.
Professur für Nachrichtentechnik


How to cite

APA:
Sehr, A., Maas, R., & Kellermann, W. (2010). Model-based dereverberation in the logmelspec domain for robust distant-talking speech recognition. In Proceedings of the 2010 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2010 (pp. 4298-4301). Dallas, TX, US.

MLA:
Sehr, Armin, Roland Maas, and Walter Kellermann. "Model-based dereverberation in the logmelspec domain for robust distant-talking speech recognition." Proceedings of the 2010 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2010, Dallas, TX 2010. 4298-4301.

BibTeX: 

Last updated on 2019-08-05 at 15:23