Strategies for modeling reverberant speech in the feature domain

Conference contribution


Publication Details

Author(s): Sehr A, Kellermann W
Publication year: 2009
Page range: 3725-3728
ISBN: 9781424423545


Abstract


The length of the room impulse response characterizing the acoustic path between speaker and microphone is significantly larger than the length of the analysis window used for feature extraction in automatic speech recognition (ASR) systems. Therefore, reverberation caused by multi-path propagation of sound waves from the speaker to distant-talking microphones has a dispersive effect on speech feature sequences. This dispersive effect causes a mismatch between the input speech and the acoustic models of the recognizer, usually trained on clean speech, and leads to a significant reduction of recognition performance. In this contribution, different strategies for obtaining acoustic models capturing the dispersive effect of reverberation are investigated in terms of modeling accuracy, flexibility with respect to changing reverberation conditions, effort for obtaining the reverberation representation and decoding complexity. ©2009 IEEE.



FAU Authors / FAU Editors

Kellermann, Walter Prof. Dr.-Ing.
Professur für Nachrichtentechnik
Sehr, Armin Dr.-Ing.
Professur für Nachrichtentechnik


How to cite

APA:
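Sehr, A., & Kellermann, W. (2009). Strategies for modeling reverberant speech in the feature domain. In Proceedings of the 2009 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2009 (pp. 3725-3728). Taipei, TW.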

MLA:
Sehr, Armin, and Walter Kellermann. "Strategies for modeling reverberant speech in the feature domain." Proceedings of the 2009 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2009, Taipei 2009. 3725-3728.

Last updated on 2018-12-17 at 20:50