A new uncertainty decoding scheme for DNN-HMM hybrid systems with multichannel speech enhancement

Conference contribution


Publication Details

Author(s): Hümmer C, Schwarz A, Maas R, Barfuß H, Astudillo RF, Kellermann W
Publisher: Institute of Electrical and Electronics Engineers Inc.
Publication year: 2016
Page range: 5760-5764
ISBN: 9781479999880


Abstract


Uncertainty decoding combines a probabilistic feature description with the acoustic model of a speech recognition system. For DNN-HMM hybrid systems, this can be realized by averaging the DNN outputs produced by a finite set of feature samples (drawn from an estimated probability distribution). In this article, we employ this sampling approach in combination with a multi-microphone speech enhancement system. We propose a new strategy for generating feature samples from multichannel signals, based on modeling the spatial coherence estimates between different microphone pairs as realizations of a latent random variable. From each coherence estimate, a spectral enhancement gain is computed and an enhanced feature vector is obtained, thus producing a finite set of feature samples, of which we average the respective DNN outputs. In the experimental part, this new uncertainty decoding strategy is shown to consistently improve the recognition accuracy of a DNN-HMM hybrid system for the 8-channel REVERB Challenge task.
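The sampling step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: `toy_dnn`, its weights, and the sample values are hypothetical stand-ins, and the coherence-based enhancement that would produce one feature vector per microphone pair is assumed to have happened already. The sketch only shows the averaging of DNN outputs over a finite set of feature samples.

```python
import math
from typing import Callable, List, Sequence

Vector = List[float]


def softmax(logits: Sequence[float]) -> Vector:
    """Numerically stable softmax over a list of logits."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def toy_dnn(features: Sequence[float]) -> Vector:
    """Hypothetical stand-in for a trained DNN: one linear layer plus softmax.

    Maps a 2-dimensional feature vector to posteriors over 3 HMM states.
    The weights are made up for illustration.
    """
    weights = [[1.0, -0.5], [-1.0, 0.5], [0.2, 0.2]]
    logits = [sum(w * x for w, x in zip(row, features)) for row in weights]
    return softmax(logits)


def averaged_posteriors(
    dnn: Callable[[Sequence[float]], Vector],
    feature_samples: Sequence[Sequence[float]],
) -> Vector:
    """Average the DNN outputs over a finite set of feature samples."""
    if not feature_samples:
        raise ValueError("at least one feature sample is required")
    outputs = [dnn(sample) for sample in feature_samples]
    count = len(outputs)
    # zip(*outputs) groups the outputs by state, so each column is averaged.
    return [sum(column) / count for column in zip(*outputs)]


# Assumed input: one enhanced feature vector per microphone pair
# (made-up numbers, standing in for the coherence-based enhancement).
samples = [[0.9, 0.1], [1.1, -0.2], [0.8, 0.3]]
posteriors = averaged_posteriors(toy_dnn, samples)
```

Because each DNN output sums to one, the average is again a valid posterior distribution over states and can be passed to the HMM decoder in place of a single point-estimate output.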



FAU Authors / FAU Editors

Barfuß, Hendrik
Professur für Nachrichtentechnik
Hümmer, Christian
Professur für Nachrichtentechnik
Kellermann, Walter Prof. Dr.-Ing.
Professur für Nachrichtentechnik
Maas, Roland Dr.-Ing.
Lehrstuhl für Multimediakommunikation und Signalverarbeitung
Schwarz, Andreas
Professur für Nachrichtentechnik


External institutions
Instituto de Engenharia de Sistemas e Computadores, Investigação e Desenvolvimento em Lisboa (INESC-ID)


How to cite

APA:
Hümmer, C., Schwarz, A., Maas, R., Barfuß, H., Astudillo, R.F., & Kellermann, W. (2016). A new uncertainty decoding scheme for DNN-HMM hybrid systems with multichannel speech enhancement. In Proceedings of the 41st IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2016 (pp. 5760-5764). Shanghai, CN: Institute of Electrical and Electronics Engineers Inc.

MLA:
Hümmer, Christian, et al. "A new uncertainty decoding scheme for DNN-HMM hybrid systems with multichannel speech enhancement." Proceedings of the 41st IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2016, Shanghai: Institute of Electrical and Electronics Engineers Inc., 2016. 5760-5764.

Last updated on 2018-07-08 at 22:27