Matching the acoustic model to front-end signal processing for ASR in noisy and reverberant environments

Conference contribution


Publication Details

Author(s): Maas R, Schwarz A, Reindl K, Zheng Y, Meier S, Sehr A, Kellermann W
Publication year: 2012
Pages range: 637-638
ISBN: 978-3-939296-04-1
Language: English


Abstract

Distant-talking automatic speech recognition (ASR) represents an extremely challenging task. The major reason is that unwanted additive interference and reverberation are picked up by the microphones besides the desired signal. A hands-free human-machine interface should therefore comprise a powerful acoustic preprocessing unit in line with a robust ASR back-end. However, since perfect speech enhancement cannot be achieved in practice, the output of the front-end will always contain some residual interference and some distortion of the desired signal. It is hence of decisive importance to carefully adjust the hidden Markov models (HMMs) of the ASR system to the front-end. In this contribution, we present a two-channel acoustic front-end based on blind source separation along with Wiener filtering. For the front-end integration into the ASR system, different  types of multi-style as well as adaptive training and HMM adaptation are investigated.


FAU Authors / FAU Editors

Kellermann, Walter Prof. Dr.-Ing.
Professur für Nachrichtentechnik
Maas, Roland
Lehrstuhl für Multimediakommunikation und Signalverarbeitung
Meier, Stefan
Professur für Nachrichtentechnik
Reindl, Klaus
Professur für Nachrichtentechnik
Schwarz, Andreas
Professur für Nachrichtentechnik
Sehr, Armin Dr.-Ing.
Professur für Nachrichtentechnik
Zheng, Yuanhang
Lehrstuhl für Multimediakommunikation und Signalverarbeitung


How to cite

APA:
Maas, R., Schwarz, A., Reindl, K., Zheng, Y., Meier, S., Sehr, A., & Kellermann, W. (2012). Matching the acoustic model to front-end signal processing for ASR in noisy and reverberant environments. In Proceedings of the Deutsche Jahrestagung für Akustik (DAGA) (pp. 637-638). Darmstadt, DE.

MLA:
Maas, Roland, et al. "Matching the acoustic model to front-end signal processing for ASR in noisy and reverberant environments." Proceedings of the Deutsche Jahrestagung für Akustik (DAGA), Darmstadt 2012. 637-638.

BibTeX: 

Last updated on 2019-03-06 at 07:14