Making machines understand us in reverberant rooms: Robustness against reverberation for automatic speech recognition

Journal article
(Original article)


Publication Details

Author(s): Yoshioka T, Sehr A, Delcroix M, Kinoshita K, Maas R, Nakatani T, Kellermann W
Journal: IEEE Signal Processing Magazine
Publisher: Institute of Electrical and Electronics Engineers
Publication year: 2012
Volume: 29
Journal issue: 6
Page range: 114-126
ISSN: 1053-5888


Abstract


Speech recognition technology has left the research laboratory and is increasingly coming into practical use, enabling a wide spectrum of innovative and exciting voice-driven applications that are radically changing our way of accessing digital services and information. Most of today's applications still require a microphone located near the talker. However, almost all of these applications would benefit from distant-talking speech capturing, where talkers are able to speak at some distance from the microphones without the encumbrance of handheld or body-worn equipment [1]. For example, applications such as meeting speech recognition, automatic annotation of consumer-generated videos, speech-to-speech translation in teleconferencing, and hands-free interfaces for controlling consumer products, like interactive TV, will greatly benefit from distant-talking operation. Furthermore, for a number of unexplored but important applications, distant microphones are a prerequisite. This means that distant-talking speech recognition technology is essential for extending the availability of speech recognizers as well as enhancing the convenience of existing speech recognition applications. © 2012 IEEE.



FAU Authors / FAU Editors

Kellermann, Walter Prof. Dr.-Ing.
Professur für Nachrichtentechnik
Maas, Roland Dr.-Ing.
Lehrstuhl für Multimediakommunikation und Signalverarbeitung


External institutions
Nippon Telegraph and Telephone (NTT) / 日本電信電話株式会社
Telefonaktiebolaget L. M. Ericsson


How to cite

APA:
Yoshioka, T., Sehr, A., Delcroix, M., Kinoshita, K., Maas, R., Nakatani, T., & Kellermann, W. (2012). Making machines understand us in reverberant rooms: Robustness against reverberation for automatic speech recognition. IEEE Signal Processing Magazine, 29(6), 114-126. https://dx.doi.org/10.1109/MSP.2012.2205029

MLA:
Yoshioka, Takuya, et al. "Making machines understand us in reverberant rooms: Robustness against reverberation for automatic speech recognition." IEEE Signal Processing Magazine 29.6 (2012): 114-126.

Last updated on 2018-10-08 at 08:55