Making machines understand us in reverberant rooms: Robustness against reverberation for automatic speech recognition

Yoshioka T, Sehr A, Delcroix M, Kinoshita K, Maas R, Nakatani T, Kellermann W (2012)

Publication Status: Published

Publication Type: Journal article, Original article

Publication year: 2012

Journal

IEEE Signal Processing Magazine

Publisher: Institute of Electrical and Electronics Engineers

Book Volume: 29

Page Range: 114-126

Article Number: 6296524

Journal Issue: 6

DOI: 10.1109/MSP.2012.2205029

Abstract

Speech recognition technology has left the research laboratory and is increasingly coming into practical use, enabling a wide spectrum of innovative and exciting voice-driven applications that are radically changing our way of accessing digital services and information. Most of today's applications still require a microphone located near the talker. However, almost all of these applications would benefit from distant-talking speech capturing, where talkers are able to speak at some distance from the microphones without the encumbrance of handheld or body-worn equipment [1]. For example, applications such as meeting speech recognition, automatic annotation of consumer-generated videos, speech-to-speech translation in teleconferencing, and hands-free interfaces for controlling consumer products, such as interactive TV, will greatly benefit from distant-talking operation. Furthermore, for a number of unexplored but important applications, distant microphones are a prerequisite. This means that distant-talking speech recognition technology is essential for extending the availability of speech recognizers as well as enhancing the convenience of existing speech recognition applications. © 2012 IEEE.

Authors with CRIS profile

Roland Maas, Lehrstuhl für Multimediakommunikation und Signalverarbeitung (LMS)

Walter Kellermann, Professur für Signalverarbeitung

Involved external institutions

Nippon Telegraph and Telephone (NTT) / 日本電信電話株式会社, Japan (JP)

Telefonaktiebolaget L. M. Ericsson, Sweden (SE)

How to cite

APA:

Yoshioka, T., Sehr, A., Delcroix, M., Kinoshita, K., Maas, R., Nakatani, T., & Kellermann, W. (2012). Making machines understand us in reverberant rooms: Robustness against reverberation for automatic speech recognition. IEEE Signal Processing Magazine, 29(6), 114-126. https://doi.org/10.1109/MSP.2012.2205029

MLA:

Yoshioka, Takuya, et al. "Making machines understand us in reverberant rooms: Robustness against reverberation for automatic speech recognition." IEEE Signal Processing Magazine 29.6 (2012): 114-126.
