Haderlein T, Stemmer G, Nöth E, Haderlein T (2003)
Publication Status: Published
Publication Type: Conference contribution, Conference Contribution
Publication year: 2003
Publisher: Springer-Verlag
City/Town: Berlin
Book Volume: 2807
Pages Range: 173-180
Conference Proceedings Title: Proceedings on the 6th International Conference on Text, Speech, Dialogue - TSD 2003
Event location: Ceske Budejovice
URI: https://www.scopus.com/inward/record.url?partnerID=HzOxMe3b&scp=9444285489&origin=inward
One of the goals of the EMBASSI project is the creation of a speech interface between a user and a TV set or VCR. The interface should allow spontaneous speech recorded by microphones far away from the speaker. This paper describes experiments evaluating the robustness of a speech recognizer against reverberation. For this purpose a speech corpus was recorded with several different distortion types under real-life conditions. On these data the recognition results for reverberated signals using μ-law companded features were compared to an MFCC baseline system. Trained with clear speech, the word accuracy for the μ-law features on highly reverberated signals was 3 percent points better than the baseline result.
APA:
Haderlein, T., Stemmer, G., Nöth, E., & Haderlein, T. (2003). Speech recognition with μ-law companded features on reverberated signals. In Matouzsek V.; Mautner P. (Eds.), Proceedings on the 6th International Conference on Text, Speech, Dialogue - TSD 2003 (pp. 173-180). Ceske Budejovice, CZ: Berlin: Springer-Verlag.
MLA:
Haderlein, Tino, et al. "Speech recognition with μ-law companded features on reverberated signals." Proceedings of the 6th International Conference on Text, Speech, Dialogue - TSD 2003, Ceske Budejovice Ed. Matouzsek V.; Mautner P., Berlin: Springer-Verlag, 2003. 173-180.
BibTeX: Download