Application of automatic speech recognition to quantitative assessment of tracheoesophageal speech with different signal quality

Haderlein T, Riedhammer KT, Nöth E, Toy H, Schuster M, Eysholdt U, Hornegger J, Rosanowski F (2009)


Publication Type: Journal article, Original article

Publication year: 2009

Journal: Folia Phoniatrica et Logopaedica

Original Authors: Haderlein T., Riedhammer K., Nöth E., Toy H., Schuster M., Eysholdt U., Hornegger J., Rosanowski F.

Publisher: Karger

Volume: 61

Page Range: 12-17

Journal Issue: 1

DOI: 10.1159/000187620

Abstract

Tracheoesophageal voice is state-of-the-art in voice rehabilitation after laryngectomy. Intelligibility on a telephone is an important evaluation criterion as it is a crucial part of social life. An objective measure of intelligibility when talking on a telephone is desirable in the field of postlaryngectomy speech therapy and its evaluation. Patients and Methods: Based upon successful earlier studies with broadband speech, an automatic speech recognition (ASR) system was applied to 41 recordings of postlaryngectomy patients. Recordings were available in different signal qualities; quality was the crucial criterion for this study. Results: Compared to the intelligibility rating of 5 human experts, the ASR system had a correlation coefficient of r = -0.87 and Krippendorff's α of 0.65 when broadband speech was processed. The rater group alone achieved α = 0.66. With the test recordings in telephone quality, the system reached r = -0.79 and α = 0.67. Conclusion: For medical purposes, a comprehensive diagnostic approach to (substitute) voice has to cover both subjective and objective tests. An automatic recognition system such as the one proposed in this study can be used for objective intelligibility rating with results comparable to those of human experts. This holds for broadband speech as well as for automatic evaluation via telephone. Copyright © 2008 S. Karger AG, Basel.

How to cite

APA:

Haderlein, T., Riedhammer, K.T., Nöth, E., Toy, H., Schuster, M., Eysholdt, U.,... Rosanowski, F. (2009). Application of automatic speech recognition to quantitative assessment of tracheoesophageal speech with different signal quality. Folia Phoniatrica Et Logopaedica, 61(1), 12-17. https://doi.org/10.1159/000187620

MLA:

Haderlein, Tino, et al. "Application of automatic speech recognition to quantitative assessment of tracheoesophageal speech with different signal quality." Folia Phoniatrica Et Logopaedica 61.1 (2009): 12-17.