50 Shades of Gray: Effect of the Color Scale for the Assessment of Speech Disorders

Pérez-Toro PA, Klumpp P, Vasquez-Correa JC, Schuster M, Nöth E, Orozco-Arroyave JR, Arias Vergara T (2022)


Publication Type: Conference contribution

Publication year: 2022

Journal

Publisher: Springer Science and Business Media Deutschland GmbH

Book Volume: 13502 LNAI

Pages Range: 352-363

Conference Proceedings Title: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

Event location: Brno, CZE

ISBN: 9783031162695

DOI: 10.1007/978-3-031-16270-1_29

Abstract

Spectrograms provide a visual representation of the time-frequency variations of a speech signal. Furthermore, the color scales can be used as a pre-processing normalization step. In this study, we investigated the suitability of using different color scales for the reconstruction of spectrograms together with bottleneck features extracted from Convolutional AutoEncoders (CAEs). We trained several CAEs considering different parameters such as the number of channels, wideband/narrowband spectrograms, and different color scales. Additionally, we tested the suitability of the proposed CAE architecture for the prediction of the severity of Parkinson’s Disease (PD) and for the nasality level in children with Cleft Lip and Palate (CLP). The results showed that it is possible to estimate the neurological state for PD with Spearman’s correlations of up to 0.71 using the Grayscale, and the nasality level in CLP with F-scores of up to 0.58 using the raw spectrogram. Although the color scales improved performance in some cases, it is not clear which color scale is the most suitable for the selected application, as we did not find significant differences in the results for each color scale.

Authors with CRIS profile

Involved external institutions

How to cite

APA:

Pérez-Toro, P.A., Klumpp, P., Vasquez-Correa, J.C., Schuster, M., Nöth, E., Orozco-Arroyave, J.R., & Arias Vergara, T. (2022). 50 Shades of Gray: Effect of the Color Scale for the Assessment of Speech Disorders. In Petr Sojka, Aleš Horák, Ivan Kopeček, Karel Pala (Eds.), Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (pp. 352-363). Brno, CZE: Springer Science and Business Media Deutschland GmbH.

MLA:

Pérez-Toro, Paula Andrea, et al. "50 Shades of Gray: Effect of the Color Scale for the Assessment of Speech Disorders." Proceedings of the 25th International Conference on Text, Speech, and Dialogue, TSD 2022, Brno, CZE Ed. Petr Sojka, Aleš Horák, Ivan Kopeček, Karel Pala, Springer Science and Business Media Deutschland GmbH, 2022. 352-363.

BibTeX: Download