Orozco Arroyave JR, Garcia N, Vargas-Bonilla JF, Nöth E (2015)
Publication Status: Published
Publication Type: Conference contribution
Publication year: 2015
Publisher: Springer-verlag
Book Volume: 9302
Pages Range: 88-95
ISBN: 9783319240329
DOI: 10.1007/978-3-319-24033-6_10
The impact of speech compression in the automatic classification of speakers with Parkinson's disease (PD) and healthy controls (HC) is tested. The set of codecs considered to compress the speech recordings includes G. 722, G. 226, GSM-EFR, AMR-WB, SILK, and Opus. A total of 100 speakers (50 with PD and 50 HC) are asked to read a text with 36 words. The recordings are compressed from bit-rates of 705.6 kbps down to 6.6 kbps. The method addressed to discriminate between speakers with PD and HC consists on the systematic segmentat ion of voiced and unvoiced speech frames. Each kind of frame is characterized independently. For voiced segments noise, perturbation, and cepstral features are considered. The unvoiced segments are characterized with Bark band energies and cepstral features. According to the results the codecs evaluated in this paper do not affect significantly the accuracy of the system, indicating that the addressed methodology could be used for the telemonitoring of PD patients through Internet or through the mobile communications network.
APA:
Orozco Arroyave, J.R., Garcia, N., Vargas-Bonilla, J.F., & Nöth, E. (2015). Automatic Detection of Parkinson's Disease from Compressed Speech Recordings. In Proceedings of the 18th International Conference on Text, Speech and Dialogue, TSD 2015 (pp. 88-95). Springer-verlag.
MLA:
Orozco Arroyave, Juan Rafael, et al. "Automatic Detection of Parkinson's Disease from Compressed Speech Recordings." Proceedings of the 18th International Conference on Text, Speech and Dialogue, TSD 2015 Springer-verlag, 2015. 88-95.
BibTeX: Download