Automatic Detection of Parkinson's Disease from Compressed Speech Recordings

Conference contribution

Publication Details

Author(s): Orozco Arroyave J, Garcia N, Vargas-Bonilla JF, Nöth E
Publisher: Springer-verlag
Publication year: 2015
Volume: 9302
Pages range: 88-95
ISBN: 9783319240329
ISSN: 0302-9743


The impact of speech compression in the automatic classification of speakers with Parkinson's disease (PD) and healthy controls (HC) is tested. The set of codecs considered to compress the speech recordings includes G. 722, G. 226, GSM-EFR, AMR-WB, SILK, and Opus. A total of 100 speakers (50 with PD and 50 HC) are asked to read a text with 36 words. The recordings are compressed from bit-rates of 705.6 kbps down to 6.6 kbps. The method addressed to discriminate between speakers with PD and HC consists on the systematic segmentat ion of voiced and unvoiced speech frames. Each kind of frame is characterized independently. For voiced segments noise, perturbation, and cepstral features are considered. The unvoiced segments are characterized with Bark band energies and cepstral features. According to the results the codecs evaluated in this paper do not affect significantly the accuracy of the system, indicating that the addressed methodology could be used for the telemonitoring of PD patients through Internet or through the mobile communications network.

FAU Authors / FAU Editors

Nöth, Elmar Prof. Dr.-Ing.
Professur für Informatik (Mustererkennung)
Orozco Arroyave, Juan
Professur für Informatik (Mustererkennung)

How to cite

Orozco Arroyave, J., Garcia, N., Vargas-Bonilla, J.F., & Nöth, E. (2015). Automatic Detection of Parkinson's Disease from Compressed Speech Recordings. (pp. 88-95). Springer-verlag.

Orozco Arroyave, Juan, et al. "Automatic Detection of Parkinson's Disease from Compressed Speech Recordings." Proceedings of the 18th International Conference on Text, Speech and Dialogue, TSD 2015 Springer-verlag, 2015. 88-95.


Last updated on 2018-19-04 at 03:19