Haderlein T, Middag C, Hönig FT, Martens JP, Döllinger M, Schützenberger A, Nöth E (2015)
Publication Status: Published
Publication Type: Conference contribution
Publication year: 2015
Publisher: Springer-verlag
Book Volume: 9302
Pages Range: 165-173
DOI: 10.1007/978-3-319-24033-6_19
Language-independent and alignment-free phonological and phonemic features were applied for automatic age estimation based on voice and speech properties. 110 persons (average: 75.7 years) read the German version of the text "The North Wind and the Sun". For comparison with the automatic approach, five listeners estimated the speakers' age perceptually. Support Vector Regression and feature selection were used to compute the best model of aging. This model was found to use the following features: (a) the percentage of voiced frames, (b) eight phonological features, representing vowel height, nasality in consonants, turbulence, and position of the lips, and finally, (c) seven phonemic features. The latter features might be relevant due to altered articulation because of dentures. The mean absolute error between computed and chronological age was 5.2 years (RMSE: 7.0). It was 7.7 years (RMSE: 9.6) for an optimistic trivial estimator and 10.5 years (RMSE: 11.9) for the average listener.
APA:
Haderlein, T., Middag, C., Hönig, F.T., Martens, J.-P., Döllinger, M., Schützenberger, A., & Nöth, E. (2015). Language-Independent Age Estimation from Speech Using Phonological and Phonemic Features. (pp. 165-173). Springer-verlag.
MLA:
Haderlein, Tino, et al. "Language-Independent Age Estimation from Speech Using Phonological and Phonemic Features." Springer-verlag, 2015. 165-173.
BibTeX: Download