Visual comparison of speaker groups

Wankerl S, Hönig FT, Batliner A, Orozco Arroyave JR, Nöth E (2015)

Publication Status: Published

Publication Type: Conference contribution

Publication year: 2015

Publisher: International Speech and Communication Association

Page Range: 2613-2614

URI: http://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=84959117949&origin=inward

Abstract

We describe a generic tool for visualising differences between two groups of speakers who produce a given word sequence. We do this by first time-aligning all recordings and then aggregating time-varying information within each group. This lets us display prototypical loudness and tempo contours, as well as spectrograms, together with information on variability and group effect size over time. An optional user-supplied segmentation (needed for only one of the recordings) can be used to relate local differences to individual phonemes. The system is validated with a group of speakers with Parkinson's disease and an age-matched control group. It will be provided to the community as an open-source software package.
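The aggregation step described in the abstract can be illustrated with a minimal sketch. It assumes the contours (for example, loudness) are already time-aligned and of equal length, and it uses Cohen's d with a pooled standard deviation as the effect size. The abstract does not say which effect-size measure or alignment method the tool uses, so both the measure and the function name `group_effect_over_time` are assumptions made for illustration only.

```python
from math import sqrt
from statistics import mean, stdev


def group_effect_over_time(group_a, group_b):
    """Per-frame mean, standard deviation, and Cohen's d for two groups.

    Each group is a list of contours, one per speaker. All contours are
    assumed to be time-aligned already, so they share the same length.
    Each group needs at least two speakers for a standard deviation.
    """
    n_frames = len(group_a[0])
    result = []
    for t in range(n_frames):
        # Collect the values of all speakers in each group at frame t.
        a = [contour[t] for contour in group_a]
        b = [contour[t] for contour in group_b]
        mean_a, mean_b = mean(a), mean(b)
        sd_a, sd_b = stdev(a), stdev(b)
        # Pooled standard deviation (assumption: Cohen's d as effect size).
        pooled = sqrt(
            ((len(a) - 1) * sd_a ** 2 + (len(b) - 1) * sd_b ** 2)
            / (len(a) + len(b) - 2)
        )
        # The effect size is undefined when neither group varies at all.
        d = (mean_a - mean_b) / pooled if pooled > 0 else float("nan")
        result.append(
            {"mean_a": mean_a, "mean_b": mean_b,
             "sd_a": sd_a, "sd_b": sd_b, "d": d}
        )
    return result
```

Plotting the per-frame means with a band of one standard deviation around them, and the value of d below, would give a display in the spirit of the one described, though not necessarily the one the authors implemented.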

Authors with CRIS profile

Sebastian Wankerl, Lehrstuhl für Informatik 8 (Theoretische Informatik)
Florian Thomas Hönig, Lehrstuhl für Informatik 14 (Bild- und Sprachverarbeitung)
Anton Batliner, Lehrstuhl für Informatik 14 (Bild- und Sprachverarbeitung)
Juan Rafael Orozco Arroyave, Lehrstuhl für Informatik 14 (Bild- und Sprachverarbeitung)
Elmar Nöth, Professur für Informatik (Mustererkennung)

How to cite

APA:

Wankerl, S., Hönig, F.T., Batliner, A., Orozco Arroyave, J.R., & Nöth, E. (2015). Visual comparison of speaker groups. In Proceedings of the 16th Annual Conference of the International Speech Communication Association, INTERSPEECH 2015 (pp. 2613-2614). International Speech and Communication Association.

MLA:

Wankerl, Sebastian, et al. "Visual comparison of speaker groups." Proceedings of the 16th Annual Conference of the International Speech Communication Association, INTERSPEECH 2015, International Speech and Communication Association, 2015, pp. 2613-2614.
