Wankerl S, Hönig FT, Batliner A, Orozco Arroyave JR, Nöth E (2015)
Publication Status: Published
Publication Type: Conference contribution, Conference Contribution
Publication year: 2015
Publisher: International Speech and Communication Association
Pages Range: 2613-2614
URI: http://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=84959117949&origin=inward
We describe a generic tool for visualising differences between two groups of speakers who produce a given word sequence. We do this by first time-aligning all recordings and then aggregating time-varying information within each group. By that, we can display prototypical loudness and tempo contours, and also spectrograms, together with information on variability and group effect size over time. An optional user-supplied segmentation (just needed for one of the recordings) can be used to relate local differences to individual phonemes. The system is validated with a group of speakers with Parkinson's disease and an age-matched control group. It will be provided as an opensource software package to the community.
APA:
Wankerl, S., Hönig, F.T., Batliner, A., Orozco Arroyave, J.R., & Nöth, E. (2015). Visual comparison of speaker groups. In Proceedings of the 16th Annual Conference of the International Speech Communication Association, INTERSPEECH 2015 (pp. 2613-2614). International Speech and Communication Association.
MLA:
Wankerl, Sebastian, et al. "Visual comparison of speaker groups." Proceedings of the 16th Annual Conference of the International Speech Communication Association, INTERSPEECH 2015 International Speech and Communication Association, 2015. 2613-2614.
BibTeX: Download