Black AW, Bunnell HT, Dou Y, Muthukumar PK, Metze F, Perry D, Polzehl T, Prahallad KS, Steidl S, Vaughn C (2012)
Publication Language: English
Publication Type: Conference contribution
Publication year: 2012
Original Authors: Black Alan W., Bunnell H. Timothy, Dou Ying, Kumar Muthukumar Prasanna, Metze Florian, Perry Daniel, Polzehl Tim, Prahallad Kishore, Steidl Stefan, Vaughn Callie
Page Range: 4005-4008
Conference Proceedings Title: Proc. ICASSP 2012
URI: http://www5.informatik.uni-erlangen.de/Forschung/Publikationen/2012/Black12-AFF.pdf
This paper describes some of the results from the project entitled “New Parameterization for Emotional Speech Synthesis” held at the Summer 2011 JHU CLSP workshop. We describe experiments on how to use articulatory features as a meaningful intermediate representation for speech synthesis. This parameterization not only allows us to reproduce natural-sounding speech but also allows us to generate stylistically varying speech. We show methods for deriving articulatory features from speech, predicting articulatory features from text, and reconstructing natural-sounding speech from the predicted articulatory features. The methods were tested on clean speech databases in English and German, as well as on databases of speech varying in emotion and personality. The resulting speech was evaluated both objectively, using techniques normally used for emotion identification, and subjectively, using crowd-sourcing.
APA:
Black, A.W., Bunnell, H.T., Dou, Y., Muthukumar, P.K., Metze, F., Perry, D., ... Vaughn, C. (2012). Articulatory Features for Expressive Speech Synthesis. In IEEE (Eds.), Proc. ICASSP 2012 (pp. 4005-4008). Kyoto, JP.
MLA:
Black, Alan W., et al. "Articulatory Features for Expressive Speech Synthesis." Proceedings of the ICASSP 2012, Kyoto. Ed. IEEE, 2012. 4005-4008.