LMELectures: a Multimedia Corpus of Acedemic Spoken English

Riedhammer KT, Gropp M, Bocklet T, Hönig FT, Nöth E, Steidl S (2013)


Publication Type: Conference contribution, Conference Contribution

Publication year: 2013

Original Authors: Riedhammer Korbinian, Gropp Martin, Bocklet Tobias, Hönig Florian, Nöth Elmar, Steidl Stefan

Publisher: CEUR-WS

Pages Range: 102-107

Conference Proceedings Title: Proceedings of the First Workshop on Speech, Language and Audio in Multimedia

Event location: Marseille FR

URI: http://www5.informatik.uni-erlangen.de/Forschung/Publikationen/2013/Riedhammer13-LAM.pdf

Abstract

This paper describes the acquisition, transcription and annotation of a multi-media corpus of academic spoken English, the LMELectures. It consists of two lecture series that were read in the summer term 2009 at the computer science department of the University of Erlangen- Nuremberg, covering topics in pattern analysis, machine learning and interventional medical image processing. In total, about 40 hours of high-definition audio and video of a single speaker was acquired in a constant recording environment. In addition to the recordings, the presentation slides are available in machine readable (PDF) format. The manual annotations include a suggested segmentation into speech turns and a complete manual transcription that was done using BLITZSCRIBE2, a new tool for the rapid transcription. For one lecture series, the lecturer assigned key words to each recordings; one recording of that series was further annotated with a list of ranked key phrases by five human annotators each. The corpus is available for non-commercial purpose upon request.

Authors with CRIS profile

How to cite

APA:

Riedhammer, K.T., Gropp, M., Bocklet, T., Hönig, F.T., Nöth, E., & Steidl, S. (2013). LMELectures: a Multimedia Corpus of Acedemic Spoken English. In ISCA SIG on Speech and Language in Multimedia IEEE SIG on Audio and Speech Processing in Multimedia (Eds.), Proceedings of the First Workshop on Speech, Language and Audio in Multimedia (pp. 102-107). Marseille, FR: CEUR-WS.

MLA:

Riedhammer, Korbinian Thomas, et al. "LMELectures: a Multimedia Corpus of Acedemic Spoken English." Proceedings of the SLAM 2013 - First Workshop on Speech, Language and Audio in Multimedia, Marseille Ed. ISCA SIG on Speech and Language in Multimedia IEEE SIG on Audio and Speech Processing in Multimedia, CEUR-WS, 2013. 102-107.

BibTeX: Download