Batliner A, Kompe R, Kiessling A, Mast M, Niemann H, Nöth E (1998)
Publication Status: Published
Publication Type: Journal article, Original article
Publication year: 1998
Publisher: Elsevier
Book Volume: 25
Pages Range: 193-222
Journal Issue: 4
URI: http://www5.informatik.uni-erlangen.de/Forschung/Publikationen/1998/Batliner98-MSP.pdf
In automatic speech understanding, division of continuous running speech into syntactic chunks is a great problem. Syntactic boundaries are often marked by prosodic means. For the training of statistical models for prosodic boundaries large databases are necessary. For the German Verbmobil (VM) project (automatic speech-to-speech translation), we developed a syntactic-prosodic labelling scheme where different types of syntactic boundaries are labelled for a large spontaneous speech corpus. This labelling scheme is presented and compared with other labelling schemes for perceptual-prosodic, syntactic, and dialogue act boundaries. Interlabeller consistencies and estimation of effort needed are discussed. We compare the results of classifiers (multi-layer perceptrons (MLPs) and n-gram language models) trained on these syntactic-prosodic boundary labels with classifiers trained on perceptual-prosodic and pure syntactic labels. The main advantage of the rough syntactic-prosodic labels presented in this paper is that large amounts of data can be labelled with relatively little effort. The classifiers trained with these labels turned out to be superior with respect to purely prosodic or syntactic labelling schemes, yielding recognition rates of up to 96% for the two-class problem 'boundary versus no boundary'. The use of boundary information leads to a marked improvement in the syntactic processing of the VM system. © 1998 Elsevier Science B.V. All rights reserved.
APA:
Batliner, A., Kompe, R., Kiessling, A., Mast, M., Niemann, H., & Nöth, E. (1998). M = Syntax + Prosody: A syntactic-prosodic labelling scheme for large spontaneous speech databases. Speech Communication, 25(4), 193-222.
MLA:
Batliner, Anton, et al. "M = Syntax + Prosody: A syntactic-prosodic labelling scheme for large spontaneous speech databases." Speech Communication 25.4 (1998): 193-222.