M = Syntax + Prosody: A syntactic-prosodic labelling scheme for large spontaneous speech databases

Batliner A, Kompe R, KIESSLING A, Mast M, Niemann H, Nöth E (1998)


Publication Status: Published

Publication Type: Journal article, Original article

Publication year: 1998

Journal

Publisher: Elsevier

Book Volume: 25

Pages Range: 193-222

Journal Issue: 4

URI: http://www5.informatik.uni-erlangen.de/Forschung/Publikationen/1998/Batliner98-MSP.pdf

Abstract

In automatic speech understanding, division of continuous running speech into syntactic chunks is a great problem. Syntactic boundaries are often marked by prosodic means. For the training of statistical models for prosodic boundaries large databases are necessary. For the German Verbmobil (VM) project (automatic speech-to-speech translation), we developed a syntactic-prosodic labelling scheme where different types of syntactic boundaries are labelled for a large spontaneous speech corpus. This labelling scheme is presented and compared with other labelling schemes for perceptual-prosodic, syntactic, and dialogue act boundaries. Interlabeller consistencies and estimation of effort needed are discussed. We compare the results of classifiers (multi-layer perceptrons (MLPs) and n-gram language models) trained on these syntactic-prosodic boundary labels with classifiers trained on perceptual-prosodic and pure syntactic labels. The main advantage of the rough syntactic-prosodic labels presented in this paper is that large amounts of data can be labelled with relatively little effort. The classifiers trained with these labels turned out to be superior with respect to purely prosodic or syntactic labelling schemes, yielding recognition rates of up to 96% for the two-class-problem 'boundary versus no boundary'. The use of boundary information leads to a marked improvement in the syntactic processing of the VM system. (C) 1998 Elsevier Science B.V. All rights reserved.

Authors with CRIS profile

How to cite

APA:

Batliner, A., Kompe, R., KIESSLING, A., Mast, M., Niemann, H., & Nöth, E. (1998). M = Syntax + Prosody: A syntactic-prosodic labelling scheme for large spontaneous speech databases. Speech Communication, 25(4), 193-222.

MLA:

Batliner, Anton, et al. "M = Syntax + Prosody: A syntactic-prosodic labelling scheme for large spontaneous speech databases." Speech Communication 25.4 (1998): 193-222.

BibTeX: Download