The NITE XML Toolkit: data model and query language

Journal article
(Report)


Publication Details

Author(s): Carletta J, Evert S, Heid U, Kilgour J, Chen Y
Journal: Language Resources and Evaluation
Publication year: 2005
Volume: 39
Journal issue: 4
Pages range: 313-334
ISSN: 1574-020X
Language: English


Abstract


The NITE XML Toolkit (NXT) is open source software for working with language corpora, with particular strengths for multimodal and heavily cross-annotated data sets. In NXT, annotations are described by types and attribute value pairs, and can relate to signal via start and end times, to representations of the external environment, and to each other via either an arbitrary graph structure or a multi-rooted tree structure characterized by both temporal and structural orderings. Simple queries in NXT express variable bindings for n-tuples of objects, optionally constrained by type, and give a set of conditions on the n-tuples combined with boolean operators. The defined operators for the condition tests allow full access to the timing and structural properties of the data model. A complex query facility passes variable bindings from one query to another for filtering, returning a tree structure. In addition to describing NXT’s core data handling and search capabilities, we explain the stand-off XML data storage format that it employs and illustrate its use with examples from an early adopter of the technology.



FAU Authors / FAU Editors

Evert, Stefan Prof. Dr.
Lehrstuhl für Korpus- und Computerlinguistik


External institutions with authors

University of Edinburgh


How to cite

APA:
Carletta, J., Evert, S., Heid, U., Kilgour, J., & Chen, Y. (2005). The NITE XML Toolkit: data model and query language. Language Resources and Evaluation, 39(4), 313-334.

MLA:
Carletta, Jean, et al. "The NITE XML Toolkit: data model and query language." Language Resources and Evaluation 39.4 (2005): 313-334.

BibTeX: 

Last updated on 2018-11-08 at 00:10