A Simple LNRE Model for Random Character Sequences

Evert S (2004)


Publication Language: English

Publication Type: Conference contribution, Conference Contribution

Publication year: 2004

City/Town: Louvain-la-Neuve, Belgium

Pages Range: 411-422

Conference Proceedings Title: Proceedings of the 7èmes Journées Internationales d'Analyse Statistique des Données Textuelles (JADT 2004)

URI: http://purl.org/stefan.evert/PUB/Evert2004a.pdf

Abstract

This paper describes a population model for word frequency distributions based on the Zipf-Mandelbrot law, corresponding to the word frequency distribution induced by a random character sequence. The model, which has convenient analytical and numerical properties, is shown to be adequate for the description of language data extracted by automatic means from large text corpora. It can thus be used to study the problems faced by the statistical analysis of such data in the field of natural language processing.

Authors with CRIS profile

How to cite

APA:

Evert, S. (2004). A Simple LNRE Model for Random Character Sequences. In Proceedings of the 7èmes Journées Internationales d'Analyse Statistique des Données Textuelles (JADT 2004) (pp. 411-422). Louvain-la-Neuve, Belgium.

MLA:

Evert, Stephanie. "A Simple LNRE Model for Random Character Sequences." Proceedings of the Proceedings of the 7èmes Journées Internationales d'Analyse Statistique des Données Textuelles (JADT 2004) Louvain-la-Neuve, Belgium, 2004. 411-422.

BibTeX: Download