Exploring text-initial words, clusters and concgrams in a newspaper corpus

O'Donnell MB, Scott M, Mahlberg M, Hoey M (2012)

Publication Type: Journal article

Publication year: 2012

Journal

Corpus linguistics and linguistic theory Walter de Gruyter

Book Volume: 8

Pages Range: 73-101

Journal Issue: 1

DOI: 10.1515/cllt-2012-0004

Abstract

The notion of textual colligation predicts that certain lexical items have a tendency to occur at particular points in a text, i.e. the beginning or end of texts, paragraphs or sentences. This paper describes new corpus-based methods developed to identify the profile of words, clusters (n-grams) and concgrams (non-contiguous patterns in variant order) in terms of their most common textual locations. Groups of co-occurring text-initial items are then analyzed in terms of their discourse function in relation to theories of newspaper structure. This analysis illustrates how methods from corpus linguistics, when targeted to specific textual positions, can complement text-linguistic analyses. © 2012 Walter de Gruyter.

Authors with CRIS profile

Michaela Mahlberg

Involved external institutions

University of Michigan

United States (USA) (US) Aston University

United Kingdom (GB) University of Nottingham

United Kingdom (GB) The University of Liverpool

United Kingdom (GB)

How to cite

APA:

O'Donnell, M.B., Scott, M., Mahlberg, M., & Hoey, M. (2012). Exploring text-initial words, clusters and concgrams in a newspaper corpus. Corpus linguistics and linguistic theory, 8(1), 73-101. https://doi.org/10.1515/cllt-2012-0004

MLA:

O'Donnell, Matthew Brook, et al. "Exploring text-initial words, clusters and concgrams in a newspaper corpus." Corpus linguistics and linguistic theory 8.1 (2012): 73-101.

BibTeX: Download