Lehrstuhl für Korpus- und Computerlinguistik


The computational corpus linguistics group carries out foundational methodological research on the quantitative analysis of large text corpora. The algorithms and software tools developed by the group support innovative studies in the digital humanities and social sciences as well as practical applications in language technology. A particular focus lies on understanding cooccurrence phenomena and their application in corpus-based discourse analysis.

Bismarckstraße 6
91054 Erlangen

Research Fields

Collocations, multiword expressions and corpus-based discourse analysis
Corpus tools and language technology
Further research
Methodological foundations of corpus research and digital humanities

Related Project(s)

Go to first page Go to previous page 1 of 2 Go to next page Go to last page

RANT: Reconstructing Arguments from Noisy Text (DFG Priority Programme 1999: RATIO)
Prof. Dr. Stefan Evert
(01/01/2018 - 31/12/2020)

(KALLIMACHOS – Centre for digital editions and quantitative analysis at the University of Würzburg):
KALLIMACHOS II: Measures of linguistic complexity for literary stylometry in the KALLIMACHOS Centre for Digital Humanities
Prof. Dr. Stefan Evert
(01/10/2017 - 30/09/2019)

EFE: Exploring the “Fukushima Effect”: Attitudes and opinions towards nuclear power and renewable energy and the emergence of a transnational algorithmic public sphere
Prof. Dr. Stefan Evert
(01/01/2017 - 31/12/2019)

E-SPar: Efficient simulation experiments for large-scale parameter optimisation of machine learning approaches in natural language processing
Prof. Dr. Stefan Evert
(01/10/2016 - 30/09/2017)

Englisches Konstruktikon
Prof. Dr. Stefan Evert; Prof. Dr. Thomas Herbst

Publications (Download BibTeX)

Go to first page Go to previous page 5 of 5 Go to next page Go to last page

Evert, S., & Krenn, B. (2005). Using Small Random Samples for the Manual Evaluation of Statistical Association Measures. Computer Speech and Language, 19(4), 450-466.
Evert, S. (2004). Significance tests for the evaluation of ranking methods. In Proceedings of the 20th International Conference on Computational Linguistics (Coling 2004) (pp. 945-951). Geneva, Switzerland.
Evert, S. (2004). The Statistics of Word Cooccurrences: Word Pairs and Collocations (Dissertation).
Krenn, B., Evert, S., & Zinsmeister, H. (2004). Determining Intercoder Agreement for a Collocation Identification Task. In Proceedings of KONVENS 2004 (pp. 89-96). Vienna, Austria.
Evert, S. (2004). A Simple LNRE Model for Random Character Sequences. In Proceedings of the 7èmes Journées Internationales d'Analyse Statistique des Données Textuelles (JADT 2004) (pp. 411-422). Louvain-la-Neuve, Belgium.
Evert, S., Heid, U., & Berman, S. (2000). Searchable Metaspaces. In Proceedings of the EAGLES/ISLE Workshop on Metadata. Athens, Greece.
Heid, U., Evert, S., Docherty, V., Worsch, W., & Wermke, M. (2000). A data collection for semi-automatic corpus-based updating of dictionaries. In Heid U, Evert S, Lehmann E, Rohrer C (Eds.), Proceedings of the 9th EURALEX International Congress (pp. 183--195). Stuttgart, Germany.
Evert, S., Heid, U., Lehmann, E., & Rohrer, C. (Eds.) (2000). Proceedings of the 9th EURALEX International Congress. Stuttgart, Germany.
Evert, S., Heid, U., & Lezius, W. (2000). Methoden zum Vergleich von Signifikanzmaßen zur Kollokationsidentifikation. In Zühlke W, Schukat-Talamazzini EG (Eds.), KONVENS-2000 Sprachkommunikation (pp. 215--220). Ilmenau, Germany: VDE-Verlag.
Evert, S., Heid, U., & Lüdeling, A. (2000). On Measuring Morphological Productivity. In Zühlke W, Schukat-Talamazzini EG (Eds.), KONVENS-2000 Sprachkommunikation (pp. 57--61). Ilmenau, Germany: VDE-Verlag.

Last updated on 2019-24-04 at 10:19