Methodological foundations of corpus research and digital humanities


Organisation:
Lehrstuhl für Korpus- und Computerlinguistik

Description:


Corpus research in linguistics, the digital humanities, and the social sciences relies on a wide range of statistical techniques and visualizations. A central goal of our research is to develop sound methodological foundations for corpus linguistics that address key problems and ensure that quantitative analyses are both reliable and meaningful.


Related Project(s)


(KALLIMACHOS – Centre for digital editions and quantitative analysis at the University of Würzburg):
KALLIMACHOS II: Measures of linguistic complexity for literary stylometry in the KALLIMACHOS Centre for Digital Humanities
Prof. Dr. Stefan Evert
(01/10/2017 - 30/09/2019)
E-SPar: Efficient simulation experiments for large-scale parameter optimisation of machine learning approaches in natural language processing
Prof. Dr. Stefan Evert
(01/10/2016 - 30/09/2017)
(KALLIMACHOS – Centre for digital editions and quantitative analysis at the University of Würzburg):
KALLIMACHOS: Corpus-linguistic approaches and statistical methodology in the KALLIMACHOS Centre for Digital Humanities
Prof. Dr. Stefan Evert
(01/10/2014 - 30/09/2017)



Outstanding publications


Evert, S., Wankerl, S., & Nöth, E. (2017). Reliable measures of syntactic and lexical complexity: The case of Iris Murdoch. Paper presentation, Birmingham, GB.
Evert, S., & Neumann, S. (2017). The impact of translation direction on characteristics of translated texts. A multivariate analysis for English and German. In De Sutter G, Lefer M, Delaere I (Eds.), Empirical Translation Studies. New Theoretical and Methodological Traditions (pp. 47-80). Berlin: Mouton de Gruyter.
Evert, S., & Arppe, A. (2015). Some theoretical and experimental observations on naïve discriminative learning. In Proceedings of the 6th Conference on Quantitative Investigations in Theoretical Linguistics (QITL-6). Tübingen, Germany.
Evert, S., Proisl, T., Jannidis, F., Pielström, S., Schöch, C., & Vitt, T. (2015). Towards a better understanding of Burrows's Delta in literary authorship attribution. In Proceedings of the Fourth Workshop on Computational Linguistics for Literature (pp. 79-88). Denver, CO.
Diwersy, S., Evert, S., & Neumann, S. (2014). A weakly supervised multivariate approach to the study of language variation. In Szmrecsanyi B, Wälchli B (Eds.), Aggregating Dialectology, Typology, and Register Analysis. Linguistic Variation in Text and Speech (pp. 174–204). Berlin, Boston: De Gruyter.
Baroni, M., & Evert, S. (2007). Words and Echoes: Assessing and Mitigating the Non-Randomness Problem in Word Frequency Distribution Modeling. In Proceedings of the 45th Annual Meeting of the Association for Computational Linguistics (pp. 904-911). Prague, Czech Republic.
Evert, S. (2006). How Random is a Corpus? The Library Metaphor. Zeitschrift für Anglistik und Amerikanistik, 54(2), 177-190.

Last updated on 2018-10-24 at 15:42