Delta vs. N-Gram Tracing: Evaluating the Robustness of Authorship Attribution Methods

Proisl T, Evert S, Jannidis F, Schöch C, Konle L, Pielström S (2018)


Publication Language: English

Publication Type: Conference contribution, Original article

Publication year: 2018

Publisher: European Language Resources Association

City/Town: Miyazaki

Pages Range: 3309–3314

Conference Proceedings Title: Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018)

Event location: Miyazaki JP

ISBN: 979-10-95546-00-9

URI: http://www.lrec-conf.org/proceedings/lrec2018/pdf/835.pdf

Open Access Link: http://www.lrec-conf.org/proceedings/lrec2018/pdf/835.pdf

Abstract

Delta measures are a well-established and popular family of authorship attribution methods, especially for literary texts. N-gram tracing is a novel method for authorship attribution designed for very short texts, which has its roots in forensic linguistics. We evaluate the performance of both methods in a series of experiments on English, French and German literary texts, in order to investigate the relationship between authorship attribution accuracy and text length as well as the composition of the comparison corpus. Our results show that, at least in our setting, both methods require relatively long texts and are furthermore highly sensitive to the choice of authors and texts in the comparison corpus.


Authors with CRIS profile

Involved external institutions

How to cite

APA:

Proisl, T., Evert, S., Jannidis, F., Schöch, C., Konle, L., & Pielström, S. (2018). Delta vs. N-Gram Tracing: Evaluating the Robustness of Authorship Attribution Methods. In Calzolari N, Choukri K, Cieri C, Declerck T, Goggi S, Hasida K, Isahara H, Maegaard B, Mariani J, Mazo H, Moreno A, Odijk J, Piperidis S, Tokunaga T (Eds.), Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018) (pp. 3309–3314). Miyazaki, JP: Miyazaki: European Language Resources Association.

MLA:

Proisl, Thomas, et al. "Delta vs. N-Gram Tracing: Evaluating the Robustness of Authorship Attribution Methods." Proceedings of the 11th Language Resources and Evaluation Conference, Miyazaki Ed. Calzolari N, Choukri K, Cieri C, Declerck T, Goggi S, Hasida K, Isahara H, Maegaard B, Mariani J, Mazo H, Moreno A, Odijk J, Piperidis S, Tokunaga T, Miyazaki: European Language Resources Association, 2018. 3309–3314.

BibTeX: Download