Delta vs. N-Gram Tracing: Evaluating the Robustness of Authorship Attribution Methods

Beitrag bei einer Tagung
(Originalarbeit)


Details zur Publikation

Autorinnen und Autoren: Proisl T, Evert S, Jannidis F, Schöch C, Konle L, Pielström S
Herausgeber: Calzolari N, Choukri K, Cieri C, Declerck T, Goggi S, Hasida K, Isahara H, Maegaard B, Mariani J, Mazo H, Moreno A, Odijk J, Piperidis S, Tokunaga T
Verlag: European Language Resources Association
Verlagsort: Miyazaki
Jahr der Veröffentlichung: 2018
Tagungsband: Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018)
Seitenbereich: 3309–3314
ISBN: 979-10-95546-00-9
Sprache: Englisch


Abstract

Delta measures are a well-established and popular family of authorship
attribution methods, especially for literary texts. N-gram tracing is a
novel method for authorship attribution designed for very short texts,
which has its roots in forensic linguistics. We evaluate the performance
of both methods in a series of experiments on English, French and
German literary texts, in order to investigate the relationship between
authorship attribution accuracy and text length as well as the
composition of the comparison corpus. Our results show that, at least in
our setting, both methods require relatively long texts and are
furthermore highly sensitive to the choice of authors and texts in the
comparison corpus.



FAU-Autorinnen und Autoren / FAU-Herausgeberinnen und Herausgeber

Evert, Stefan Prof. Dr.
Lehrstuhl für Korpus- und Computerlinguistik
Proisl, Thomas
Lehrstuhl für Korpus- und Computerlinguistik


Einrichtungen weiterer Autorinnen und Autoren

Julius-Maximilians-Universität Würzburg
Universität Trier


Zitierweisen

APA:
Proisl, T., Evert, S., Jannidis, F., Schöch, C., Konle, L., & Pielström, S. (2018). Delta vs. N-Gram Tracing: Evaluating the Robustness of Authorship Attribution Methods. In Calzolari N, Choukri K, Cieri C, Declerck T, Goggi S, Hasida K, Isahara H, Maegaard B, Mariani J, Mazo H, Moreno A, Odijk J, Piperidis S, Tokunaga T (Eds.), Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018) (pp. 3309–3314). Miyazaki, JP: Miyazaki: European Language Resources Association.

MLA:
Proisl, Thomas, et al. "Delta vs. N-Gram Tracing: Evaluating the Robustness of Authorship Attribution Methods." Proceedings of the 11th Language Resources and Evaluation Conference, Miyazaki Ed. Calzolari N, Choukri K, Cieri C, Declerck T, Goggi S, Hasida K, Isahara H, Maegaard B, Mariani J, Mazo H, Moreno A, Odijk J, Piperidis S, Tokunaga T, Miyazaki: European Language Resources Association, 2018. 3309–3314.

BibTeX: 

Zuletzt aktualisiert 2018-11-08 um 02:59