Authormagic - An Approach to Author Disambiguation in Large-Scale Digital Libraries

Conference contribution
(Original article)


Publication Details

Author(s): Weiler H, Meyer-Wegener K, Mele S
Title edited volumes: International Conference on Information and Knowledge Management, Proceedings
Publisher: ACM
Publishing place: New York
Publication year: 2011
Conference Proceedings Title: Proc. 20th ACM Conf. on Information and Knowledge Management
Pages range: 2293-2296
ISBN: 978-1-4503-0717-8
Language: English


Abstract


A collaboration of leading research centers in the field of High Energy Physics (HEP) has built INSPIRE, a novel information infrastructure, which comprises the entire corpus of about one million documents produced within the discipline, including a rich set of metadata, citation information and half a million full-text documents, and offers a unique opportunity for author disambiguation strategies. The presented approach features extended metadata comparison metrics and a three-step unsupervised graph clustering technique. The algorithm aided in identifying 200'000 individuals from 6'500'000 author signatures. Preliminary tests based on knowledge of external experts and a pilot of a crowd-sourcing system show a success rate of more than 96% within the selected test cases. The obtained author clusters serve as a recommendation for INSPIRE users to further clean the publication list in a crowd-sourced approach. © 2011 ACM.



FAU Authors / FAU Editors

Meyer-Wegener, Klaus Prof. Dr.-Ing.
Lehrstuhl für Informatik 6 (Datenmanagement)
Weiler, Henning
Lehrstuhl für Informatik 6 (Datenmanagement)


How to cite

APA:
Weiler, H., Meyer-Wegener, K., & Mele, S. (2011). Authormagic - An Approach to Author Disambiguation in Large-Scale Digital Libraries. In Proc. 20th ACM Conf. on Information and Knowledge Management (pp. 2293-2296). Glasgow, Scotland, UK, GB: New York: ACM.

MLA:
Weiler, Henning, Klaus Meyer-Wegener, and Salvatore Mele. "Authormagic - An Approach to Author Disambiguation in Large-Scale Digital Libraries." Proceedings of the CIKM 2011, Glasgow, Scotland, UK New York: ACM, 2011. 2293-2296.

BibTeX: 

Last updated on 2018-19-04 at 02:52