Investigating the Effect of Using Synthetic and Semi-synthetic Images for Historical Document Font Classification

Nikolaidou K, Upadhyay R, Seuret M, Liwicki M (2022)


Publication Type: Conference contribution

Publication year: 2022

Journal

Publisher: Springer Science and Business Media Deutschland GmbH

Book Volume: 13237 LNCS

Pages Range: 613-626

Conference Proceedings Title: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

Event location: La Rochelle, FRA

ISBN: 9783031065545

DOI: 10.1007/978-3-031-06555-2_41

Abstract

This paper studies the effect of using various data augmentation by synthetization approaches on historical image data, particularly for font classification. Historical document image datasets often lack the appropriate size to train and evaluate deep learning models, motivating data augmentation and synthetic document generation techniques for creating additional data. This work explores the effect of various semi-synthetic and synthetic historical document images, some of which appear as recent trends and others not published yet, on a font classification task. We use 10K patch samples as baseline dataset, derived from the dataset of Early Printed Books with Multiple Font Groups, and increase its size using DocCreator software and Generative Adversarial Networks (GAN). Furthermore, we fine-tune different pre-trained Convolutional Neural Network (CNN) classifiers as a baseline using the original dataset and then compare the performance with the additional semi-synthetic and synthetic images. We further evaluate the performance using additional real samples from the original dataset in the training process. DocCreator, and the additional real samples improve the performance giving the best results. Finally, for the best-performing architecture, we explore different sizes of training sets and examine how the gradual addition of data affects the performance.

Authors with CRIS profile

Involved external institutions

How to cite

APA:

Nikolaidou, K., Upadhyay, R., Seuret, M., & Liwicki, M. (2022). Investigating the Effect of Using Synthetic and Semi-synthetic Images for Historical Document Font Classification. In Seiichi Uchida, Elisa Barney, Véronique Eglin (Eds.), Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (pp. 613-626). La Rochelle, FRA: Springer Science and Business Media Deutschland GmbH.

MLA:

Nikolaidou, Konstantina, et al. "Investigating the Effect of Using Synthetic and Semi-synthetic Images for Historical Document Font Classification." Proceedings of the 15th IAPR International Workshop on Document Analysis Systems, DAS 2022, La Rochelle, FRA Ed. Seiichi Uchida, Elisa Barney, Véronique Eglin, Springer Science and Business Media Deutschland GmbH, 2022. 613-626.

BibTeX: Download