Pan-tumor CAnine cuTaneous Cancer Histology (CATCH) dataset

Wilm F, Fragoso M, Marzahl C, Qiu J, Puget C, Diehl L, Bertram CA, Klopfleisch R, Maier A, Breininger K, Aubreville M (2022)

Publication Type: Journal article

Publication year: 2022


Volume: 9

Article Number: 588

Journal Issue: 1

DOI: 10.1038/s41597-022-01692-w


Due to morphological similarities, the differentiation of histologic sections of cutaneous tumors into individual subtypes can be challenging. Recently, deep learning-based approaches have proven their potential for supporting pathologists in this regard. However, many of these supervised algorithms require a large amount of annotated data for robust development. We present a publicly available dataset of 350 whole slide images of seven different canine cutaneous tumors complemented by 12,424 polygon annotations for 13 histologic classes, including seven cutaneous tumor subtypes. In inter-rater experiments, we show a high consistency of the provided labels, especially for tumor annotations. We further validate the dataset by training a deep neural network for the task of tissue segmentation and tumor subtype classification. We achieve a class-averaged Jaccard coefficient of 0.7047, and 0.9044 for tumor in particular. For classification, we achieve a slide-level accuracy of 0.9857. Since canine cutaneous tumors possess various histologic homologies to human tumors, the added value of this dataset is not limited to veterinary pathology but extends to more general fields of application.
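
For readers unfamiliar with the segmentation metric quoted in the abstract, the following is a minimal sketch of how a class-averaged Jaccard coefficient (intersection over union, averaged over classes) can be computed from flat sequences of integer class labels. The function name `class_averaged_jaccard`, the choice to skip classes that appear in neither sequence, and the toy labels are assumptions made for illustration; the record does not state how the authors aggregated pixels or slides, so this is not necessarily their evaluation protocol.

```python
def class_averaged_jaccard(y_true, y_pred, num_classes):
    """Return (mean Jaccard, per-class Jaccard) for two label sequences.

    y_true, y_pred: equal-length sequences of integer class ids in
    [0, num_classes). Classes absent from both sequences are skipped
    (an assumption; other conventions exist).
    """
    if len(y_true) != len(y_pred):
        raise ValueError("label sequences must have the same length")

    intersection = [0] * num_classes
    union = [0] * num_classes
    for t, p in zip(y_true, y_pred):
        if t == p:
            # A correctly labeled element counts once in both sets.
            intersection[t] += 1
            union[t] += 1
        else:
            # A mismatch enlarges the union of both classes involved.
            union[t] += 1
            union[p] += 1

    per_class = {
        c: intersection[c] / union[c]
        for c in range(num_classes)
        if union[c] > 0
    }
    if not per_class:
        raise ValueError("no class is present in either sequence")
    mean = sum(per_class.values()) / len(per_class)
    return mean, per_class


# Toy example with three hypothetical classes (not data from the paper).
reference = [0, 0, 1, 1, 2, 2]
predicted = [0, 1, 1, 1, 2, 0]
mean_jaccard, jaccard_by_class = class_averaged_jaccard(reference, predicted, 3)
```

Averaging per class rather than over all elements gives rare classes the same weight as frequent ones, which is why a class-averaged score (0.7047 in the abstract) can sit well below the score of a single dominant class such as tumor (0.9044).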

How to cite


APA: Wilm, F., Fragoso, M., Marzahl, C., Qiu, J., Puget, C., Diehl, L., ... Aubreville, M. (2022). Pan-tumor CAnine cuTaneous Cancer Histology (CATCH) dataset. Scientific Data, 9(1), 588. https://doi.org/10.1038/s41597-022-01692-w


MLA: Wilm, Frauke, et al. "Pan-tumor CAnine cuTaneous Cancer Histology (CATCH) dataset." Scientific Data 9.1 (2022): 588.
