Abstract: Pan-tumor CAnine CuTaneous Cancer Histology (CATCH) Dataset

Wilm F, Fragoso M, Marzahl C, Qiu J, Puget C, Diehl L, Bertram CA, Klopfleisch R, Maier A, Breininger K, Aubreville M (2023)

Publication Type: Conference contribution

Publication year: 2023


Publisher: Springer Science and Business Media Deutschland GmbH

Pages Range: 205-

Conference Proceedings Title: Informatik aktuell

Event location: Braunschweig DE

ISBN: 9783658416560

DOI: 10.1007/978-3-658-41657-7_45


The identification of tumor regions on cutaneous tissue sections and the subsequent differentiation into individual tumor subtypes are routine tasks for veterinary pathologists. However, manual tumor delineation can be time-consuming and morphological similarities of tumor types can make the subtyping task difficult. Deep learning-based algorithms have been successfully applied for both tasks but require a large amount of annotated data to learn robust representations. Acquiring the annotations necessary to train these algorithms can be time-consuming and requires expert knowledge. Publicly available datasets reduce this annotation overhead and allow for a comparison of image analysis algorithms trained for the task at hand. We published a publicly available dataset of 350 whole slide images (WSIs) of seven different canine cutaneous tumors. These were annotated for the tumor subtypes and additionally six skin tissue classes. Overall, the database includes 12,424 polygon annotations for 13 classes, which exceeds most publicly available datasets in annotation extent and label diversity. In our dataset report, we validated the provided annotations through inter-rater experiments on a subset of the presented dataset, where three pathologists demonstrated high consistency in labeling tissue structures, especially for tumor annotations. Furthermore, we performed a technical validation by first training a deep neural network for tissue phenotyping. The model segmented the WSI into tumor and five skin tissue classes, where we achieved a class-averaged Jaccard coefficient of 0.7047 and a Jaccard coefficient of 0.9044 for tumor in particular. Afterward, we classified the segmented tumor regions into the tumor subtypes and achieved a slide-level accuracy of 0.9857. These baseline results provide a starting point for the development of more advanced algorithms. Previous works have shown that canine and human cutaneous tumors share various histologic patterns, which extends the added value of the presented dataset beyond veterinary pathology [1].

Authors with CRIS profile

Involved external institutions

How to cite


Wilm, F., Fragoso, M., Marzahl, C., Qiu, J., Puget, C., Diehl, L.,... Aubreville, M. (2023). Abstract: Pan-tumor CAnine CuTaneous Cancer Histology (CATCH) Dataset. In Thomas M. Deserno, Heinz Handels, Andreas Maier, Klaus Maier-Hein, Christoph Palm, Thomas Tolxdorff (Eds.), Informatik aktuell (pp. 205-). Braunschweig, DE: Springer Science and Business Media Deutschland GmbH.


Wilm, Frauke, et al. "Abstract: Pan-tumor CAnine CuTaneous Cancer Histology (CATCH) Dataset." Proceedings of the Bildverarbeitung für die Medizin Workshop, BVM 2023, Braunschweig Ed. Thomas M. Deserno, Heinz Handels, Andreas Maier, Klaus Maier-Hein, Christoph Palm, Thomas Tolxdorff, Springer Science and Business Media Deutschland GmbH, 2023. 205-.

BibTeX: Download