A comprehensive multi-domain dataset for mitotic figure detection

Publication date

2023-12

Authors

Aubreville, Marc
Wilm, Frauke
Stathonikos, NikolasORCID 0000-0002-5457-7580
Breininger, Katharina
Donovan, Taryn A
Jabari, Samir
Veta, Mitko
Ganz, Jonathan
Ammeling, Jonas
van Diest, PaulORCID 0000-0003-0658-2745ISNI 000000004213151X

Editors

Advisors

Supervisors

Document Type

Article

Collections

Open Access logo

License

cc_by

Abstract

The prognostic value of mitotic figures in tumor tissue is well-established for many tumor types and automating this task is of high research interest. However, especially deep learning-based methods face performance deterioration in the presence of domain shifts, which may arise from different tumor types, slide preparation and digitization devices. We introduce the MIDOG++ dataset, an extension of the MIDOG 2021 and 2022 challenge datasets. We provide region of interest images from 503 histological specimens of seven different tumor types with variable morphology with in total labels for 11,937 mitotic figures: breast carcinoma, lung carcinoma, lymphosarcoma, neuroendocrine tumor, cutaneous mast cell tumor, cutaneous melanoma, and (sub)cutaneous soft tissue sarcoma. The specimens were processed in several laboratories utilizing diverse scanners. We evaluated the extent of the domain shift by using state-of-the-art approaches, observing notable differences in single-domain training. In a leave-one-domain-out setting, generalizability improved considerably. This mitotic figure dataset is the first that incorporates a wide domain shift based on different tumor types, laboratories, whole slide image scanners, and species.

Keywords

Algorithms, Humans, Mitosis, Neoplasms/pathology, Prognosis, Information Systems, Education, Library and Information Sciences, Statistics and Probability, Computer Science Applications, Statistics, Probability and Uncertainty, Research Support, Non-U.S. Gov't, Dataset, Journal Article

Citation

Aubreville, M, Wilm, F, Stathonikos, N, Breininger, K, Donovan, T A, Jabari, S, Veta, M, Ganz, J, Ammeling, J, van Diest, P J, Klopfleisch, R & Bertram, C A 2023, 'A comprehensive multi-domain dataset for mitotic figure detection', Scientific data, vol. 10, no. 1, 484. https://doi.org/10.1038/s41597-023-02327-4