A novel dataset for nuclei and tissue segmentation in melanoma with baseline nuclei segmentation and tissue segmentation benchmarks

Publication date

2025-01-06

Authors

Schuiveling, MarkORCID 0000-0002-2631-7271
Liu, Hong
Eek, Daniel
Breimer, Gerben EORCID 0000-0003-0365-3667
Suijkerbuijk, KarijnORCID 0000-0003-3604-5430ISNI 0000000388512483
Blokx, WillekeORCID 0000-0002-4647-8830
Veta, Mitko

Editors

Advisors

Supervisors

Document Type

Article

Collections

Open Access logo

License

cc_by

Abstract

BACKGROUND: Melanoma is an aggressive form of skin cancer in which tumor-infiltrating lymphocytes (TILs) are a biomarker for recurrence and treatment response. Manual TIL assessment is prone to interobserver variability, and current deep learning models are not publicly accessible or have low performance. Deep learning models, however, have the potential of consistent spatial evaluation of TILs and other immune cell subsets with the potential of improved prognostic and predictive value. To make the development of these models possible, we created the Panoptic Segmentation of nUclei and tissue in advanced MelanomA (PUMA) dataset and assessed the performance of several state-of-the-art deep learning models. In addition, we show how to improve model performance further by using heuristic postprocessing in which nuclei classes are updated based on their tissue localization. RESULTS: The PUMA dataset includes 155 primary and 155 metastatic melanoma hematoxylin and eosin-stained regions of interest with nuclei and tissue annotations from a single melanoma referral institution. The Hover-NeXt model, trained on the PUMA dataset, demonstrated the best performance for lymphocyte detection, approaching human interobserver agreement. In addition, heuristic postprocessing of deep learning models improved the detection of noncommon classes, such as epithelial nuclei. CONCLUSION: The PUMA dataset is the first melanoma-specific dataset that can be used to develop melanoma-specific nuclei and tissue segmentation models. These models can, in turn, be used for prognostic and predictive biomarker development. Incorporating tissue and nuclei segmentation is a step toward improved deep learning nuclei segmentation performance. To support the development of these models, this dataset is used in the PUMA challenge.

Keywords

Benchmarking, Cell Nucleus, Deep Learning, Humans, Image Processing, Computer-Assisted/methods, Lymphocytes, Tumor-Infiltrating, Melanoma/pathology, Skin Neoplasms/pathology, Journal Article

Citation

Schuiveling, M, Liu, H, Eek, D, Breimer, G E, Suijkerbuijk, K P M, Blokx, W A M & Veta, M 2025, 'A novel dataset for nuclei and tissue segmentation in melanoma with baseline nuclei segmentation and tissue segmentation benchmarks', GigaScience, vol. 14, giaf011, pp. 1-12. https://doi.org/10.1093/gigascience/giaf011