|
Persistent Identifier
|
doi:10.18710/PUJOU2 |
|
Publication Date
|
2025-12-19 |
|
Title
| Replication refined dataset for: A lightweight and extensible cell segmentation and classification model for H&E-stained cancer whole slide images |
|
Author
| Shvetsov, NikitaUiT The Arctic University of NorwayORCIDhttps://orcid.org/0000-0002-8472-3702 |
|
Point of Contact
|
Use email button above to contact.
Shvetsov Nikita (UiT The Arctic University of Norway) |
|
Description
| The refined PanNuke and MoNuSAC Cell Segmentation and Classification Dataset is a unified collection of H&E-stained image patches with cell instance annotations and seven cell-type labels. It is created by combining the PanNuke and MoNuSAC datasets while improving label granularity and consistency across both sources.
The dataset is generated using a cross-relabeling workflow that refines broad or ambiguous classes in each dataset using two ResNet50-based cell classifiers trained on extracted single-cell crops. A classifier trained on MoNuSAC immune cells is used to split the PanNuke inflammatory class into lymphocytes, neutrophils, and macrophages. A classifier trained on PanNuke epithelial subclasses is used to split the MoNuSAC epithelial class into epithelial (benign) and neoplastic (malignant). The relabeled instances are merged with the remaining original classes to form a single dataset with harmonized labels.
The resulting refined dataset includes seven cell types with the following instance counts: neoplastic 105,451; epithelial 29,926; lymphocytes 65,275; neutrophils 3,833; macrophages 3,410; connective 50,585; dead 2,908. (2025-12-01) |
|
Subject
| Medicine, Health and Life Sciences |
|
Keyword
| Cell Segmentation
Cell Classification
Histopathological Images
Cancer |
|
Related Publication
| Is Supplement To: Shvetsov, N., Kilvær, T. K., Tafavvoghi, M., Sildnes, A., Møllersen, K., Busund, L. R., & Bongo, L. A. (2025). A lightweight and extensible cell segmentation and classification model for H&E-stained cancer whole slide images. Computers in Biology and Medicine, 199, 111326. doi 10.1016/j.compbiomed.2025.111326 https://doi.org/10.1016/j.compbiomed.2025.111326 |
|
Language
| English |
|
Producer
| UiT The Arctic University of Norway (UiT) https://en.uit.no/ |
|
Contributor
| Researcher: Nikita Shvetsov |
|
Funding Information
| Research Council of Norway: 309439 SFI VI
North Norwegian Health Authority: HNF1521-20 |
|
Distributor
| UiT The Arctic University of Norway (UiT) https://dataverse.no/dataverse/uit |
|
Depositor
| Shvetsov, Nikita |
|
Deposit Date
| 2025-12-01 |
|
Software
| Python, Version: 3.12.7
Numpy, Version: 1.26.4
opencv-python, Version: 4.10.0 |
|
Related Dataset
| R. Verma et al., "MoNuSAC2020: A Multi-Organ Nuclei Segmentation and Classification Challenge," in IEEE Transactions on Medical Imaging, vol. 40, no. 12, pp. 3413-3423, Dec. 2021, doi: 10.1109/TMI.2021.3085712.; Gamper, J., Koohbanani, N. A., Benes, K., Graham, S., Jahanifar, M., Khurram, S. A., Rajpoot, N. (2020). PanNuke Dataset Extension, Insights and Baselines. arXiv [Eess.IV]. Retrieved from http://arxiv.org/abs/2003.10778 |