Lizard: A Large-Scale Dataset for Colonic Nuclear Instance Segmentation and Classification
Ground truth
DOI:
10.48550/arxiv.2108.11195
Publication Date:
2021-01-01
AUTHORS (17)
ABSTRACT
The development of deep segmentation models for computational pathology (CPath) can help foster the investigation interpretable morphological biomarkers. Yet, there is a major bottleneck in success such approaches because supervised learning require an abundance accurately labelled data. This issue exacerbated field CPath generation detailed annotations usually demands input pathologist to be able distinguish between different tissue constructs and nuclei. Manually labelling nuclei may not feasible approach collecting large-scale annotated datasets, especially when single image region contain thousands cells. However, solely relying on automatic will limit accuracy reliability ground truth. Therefore, overcome above challenges, we propose multi-stage annotation pipeline enable collection datasets histology analysis, with pathologist-in-the-loop refinement steps. Using this pipeline, generate largest known nuclear instance classification dataset, containing nearly half million H&E stained colon tissue. We have released dataset encourage research community utilise it drive forward downstream cell-based CPath.
SUPPLEMENTAL MATERIAL
Coming soon ....
REFERENCES ()
CITATIONS ()
EXTERNAL LINKS
PlumX Metrics
RECOMMENDATIONS
FAIR ASSESSMENT
Coming soon ....
JUPYTER LAB
Coming soon ....