TY - JOUR
T1 - Joint breast neoplasm detection and subtyping using multi-resolution network trained on large-scale H&E whole slide images with weak labels
AU - Casson, Adam
AU - Liu, Siqi
AU - Godrich, Ran A.
AU - Aghdam, Hamed
AU - Lee, Donghun
AU - Malfroid, Kasper
AU - Rothrock, Brandon
AU - Kanan, Christopher
AU - Retamero, Juan
AU - Hanna, Matt
AU - Millar, Ewan
AU - Klimstra, David
AU - Fuchs, Thomas
PY - 2023
Y1 - 2023
N2 - Breast cancer is the most commonly diagnosed cancer in the world. The use of artificial intelligence (AI) to help diagnose the disease from digital pathology images has the potential to greatly improve patient outcomes. However, methods for training these models for detecting, segmenting, and subtyping breast neoplasms and other proliferative lesions often rely on costly and time-consuming manual annotation, which can be infeasible for large-scale datasets. In this work, we propose a weakly supervised learning framework to jointly detect, segment, and subtype breast neoplasms. Our approach leverages top-k multiple instance learning to train an initial neoplasm detection backbone network from weakly-labeled whole slide images, which is then used to automatically generate pixel-level pseudo-labels for whole slides. A second network is trained using these pseudo-labels, and slide-level classification is performed by training an aggregator network that fuses the embeddings from both backbone networks. We trained and validated our framework on large-scale datasets with more than 125k whole slide images and demonstrate its effectiveness on tasks including breast neoplasms detection, segmentation, and subtyping.
AB - Breast cancer is the most commonly diagnosed cancer in the world. The use of artificial intelligence (AI) to help diagnose the disease from digital pathology images has the potential to greatly improve patient outcomes. However, methods for training these models for detecting, segmenting, and subtyping breast neoplasms and other proliferative lesions often rely on costly and time-consuming manual annotation, which can be infeasible for large-scale datasets. In this work, we propose a weakly supervised learning framework to jointly detect, segment, and subtype breast neoplasms. Our approach leverages top-k multiple instance learning to train an initial neoplasm detection backbone network from weakly-labeled whole slide images, which is then used to automatically generate pixel-level pseudo-labels for whole slides. A second network is trained using these pseudo-labels, and slide-level classification is performed by training an aggregator network that fuses the embeddings from both backbone networks. We trained and validated our framework on large-scale datasets with more than 125k whole slide images and demonstrate its effectiveness on tasks including breast neoplasms detection, segmentation, and subtyping.
KW - Breast Neoplasm Detection
KW - Digital Pathology
KW - Multiple Instance Learning
UR - http://www.scopus.com/inward/record.url?scp=85189322532&partnerID=8YFLogxK
UR - https://proceedings.mlr.press/v227/casson24a/casson24a.pdf
M3 - Article
AN - SCOPUS:85189322532
SN - 2640-3498
VL - 227
SP - 18
EP - 38
JO - Proceedings of Machine Learning Research
JF - Proceedings of Machine Learning Research
T2 - 6th International Conference on Medical Imaging with Deep Learning, MIDL 2023
Y2 - 10 July 2023 through 12 July 2023
ER -