MA Thesis: Deep Learning for Lymph Node Metastasis Detection in Pancreatic Ductal Adenocarcinoma
Abstract: Pancreatic ductal adenocarcinoma (PDAC) is one of the most lethal cancers, with lymph node metastasis (LNM) being a critical determinant in patient prognosis and therapeutic planning [1-2]. Conventional methods for detecting LNM in PDAC primarily rely on contrast-enhanced CT scans, but these often fall short in sensitivity, especially in early-stage disease. Studies have shown that preoperative imaging tends to underestimate LNM, leading to inaccurate staging that can influence treatment outcomes [3-4]. Given the high stakes associated with accurate LNM detection, there is a pressing need to develop more precise and sensitive models that can provide reliable predictions and aid in clinical decision-making.
Recent advancements in deep learning (DL) have shown promise in enhancing the accuracy of LNM detection for PDAC [5], although external validation remains necessary to confirm the generalizability of these models [6]. Building on this foundation, our previous work [7] integrated imaging features with non-imaging clinical attributes, such as clinical biomarkers, namely CA-19-9, tumor size and tumor location, into a DL pipeline to improve overall LNM detection. Moreover, challenges such as data heterogeneity across multi-institutional cohorts and imbalances in positive and negative LNM cases still need to be addressed in these multi-center datasets, ensuring that the model is both robust and adaptable across different clinical environments.
This thesis aims to further expand on our previous work by including datasets from seven centers across Australia and Germany, encompassing a total of 1,260+ cases. The project will investigate the impact of clinical attributes on model performance through an ablation study, perform feature normalization using Adaptive Instance Normalization (AdaIN) to mitigate center-specific data biases, and analyze self-attention maps from SwinTransformers to verify if the model accurately identifies LNM-positive regions. By extending the DL pipeline to additional centers and rigorously examining the model’s feature fusion and attention mechanisms, we aim to achieve a highly accurate, generalizable model for LNM detection in PDAC that could serve as a valuable tool in clinical practice.
Research Questions:
- Q1) How effectively can an expanded DL model integrating multi-center data from seven centers across Australia and Germany and clinical attributes improve LNM detection?
- Q2) What is the impact of different feature fusion techniques on the model’s ability to accurately detect LNM?
- Q3) How does feature normalization, particularly using Adaptive Instance Normalization (AdaIN), improve model robustness across datasets from diverse centers?
- Q4) Can self-attention maps from the SwinTransformers capture LNM-positive regions effectively, and how well do these maps align with expert-validated annotations of lymph node metastasis?
Clinical Collaborators:
- Patrick Kupczyk, University Hospital Bonn
- Hyun Ko, Peter MacCallum Cancer Centre
- Alexander Semaan, University Hospital Bonn
Dataset: The dataset now comprises a total of 1,260+ cases collected from seven centers in Germany and Australia. Each scan is annotated with histological data (N+ or N0) to confirm LNM presence or absence:
- University Hospital Bonn (UKB): Comprising 180 cases (106 N+ PDAC patients) from the Radiology Department and the Clinic and Polyclinic for General, Visceral, Thoracic, and Vascular Surgery.
- Peter MacCallum Cancer Centre, Melbourne (MEL): 480 cases sourced from four hospitals in Melbourne, including the Royal Melbourne Hospital, Western Health, and Northern Health. Of these, 42 cases are N+ with specialist radiologist-provided lymph node segmentations.
- Berlin Cohort: 400 cases, with preprocessing and annotation required.
- Additional Göttingen Cohort: Over 200 cases, currently in the acquisition process.
Expected Tasks:
- Pipeline Extension: Extend the current model to incorporate data from two additional centers, totaling seven. This requires documenting dataset details from each center (including annotation availability) and balancing the LNM-positive and LNM-negative cases.
- Ablation Study and Feature Fusion Analysis: Perform an ablation study, analyzing the impact of clinical attributes alone, and assess different feature fusion techniques to optimize model performance.
- Feature Normalization across Centers: Investigate the use of Adaptive Instance Normalization (AdaIN) as a data augmentation technique to improve feature consistency across centers, minimizing the effect of domain shifts [8].
- Attention Map Analysis: Use SwinTransformers’ self-attention maps to assess whether the model accurately captures positive lymph nodes (validated by expert radiologist Alex). Relevant research includes the methodology outlined in [9].
Roadmap:
- Review relevant literature on LNM prediction and DL models in medical imaging.
- Expand and preprocess datasets from all centers, including balancing strategies for LNM cases.
- Develop and integrate the baseline DL model with extended data inputs, incorporating feature normalization and fusion mechanisms.
- Conduct experiments, including ablation studies and feature fusion assessments.
- Complete the analysis and write the thesis.
Requirements:
- Interested candidate must be based in Bonn for data acccess within the hopsital (willing to relocate to Bonn with possibility of having her/him on a HiWi position)
- Proficiency in Python and familiarity with machine learning frameworks (e.g., PyTorch).
- Background in deep learning, especially in medical imaging.
- Strong foundation in data preprocessing and data integration.
References:
- Katz, Matthew H. G. et al. “Tumor‐Node‐Metastasis Staging of Pancreatic Adenocarcinoma.” CA: A Cancer Journal for Clinicians 58 (2008).
- National Comprehensive Cancer Network. National comprehensive cancer network (nccn) guidelines, 2023.
- Loch, Florian N. et al. “Accuracy of various criteria for lymph node staging in ductal adenocarcinoma of the pancreatic head by computed tomography and magnetic resonance imaging.” World Journal of Surgical Oncology 18 (2020).
- Perrotta, Gerardo et al. “Accuracy of Clinical Staging in Early-Stage Pancreatic Ductal Adenocarcinoma.” JAMA (2024).
- Castellana, R. et al. “Radiomics and deep learning models for LNM prediction in PDAC: A systematic review and meta-analysis.” European Journal of Radiology 176 (2024).
- Zheng, Zhilin et al. “A deep local attention network for pre-operative lymph node metastasis prediction in pancreatic cancer via multiphase CT imaging.” ArXiv abs/2301.01448 (2023)
- D Gaviria. et al. Deep Learning for Improved Lymph Node Metastasis Detection in Pancreatic Ductal Adenocarcinoma. Accepted at Medical Imaging and Computer-Aided Diagnosis (MICAD) 2024.
- Huang, Xun and Serge J. Belongie. Arbitrary Style Transfer in Real-Time with Adaptive Instance Normalization. 2017 IEEE International Conference on Computer Vision (ICCV) (2017): 1510-1519.
- Rasoulian, Amir et al. Weakly Supervised Intracranial Hemorrhage Segmentation Using Hierarchical Combination of Attention Maps from a Swin Transformer. MLCN@MICCAI (2022).
For inquiries, please contact Prof. Dr. Shadi Albarqouni