MA Thesis: Development of a Machine Learning Algorithm for Histopathological Classification of Conjunctival Melanocytic Intraepithelial Lesions

Abstract. Conjunctival Melanocytic Intraepithelial Lesions (CMIL) are a significant precursor to conjunctival melanoma, a rare but potentially fatal ocular cancer. The histopathological classification of CMIL is crucial for early diagnosis and treatment planning. Current diagnostic methods rely heavily on the subjective assessment of pathologists, leading to variability and potential inconsistencies in diagnosis, especially for low-grade lesions. To improve diagnostic accuracy and consistency, this thesis aims to develop a machine learning model capable of automatically classifying CMIL based on histopathological images. The model will be trained and validated using a diverse dataset of H&E-stained histopathological sections to achieve a high level of accuracy and reliability.

Research Questions

  1. Can a machine learning model be developed to accurately classify CMIL in histopathological images?
  2. What are the optimal preprocessing techniques for histopathological images to enhance model performance?
  3. How does the machine learning model’s performance compare to that of expert pathologists in classifying CMIL?

Methodology The methodology for this thesis involves several key steps to develop and validate a machine learning algorithm for CMIL classification. First, a comprehensive dataset of histopathological images, including hematoxylin-eosin (H&E) stained sections from at least 100 patients, will be compiled. The dataset will be carefully curated to include images with varying staining techniques and scanner types to ensure robustness and generalizability. The images will undergo preprocessing, including normalization and augmentation, to enhance the model’s performance. A convolutional neural network (CNN) will be developed and trained on 90% of the dataset, with 10% reserved for validation. The model’s performance will be rigorously evaluated against expert pathologists’ classifications, using metrics such as accuracy, sensitivity, specificity, and Cohen’s kappa. The goal is to create a model that can provide objective, reproducible classifications that match or exceed the diagnostic accuracy of human experts.

Roadmap: The thesis will be completed within a 6-month period, following this structured timeline:

  1. Month 1: Literature Review and Data Collection
    • Conduct a comprehensive review of existing literature on CMIL and machine learning in histopathology.
    • Gather histopathological image data from collaborating clinics and institutions.
  2. Month 2: Data Preprocessing and Initial Analysis
    • Preprocess collected data to ensure standardization and quality.
    • Perform exploratory data analysis to understand dataset characteristics and variability.
  3. Month 3: Model Development
    • Develop and train the CNN model using the preprocessed dataset.
    • Experiment with various model architectures and hyperparameters to optimize performance.
  4. Month 4: Model Optimization
    • Implement data augmentation and regularization techniques to prevent overfitting.
    • Fine-tune model hyperparameters to achieve optimal performance.
  5. Month 5: Evaluation and Validation
    • Evaluate the model’s performance using the validation set and external datasets.
    • Compare AI model results with those from expert pathologists to assess diagnostic accuracy.
  6. Month 6: Thesis Writing and Defense Preparation
    • Compile findings and insights into a comprehensive thesis document.
    • Prepare for the thesis defense and submit the final thesis.

Requirements:

  • Solid background in Machine/Deep Learning
  • Familiar with discriminative deep learning models and SOTA architectures
  • Sufficient knowledge of Python programming language and libraries (Scikit-learn)
  • Experience with a mainstream deep learning framework such as PyTorch.
  • Machine/Deep learning hands-on experience

References:

  • Damato B, Coupland SE. Management of conjunctival melanoma. Expert Rev Anticancer Ther. 2009.
  • Grossniklaus HE, Eberhart CG, Kivela TT, editors. WHO classification of tumors of the eye. 4th ed. Lyon: IARC.
  • Jakobiec FA, Folberg R, Iwamoto T. Clinicopathologic characteristics of premalignant and malignant melanocytic lesions of the conjunctiva. Ophthalmology. 1989.
  • Milman T, et al. Validation of the Newly Proposed World Health Organization Classification System for Conjunctival Melanocytic Intraepithelial Lesions. Am J Ophthalmol. 2021.
  • Mudhar HS, et al. A multicenter study validates the WHO 2022 classification of conjunctival melanocytic intraepithelial lesions. Lab Invest. 2024.
  • Jiang, F., et al. Artificial intelligence in healthcare: Past, present and future. Seminars in Cancer Biology, 2018.

Interested, please contact Prof. Dr. Shadi Albarqouni

Shadi Albarqouni
Shadi Albarqouni
Professor of Computational Medical Imaging Research at University of Bonn | AI Young Investigator Group Leader at Helmholtz AI

Related