We started investigating convolutional neural networks for object recognition in a supervised fashion, for example, mitotic figure detection in histology imaging (Albarqouni et al. 2016), catheter electrode detection and depth estimation in interventional imaging (Baur et al. 2016), femur fracture detection in radiology (Kazi et al. 2017), in-depth layer X-ray synthesis (Albarqouni et al. 2017), and pose estimation of mobile X-rays (Bui et al. 2017). One of the first works, which was highly recognized and featured in the media, is AggNet (Albarqouni et al. 2016) for mitotic figure detection in histology images. Although the network architecture was shallow, it was trained on millions of multi-scale RGB patches of histology images, achieving outstanding performance (ranked 3rd among 15 participants in the AMIDA13 challenge).
During this work, we found that such data-driven models demand a massive amount of annotated data, which might not be available in medical imaging and cannot be compensated for by simple data augmentation. We also found that such models are highly sensitive to domain shift, e.g., data from a different scanner, so methods such as domain adaptation are required. Therefore, we have focused our research on developing fully automated, highly accurate solutions that save expert labor and effort and mitigate the challenges in medical imaging, namely i) the availability of only a few annotated data, ii) low inter-/intra-observer agreement, iii) high class imbalance, iv) inter-/intra-scanner variability, and v) domain shift.
To mitigate the problem of limited annotated data, we developed models that learn from a few examples by i) leveraging the massive amount of unlabeled data via semi-supervised techniques (Baur and Albarqouni et al. 2017), ii) utilizing weakly labeled data, which is far cheaper to obtain than densely annotated data (Kazi et al. 2017), iii) generating more examples by modeling the data distribution (Baur et al. 2018), and iv) investigating unsupervised approaches (Baur et al. 2018, Baur et al. 2019).
Collaboration:
- Prof. Peter Nöel, Department of Radiology, University of Pennsylvania, USA
- Prof. Guillaume Landry, Department of Radiation Oncology, Medical Center of the University of Munich, Germany
- Dr. Benedikt Wiestler, TUM Neuroradiologie, Klinikum rechts der Isar, Germany
- Prof. Dr. med. Sonja Kirchhoff, Klinikum rechts der Isar, Germany
- Prof. Diana Mateus, Ecole Centrale Nantes, France
- Prof. Andreas Maier, Friedrich-Alexander-Universität Erlangen-Nürnberg, Germany
- Prof. Pascal Fallavollita, Ottawa University, Canada
Funding:
- Siemens Healthineers
- Siemens AG
Publications
Lvm-med: Learning large-scale self-supervised vision models for medical imaging via second-order graph matching
Obtaining large pre-trained models that can be fine-tuned to new tasks with limited annotated samples has remained an open challenge for medical imaging data. While pre-trained deep networks on ImageNet and vision-language foundation models trained on web-scale data are prevailing approaches, their effectiveness on medical tasks is limited due to the significant domain shift between natural and medical images. To bridge this gap, we introduce LVM-Med, the first family of deep networks trained on large-scale medical datasets. We have collected approximately 1.3 million medical images from 55 publicly available datasets, covering a large number of organs and modalities such as CT, MRI, X-ray, and Ultrasound. We benchmark several state-of-the-art self-supervised algorithms on this dataset and propose a novel self-supervised contrastive learning algorithm using a graph-matching formulation. The proposed approach makes three contributions: (i) it integrates prior pair-wise image similarity metrics based on local and global information; (ii) it captures the structural constraints of feature embeddings through a loss function constructed via a combinatorial graph-matching objective; and (iii) it can be trained efficiently end-to-end using modern gradient-estimation techniques for black-box solvers. We thoroughly evaluate the proposed LVM-Med on 15 downstream medical tasks ranging from segmentation and classification to object detection, in both in-distribution and out-of-distribution settings. LVM-Med empirically outperforms a number of state-of-the-art supervised, self-supervised, and foundation models. For challenging tasks such as Brain Tumor Classification or Diabetic Retinopathy Grading, LVM-Med improves previous vision-language models trained on 1 billion masks by 6-7% while using only a ResNet-50. We release the pre-trained models at https://github.com/duyhominhnguyen/LVM-Med.
Autoencoders for Unsupervised Anomaly Segmentation in Brain MR Images: A Comparative Study
Deep unsupervised representation learning has recently led to new approaches in the field of Unsupervised Anomaly Detection (UAD) in brain MRI. The main principle behind these works is to learn a model of normal anatomy by learning to compress and recover healthy data. This makes it possible to spot abnormal structures from erroneous recoveries of compressed, potentially anomalous samples. The concept is of great interest to the medical image analysis community as it i) relieves us of the need for vast amounts of manually segmented training data—a necessity for, and pitfall of, current supervised Deep Learning—and ii) theoretically allows the detection of arbitrary, even rare, pathologies which supervised approaches might fail to find. To date, the experimental design of most works hinders a valid comparison, because i) they are evaluated against different datasets and different pathologies, ii) they use different image resolutions, and iii) they use different model architectures with varying complexity. The intent of this work is to establish comparability among recent methods by utilizing a single architecture, a single resolution and the same dataset(s). Besides providing a ranking of the methods, we also try to answer questions such as i) how many healthy training subjects are needed to model normality and ii) whether the reviewed approaches are also sensitive to domain shift. Further, we identify open challenges and provide suggestions for future community efforts and research directions.
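The core mechanism shared by the compared approaches — learn to reconstruct healthy anatomy, then flag what cannot be reconstructed — can be illustrated with a minimal PyTorch sketch. The architecture, input size, and training loop below are illustrative assumptions, not any of the specific models evaluated in the study.

```python
import torch
import torch.nn as nn

class ConvAE(nn.Module):
    """Small convolutional autoencoder for 2D brain MR slices (illustrative)."""
    def __init__(self):
        super().__init__()
        self.encoder = nn.Sequential(
            nn.Conv2d(1, 16, 4, stride=2, padding=1), nn.ReLU(),   # 128 -> 64
            nn.Conv2d(16, 32, 4, stride=2, padding=1), nn.ReLU(),  # 64 -> 32
            nn.Conv2d(32, 64, 4, stride=2, padding=1), nn.ReLU(),  # 32 -> 16
        )
        self.decoder = nn.Sequential(
            nn.ConvTranspose2d(64, 32, 4, stride=2, padding=1), nn.ReLU(),
            nn.ConvTranspose2d(32, 16, 4, stride=2, padding=1), nn.ReLU(),
            nn.ConvTranspose2d(16, 1, 4, stride=2, padding=1), nn.Sigmoid(),
        )

    def forward(self, x):
        return self.decoder(self.encoder(x))

def anomaly_map(model, x):
    """Pixel-wise residual between input and reconstruction; high values suggest anomalies."""
    model.eval()
    with torch.no_grad():
        recon = model(x)
    return (x - recon).abs()

# train on healthy data only (placeholder tensors stand in for healthy MR slices)
model = ConvAE()
optimizer = torch.optim.Adam(model.parameters(), lr=1e-4)
healthy_batch = torch.rand(8, 1, 128, 128)
for _ in range(10):
    loss = nn.functional.l1_loss(model(healthy_batch), healthy_batch)
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()

scores = anomaly_map(model, torch.rand(1, 1, 128, 128))  # residual-based anomaly map
```

In practice the residual map would still be post-processed (e.g., smoothed and thresholded) to obtain a binary segmentation.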
Multi-task multi-domain learning for digital staining and classification of leukocytes
[…]looking stained images preserving the inter-cellular structures, crucial for the medical experts to perform classification. We achieve better structure preservation by adding auxiliary tasks of segmentation and direct reconstruction. Segmentation enforces that the network learns to generate correct nucleus and cytoplasm shapes, while direct reconstruction enforces reliable translation between the matching images across domains. Besides, we build a robust domain-agnostic latent space by injecting the target domain label directly into the generator, i.e., bypassing the encoder. This allows the encoder to extract features independently of the target domain and enables an automated domain-invariant classification of the white blood cells. We validated our method on a large dataset composed of leukocytes of 24 patients, achieving state-of-the-art performance on both digital staining and classification tasks.
6D Camera Relocalization in Ambiguous Scenes via Continuous Multimodal Inference
We present a multimodal camera relocalization framework that captures ambiguities and uncertainties with continuous mixture models defined on the manifold of camera poses. In highly ambiguous environments, which can easily arise due to symmetries and repetitive structures in the scene, computing one plausible solution (what most state-of-the-art methods currently regress) may not be sufficient. Instead, we predict multiple camera pose hypotheses as well as the respective uncertainty for each prediction. Towards this aim, we use Bingham distributions to model the orientation of the camera pose, and a multivariate Gaussian to model the position, with an end-to-end deep neural network. By incorporating a Winner-Takes-All training scheme, we finally obtain a mixture model that is well suited for explaining ambiguities in the scene, yet does not suffer from mode collapse, a common problem with mixture density networks. We introduce a new dataset specifically designed to foster camera localization research in ambiguous environments and exhaustively evaluate our method on synthetic as well as real data, on both ambiguous scenes and non-ambiguous benchmark datasets.
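The Winner-Takes-All training scheme can be sketched independently of the Bingham/Gaussian parameterization: each image yields M pose hypotheses, and only the hypothesis closest to the ground truth receives a gradient, leaving the others free to cover alternative modes. The sketch below uses a plain squared error instead of the paper's mixture likelihood, purely for illustration.

```python
import torch

def winner_takes_all_loss(pred_poses, gt_pose):
    """
    pred_poses: (B, M, 7) -- M hypotheses per image (e.g. quaternion + translation).
    gt_pose:    (B, 7)
    Only the hypothesis closest to the ground truth is penalized per sample,
    so the remaining hypotheses stay free to explain other plausible modes.
    """
    errors = ((pred_poses - gt_pose.unsqueeze(1)) ** 2).sum(dim=-1)  # (B, M)
    best, _ = errors.min(dim=1)                                      # (B,)
    return best.mean()

# usage with placeholder tensors
pred = torch.randn(4, 5, 7, requires_grad=True)  # 5 hypotheses per image
gt = torch.randn(4, 7)
loss = winner_takes_all_loss(pred, gt)
loss.backward()
```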
A learning without forgetting approach to incorporate artifact knowledge in polyp localization tasks
Colorectal polyps are abnormalities in the colon tissue that can develop into colorectal cancer. The survival rate for patients is higher when the disease is detected at an early stage and polyps can be removed before they develop into malignant tumors. Deep learning methods have become the state of the art in automatic polyp detection. However, the performance of current models heavily relies on the size and quality of the training datasets. Endoscopic video sequences tend to be corrupted by different artifacts affecting visibility and, hence, the detection rates. In this work, we analyze the effects that artifacts have on the polyp localization problem. For this, we evaluate the RetinaNet architecture, originally defined for object localization. We also define a model inspired by the learning-without-forgetting framework, which allows us to employ artifact detection knowledge in the polyp localization problem. Finally, we perform several experiments to analyze the influence of the artifacts on the performance of these models. To the best of our knowledge, this is the first extensive analysis of the influence of artifacts on polyp localization and the first work incorporating learning-without-forgetting ideas for simultaneous artifact and polyp localization tasks.
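The learning-without-forgetting idea can be sketched as a distillation term: while the detector is adapted to polyp localization, the outputs of the frozen artifact model act as soft targets so that artifact knowledge is retained. The loss weighting, temperature, and the number of artifact classes below are illustrative assumptions, not the paper's exact configuration.

```python
import torch
import torch.nn.functional as F

def lwf_loss(new_artifact_logits, old_artifact_logits, polyp_loss, T=2.0, alpha=0.5):
    """Combine the new-task (polyp) loss with a distillation term that keeps the
    artifact predictions close to those of the frozen original model."""
    p_old = F.softmax(old_artifact_logits / T, dim=1)
    log_p_new = F.log_softmax(new_artifact_logits / T, dim=1)
    distill = F.kl_div(log_p_new, p_old, reduction="batchmean") * (T * T)
    return polyp_loss + alpha * distill

# usage with placeholder tensors
old_logits = torch.randn(8, 7)                              # frozen artifact model (7 artifact classes, assumed)
new_logits = torch.randn(8, 7, requires_grad=True)          # artifact head of the adapted model
polyp_task_loss = torch.tensor(1.3, requires_grad=True)     # e.g. the detector's localization loss
total = lwf_loss(new_logits, old_logits, polyp_task_loss)
total.backward()
```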
An objective comparison of detection and segmentation algorithms for artefacts in clinical endoscopy
We present a comprehensive analysis of the submissions to the first edition of the Endoscopy Artefact Detection challenge (EAD). Using crowd-sourcing, this initiative is a step towards understanding the limitations of existing state-of-the-art computer vision methods applied to endoscopy and promoting the development of new approaches suitable for clinical translation. Endoscopy is a routine imaging technique for the detection, diagnosis and treatment of diseases in hollow organs: the esophagus, stomach, colon, uterus and bladder. However, the nature of these organs prevents imaged tissues from being free of imaging artefacts such as bubbles, pixel saturation, organ specularity and debris, all of which pose substantial challenges for any quantitative analysis. Consequently, the potential for improved clinical outcomes through quantitative assessment of abnormal mucosal surfaces observed in endoscopy videos is presently not fully realized. The EAD challenge promotes awareness of and addresses this key bottleneck problem by investigating methods that can accurately classify, localize and segment artefacts in endoscopy frames as critical prerequisite tasks. Using a diverse, curated, multi-institutional, multi-modality, multi-organ dataset of video frames, the accuracy and performance of 23 algorithms were objectively ranked for artefact detection and segmentation. The ability of methods to generalize to unseen datasets was also evaluated. The best-performing methods (top 15%) propose deep learning strategies to reconcile variabilities in artefact appearance with respect to size, modality, occurrence and organ type. However, no single method outperformed the others across all tasks. Detailed analyses reveal the shortcomings of current training strategies and highlight the need for developing new optimal metrics to accurately quantify the clinical applicability of methods.
Benefit of dual energy CT for lesion localization and classification with convolutional neural networks
Dual Energy CT is a modern imaging technique that is utilized in clinical practice to acquire spectral information for various diagnostic purposes, including the identification, classification, and characterization of different liver lesions. It provides additional information that, when compared to the information available from conventional CT datasets, has the potential to benefit existing computer vision techniques by improving their accuracy and reliability. In order to evaluate the additional value of spectral versus conventional datasets when used as input for machine learning algorithms, we implemented a weakly-supervised Convolutional Neural Network (CNN) that learns liver lesion localization and classification without pixel-level ground truth annotations. We evaluated the lesion classification (healthy, cyst, hypodense metastasis) and localization performance of the network for various conventional and spectral input datasets obtained from the same CT scan. The best results for lesion localization were found for the spectral datasets, with distances of 8.22 ± 10.72 mm, 8.78 ± 15.21 mm and 8.29 ± 12.97 mm for iodine maps, 40 keV and 70 keV virtual mono-energetic images, respectively, while a lesion localization distance of 10.58 ± 17.65 mm was measured for the conventional dataset. In addition, the 40 keV virtual mono-energetic datasets achieved the highest overall lesion classification accuracy of 0.899, compared to 0.854 measured for the conventional datasets. The enhanced localization and classification results that we observed for spectral CT data demonstrate that combining machine-learning technology with spectral CT information may improve the clinical workflow as well as the diagnostic accuracy.
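One common way to obtain localization from image-level labels alone is via class activation maps over a globally-average-pooled CNN; the hedged sketch below illustrates that mechanism. The backbone, input size, and class set are placeholders and not the exact network described in the paper.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class WeaklySupervisedCNN(nn.Module):
    """Image-level classifier whose last conv features also yield a localization map."""
    def __init__(self, n_classes=3):  # e.g. healthy, cyst, hypodense metastasis
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv2d(1, 32, 3, padding=1), nn.ReLU(), nn.MaxPool2d(2),
            nn.Conv2d(32, 64, 3, padding=1), nn.ReLU(), nn.MaxPool2d(2),
            nn.Conv2d(64, 128, 3, padding=1), nn.ReLU(),
        )
        self.classifier = nn.Linear(128, n_classes)

    def forward(self, x):
        fmap = self.features(x)            # (B, 128, H', W')
        pooled = fmap.mean(dim=(2, 3))     # global average pooling
        return self.classifier(pooled), fmap

    def class_activation_map(self, fmap, class_idx):
        # weight the feature maps by the classifier weights of the chosen class;
        # peaks in the resulting map indicate the lesion location
        w = self.classifier.weight[class_idx]            # (128,)
        return F.relu(torch.einsum("c,bchw->bhw", w, fmap))

model = WeaklySupervisedCNN()
logits, fmap = model(torch.rand(2, 1, 128, 128))
cam = model.class_activation_map(fmap, class_idx=logits.argmax(dim=1)[0].item())
```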
Image-to-Images Translation for Multi-Task Organ Segmentation and Bone Suppression in Chest X-Ray Radiography
Chest X-ray radiography is one of the earliest medical imaging technologies and remains one of the most widely used modalities for diagnosis, screening, and treatment follow-up of diseases related to the lungs and heart. The literature in this field of research reports many interesting studies dealing with the challenging tasks of bone suppression and organ segmentation, but performed separately, limiting any learning that comes with the consolidation of parameters that could optimize both processes. This study introduces, for the first time, a multitask deep learning model that simultaneously generates the bone-suppressed image and the organ-segmented image, enhancing the accuracy of both tasks, minimizing the number of parameters needed by the model, and optimizing the processing time, all by exploiting the interplay between the network parameters to benefit the performance of both tasks. The architectural design of this model, which relies on a conditional generative adversarial network, shows how the well-established pix2pix (image-to-image) network is modified to fit the need for multitasking and extended into the new image-to-images architecture. The source code of this multitask model is shared publicly on GitHub as the first attempt at providing a two-task pix2pix extension, a supervised/paired/aligned/registered image-to-images translation, which would be useful in many multitask applications. Dilated convolutions are also used to improve the results through a more effective receptive field assessment. A comparison with state-of-the-art algorithms, an ablation study and a demonstration video are provided to evaluate the efficacy and gauge the merits of the proposed approach.
Liver lesion localisation and classification with convolutional neural networks: a comparison between conventional and spectral computed tomography
Adversarial Networks for Camera Pose Regression and Refinement
Despite recent advances on the topic of direct camera pose regression using neural networks, accurately estimating the camera pose of a single RGB image still remains a challenging task. To address this problem, we introduce a novel framework based, at its core, on the idea of implicitly learning the joint distribution of RGB images and their corresponding camera poses using a discriminator network and adversarial learning. Our method not only allows regressing the camera pose from a single image, but also offers a purely RGB-based solution for camera pose refinement using the discriminator network. Further, we show that our method can effectively be used to optimize the predicted camera poses and thus improve the localization accuracy. To this end, we validate our proposed method on the publicly available 7-Scenes dataset, improving upon the results of direct camera pose regression methods.
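A hedged sketch of how such a discriminator-driven, RGB-only refinement step could look: a discriminator scores (image feature, pose) pairs, and a predicted pose is refined by ascending that score. The feature and pose dimensionalities, architecture, and optimizer settings are assumptions for illustration, not the paper's actual implementation.

```python
import torch
import torch.nn as nn

class PoseDiscriminator(nn.Module):
    """Scores how plausible an (image-feature, pose) pair is (illustrative architecture)."""
    def __init__(self, feat_dim=128, pose_dim=7):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(feat_dim + pose_dim, 128), nn.ReLU(),
            nn.Linear(128, 1),
        )

    def forward(self, feats, pose):
        return self.net(torch.cat([feats, pose], dim=1))

def refine_pose(disc, feats, init_pose, steps=10, lr=1e-2):
    """RGB-only refinement: ascend the discriminator score with respect to the pose."""
    pose = init_pose.clone().detach().requires_grad_(True)
    opt = torch.optim.Adam([pose], lr=lr)
    for _ in range(steps):
        score = -disc(feats, pose).mean()   # minimize the negative score = maximize plausibility
        opt.zero_grad()
        score.backward()
        opt.step()
    return pose.detach()

# usage with placeholder tensors
disc = PoseDiscriminator()
feats = torch.randn(4, 128)     # image features from a regressor backbone (assumed)
coarse = torch.randn(4, 7)      # initial pose predictions (quaternion + translation)
refined = refine_pose(disc, feats, coarse)
```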
Fusing unsupervised and supervised deep learning for white matter lesion segmentation
Unsupervised Deep Learning for medical image analysis is increasingly gaining attention, since it removes the need for annotating training data. Recently, deep generative models and representation learning have led to new, exciting ways for unsupervised detection and delineation of biomarkers in medical images, such as lesions in brain MR. Yet, supervised Deep Learning methods usually still perform better in these tasks, due to an optimization for explicit objectives. We aim to combine the advantages of both worlds into a novel framework for learning from both labeled and unlabeled data, and validate our method on the challenging task of White Matter lesion segmentation in brain MR images. The proposed framework relies on modeling normality with deep representation learning for Unsupervised Anomaly Detection, which in turn provides optimization targets for training a supervised segmentation model from unlabeled data. In our experiments we successfully use the method in a semi-supervised setting for tackling domain shift, a well-known problem in MR image analysis, showing dramatically improved generalization. Additionally, our experiments reveal that in a completely unsupervised setting, the proposed pipeline even outperforms the Deep Learning driven anomaly detection that provides the optimization targets.
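A minimal sketch of the fusion idea under simplifying assumptions: an anomaly map produced by a pretrained unsupervised model is binarized into pseudo-labels, which then supervise a segmentation network on otherwise unlabeled scans. The threshold, the tiny stand-in network, and the soft Dice loss are illustrative choices, not the paper's exact pipeline.

```python
import torch
import torch.nn as nn

def pseudo_labels_from_residual(residual, threshold=0.2):
    """Binarize an unsupervised anomaly (reconstruction-residual) map into lesion targets."""
    return (residual > threshold).float()

def soft_dice_loss(pred, target, eps=1e-6):
    inter = (pred * target).sum()
    return 1.0 - (2.0 * inter + eps) / (pred.sum() + target.sum() + eps)

segmenter = nn.Sequential(                     # stand-in for an FCN / U-Net
    nn.Conv2d(1, 16, 3, padding=1), nn.ReLU(),
    nn.Conv2d(16, 1, 1), nn.Sigmoid(),
)
optimizer = torch.optim.Adam(segmenter.parameters(), lr=1e-4)

unlabeled = torch.rand(4, 1, 128, 128)         # unlabeled FLAIR slices (placeholder)
residual = torch.rand(4, 1, 128, 128)          # from a pretrained anomaly-detection model (placeholder)
target = pseudo_labels_from_residual(residual)

pred = segmenter(unlabeled)
loss = soft_dice_loss(pred, target)
optimizer.zero_grad()
loss.backward()
optimizer.step()
```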
Learning interpretable disentangled representations using adversarial vaes
Learning interpretable representations in medical applications is becoming essential for adopting data-driven models into clinical practice. It has recently been shown that learning a disentangled feature representation is important for a more compact and explainable representation of the data. In this paper, we introduce a novel adversarial variational autoencoder with a total correlation constraint to enforce independence on the latent representation while preserving the reconstruction fidelity. Our proposed method is validated on a publicly available dataset, showing that the learned disentangled representation is not only interpretable, but also superior to state-of-the-art methods. We report a relative improvement of 81.50% in terms of disentanglement, 11.60% in clustering, and 2% in supervised classification with a small amount of labeled data.
Learning Interpretable Features via Adversarially Robust Optimization
Neural networks have proven to be remarkably successful for classification and diagnosis in medical applications. However, the ambiguity in the decision-making process and the interpretability of the learned features is a matter of concern. In this work, we propose a method for improving the feature interpretability of neural network classifiers. Initially, we propose a baseline convolutional neural network with state-of-the-art performance in terms of accuracy and weakly supervised localization. Subsequently, the loss is modified to integrate robustness to adversarial examples into the training process. In this work, feature interpretability is quantified by evaluating the weakly supervised localization using the ground truth bounding boxes. Interpretability is also visually assessed using class activation maps and saliency maps. The method is applied to NIH ChestX-ray14, the largest publicly available chest X-ray dataset. We demonstrate that the adversarially robust optimization paradigm improves feature interpretability both quantitatively and visually.
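A hedged sketch of how adversarial robustness can be folded into the classification objective, here with single-step FGSM perturbations; the perturbation budget, loss weighting, and toy classifier are assumptions for illustration and may differ from the paper's adversarial formulation.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

def fgsm_perturb(model, images, labels, epsilon=0.01):
    """Generate single-step adversarial examples (FGSM)."""
    images = images.clone().detach().requires_grad_(True)
    loss = F.cross_entropy(model(images), labels)
    grad, = torch.autograd.grad(loss, images)
    return (images + epsilon * grad.sign()).detach()

def robust_loss(model, images, labels, epsilon=0.01, beta=0.5):
    """Mix clean and adversarial cross-entropy so that the learned features stay robust."""
    adv = fgsm_perturb(model, images, labels, epsilon)
    return (1 - beta) * F.cross_entropy(model(images), labels) \
           + beta * F.cross_entropy(model(adv), labels)

# usage with a toy classifier standing in for the chest X-ray CNN
model = nn.Sequential(nn.Flatten(), nn.Linear(64 * 64, 14))
x, y = torch.rand(8, 1, 64, 64), torch.randint(0, 14, (8,))
loss = robust_loss(model, x, y)
loss.backward()
```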
Learning-based x-ray image denoising utilizing model-based image simulations
Multi-scale Microaneurysms Segmentation Using Embedding Triplet Loss
Deep learning techniques have recently been applied to fundus image analysis and diabetic retinopathy detection. Microaneurysms are an important indicator of diabetic retinopathy progression. We introduce a two-stage deep learning approach for microaneurysm segmentation using multiple scales of the input, with selective sampling and an embedding triplet loss. The model first segments at two scales, and the segmentations are then refined with a classification model. To enhance the discriminative power of the classification model, we incorporate a triplet embedding loss with a selective sampling routine. The model is evaluated quantitatively to assess the segmentation performance and qualitatively to analyze the model predictions. This approach yields a 30.29% relative improvement over the fully convolutional neural network.
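A minimal sketch of an embedding triplet loss combined with a simple selective-sampling heuristic (hardest in-batch negatives); the embedding dimensionality, batch contents, and the specific sampling rule are illustrative assumptions rather than the paper's exact routine.

```python
import torch
import torch.nn as nn

triplet = nn.TripletMarginLoss(margin=1.0)

def hardest_negative_triplet_loss(embeddings, labels):
    """For each anchor take one positive of the same class and the hardest
    (closest) negative in the batch -- a simple selective-sampling scheme."""
    dist = torch.cdist(embeddings, embeddings)
    a_idx, p_idx, n_idx = [], [], []
    for i in range(len(labels)):
        pos = ((labels == labels[i]) & (torch.arange(len(labels)) != i)).nonzero().flatten()
        neg = (labels != labels[i]).nonzero().flatten()
        if len(pos) == 0 or len(neg) == 0:
            continue
        a_idx.append(i)
        p_idx.append(int(pos[0]))
        n_idx.append(int(neg[dist[i, neg].argmin()]))
    return triplet(embeddings[a_idx], embeddings[p_idx], embeddings[n_idx])

# usage with placeholder tensors
emb = torch.randn(16, 64, requires_grad=True)   # patch embeddings (placeholder)
lbl = torch.randint(0, 2, (16,))                # microaneurysm vs. background patches
loss = hardest_negative_triplet_loss(emb, lbl)
loss.backward()
```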
Preliminary results of DSA denoising based on a weighted low-rank approach using an advanced neurovascular replication system
Towards an Interactive and Interpretable CAD System to Support Proximal Femur Fracture Classification
We demonstrate the feasibility of a fully automatic computer-aided diagnosis (CAD) tool, based on deep learning, that localizes and classifies proximal femur fractures on X-ray images according to the AO classification. The proposed framework aims to improve patient treatment planning and provide support for the training of trauma surgeon residents. A database of 1347 clinical radiographic studies was collected. Radiologists and trauma surgeons annotated all fractures with bounding boxes and provided a classification according to the AO standard. The proposed CAD tool for the classification of radiographs into types 'A', 'B' and 'not-fractured' reaches an F1-score of 87% and an AUC of 0.95; when classifying fractured versus not-fractured cases, these improve to 94% and 0.98, respectively. Prior localization of the fracture results in an improvement with respect to full-image classification. 100% of the predicted centers of the region of interest are contained in the manually provided bounding boxes. The system retrieves on average 9 relevant images (from the same class) out of 10 cases. Our CAD scheme localizes, detects and further classifies proximal femur fractures, achieving results comparable to expert-level and state-of-the-art performance. Our auxiliary localization model was highly accurate in predicting the region of interest in the radiograph. We further investigated several strategies of verification for its adoption into the daily clinical routine. A sensitivity analysis of the size of the ROI and image retrieval as a clinical use case are presented.
A photon recycling approach to the denoising of ultra-low dose X-ray sequences
Capsule networks against medical imaging data challenges
A key component of the success of deep learning is the availability of massive amounts of training data. Building and annotating large datasets for solving medical image classification problems is today a bottleneck for many applications. Recently, capsule networks were proposed to deal with shortcomings of Convolutional Neural Networks (ConvNets). In this work, we compare the behavior of capsule networks against ConvNets under typical dataset constraints of medical image analysis, namely, small amounts of annotated data and class imbalance. We evaluate our experiments on MNIST, Fashion-MNIST and medical (histological and retina images) publicly available datasets. Our results suggest that capsule networks can be trained with less data for the same or better performance and are more robust to an imbalanced class distribution, which makes our approach very promising for the medical imaging community.
Deep autoencoding models for unsupervised anomaly segmentation in brain MR images
Reliably modeling normality and differentiating abnormal appearances from normal cases is a very appealing approach for detecting pathologies in medical images. A plethora of such unsupervised anomaly detection approaches have been proposed in the medical domain, based on statistical methods, content-based retrieval, clustering and, recently, also deep learning. Previous approaches towards deep unsupervised anomaly detection model patches of normal anatomy with variants of Autoencoders or GANs, and detect anomalies either as outliers in the learned feature space or from large reconstruction errors. In contrast to these patch-based approaches, we show that deep spatial autoencoding models can be efficiently used to capture normal anatomical variability of entire 2D brain MR images. A variety of experiments on real MR data containing MS lesions corroborates our hypothesis that we can detect and even delineate anomalies in brain MR images by simply comparing input images to their reconstruction. Results show that constraints on the latent space and adversarial training can further improve the segmentation performance over standard deep representation learning.
Generating highly realistic images of skin lesions with GANs
Like many other machine-learning-driven medical image analysis tasks, skin image analysis suffers from a chronic lack of labeled data and skewed class distributions, which poses problems for the training of robust and well-generalizing models. The ability to synthesize realistic-looking images of skin lesions could act as a reliever for the aforementioned problems. Generative Adversarial Networks (GANs) have been successfully used to synthesize realistic-looking medical images, however limited to low resolution, whereas machine learning models for challenging tasks such as skin lesion segmentation or classification benefit from much higher resolution data. In this work, we successfully synthesize realistic-looking images of skin lesions with GANs at such high resolution. To this end, we utilize the concept of progressive growing, which we compare both quantitatively and qualitatively to other GAN architectures such as the DCGAN and the LAPGAN. Our results show that with the help of progressive growing, we can synthesize highly realistic dermoscopic images of skin lesions that even expert dermatologists find hard to distinguish from real ones.
Intraoperative stent segmentation in X-ray fluoroscopy for endovascular aortic repair
Multiple device segmentation for fluoroscopic imaging using multi-task learning
Scene coordinate and correspondence learning for image-based localization
Scene coordinate regression has become an essential part of current camera re-localization methods. Different variants, such as regression forests and deep learning methods, have been successfully applied to estimate the corresponding camera pose given a single input image. In this work, we propose to regress the scene coordinates pixel-wise for a given RGB image using deep learning. Compared to recent methods, which usually employ RANSAC to obtain a robust pose estimate from the established point correspondences, we propose to regress confidences of these correspondences, which allows us to immediately discard erroneous predictions and improve the initial pose estimates. Finally, the resulting confidences can be used to score initial pose hypotheses and aid in pose refinement, offering a generalized solution to solve this task.
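A hedged sketch of the general idea of jointly regressing scene coordinates and per-pixel confidences: confident pixels are penalized more for coordinate errors, and a log term keeps the network from driving all confidences to zero. The specific loss form and tensor shapes are assumptions for illustration, not the paper's exact objective.

```python
import torch

def confident_coordinate_loss(pred_coords, pred_conf, gt_coords, lam=0.1):
    """
    pred_coords: (B, 3, H, W) regressed scene coordinates
    pred_conf:   (B, 1, H, W) confidences in (0, 1), e.g. after a sigmoid
    gt_coords:   (B, 3, H, W) ground-truth scene coordinates
    Confident pixels pay more for coordinate errors, while the -log(conf)
    regularizer discourages collapsing all confidences to zero.
    """
    err = (pred_coords - gt_coords).norm(dim=1, keepdim=True)   # (B, 1, H, W)
    return (pred_conf * err - lam * torch.log(pred_conf + 1e-6)).mean()

# usage with placeholder tensors
coords = torch.randn(2, 3, 60, 80, requires_grad=True)
conf = torch.sigmoid(torch.randn(2, 1, 60, 80, requires_grad=True))
gt = torch.randn(2, 3, 60, 80)
loss = confident_coordinate_loss(coords, conf, gt)
loss.backward()
```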
Weakly-supervised localization and classification of proximal femur fractures
When regression meets manifold learning for object recognition and pose estimation
In this work, we propose a method for object recognition and pose estimation from depth images using convolutional neural networks. Previous methods addressing this problem rely on manifold learning to learn low-dimensional viewpoint descriptors and employ them in a nearest neighbor search on an estimated descriptor space. In comparison, we create an efficient multi-task learning framework combining manifold descriptor learning and pose regression. By combining the strengths of manifold learning using a triplet loss and pose regression, we can either estimate the pose directly, reducing the complexity compared to a nearest neighbor search, or use the learned descriptors for nearest neighbor descriptor matching. Through an in-depth experimental evaluation of the novel loss function, we observed that the view descriptors learned by the network are much more discriminative, resulting in an almost 30% increase in relative pose accuracy compared to related works. Regarding directly regressed poses, we also obtained a substantial improvement over simple pose regression. By leveraging the advantages of both manifold learning and regression tasks, we improve the current state of the art for object recognition and pose retrieval, which we demonstrate through an in-depth experimental evaluation.
Diagnostic assessment of deep learning algorithms for detection of lymph node metastases in women with breast cancer
Semi-supervised deep learning for fully convolutional networks
Deep learning usually requires large amounts of labeled training data, but annotating data is costly and tedious. The framework of semi-supervised learning provides the means to use both labeled data and arbitrary amounts of unlabeled data for training. Recently, semi-supervised deep learning has been intensively studied for standard CNN architectures. However, Fully Convolutional Networks (FCNs) set the state-of-the-art for many image segmentation tasks. To the best of our knowledge, there is no existing semi-supervised learning method for such FCNs yet. We lift the concept of auxiliary manifold embedding for semi-supervised learning to FCNs with the help of Random Feature Embedding. In our experiments on the challenging task of MS Lesion Segmentation, we leverage the proposed framework for the purpose of domain adaptation and report substantial improvements over the baseline model.
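A simplified sketch of the overall training signal: a supervised segmentation loss on labeled slices plus an auxiliary embedding term computed on unlabeled data. For brevity, the embedding term below is a plain consistency loss between two views of the same unlabeled patch, which is a stand-in for (not an implementation of) the Random Feature Embedding used in the paper; the tiny FCN and the loss weight are likewise illustrative.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class TinyFCN(nn.Module):
    """Stand-in for a fully convolutional segmentation network with an embedding head."""
    def __init__(self):
        super().__init__()
        self.backbone = nn.Sequential(
            nn.Conv2d(1, 16, 3, padding=1), nn.ReLU(),
            nn.Conv2d(16, 32, 3, padding=1), nn.ReLU(),
        )
        self.seg_head = nn.Conv2d(32, 2, 1)     # lesion vs. background
        self.emb_head = nn.Conv2d(32, 16, 1)    # auxiliary per-pixel embedding

    def forward(self, x):
        feats = self.backbone(x)
        return self.seg_head(feats), self.emb_head(feats)

model = TinyFCN()
optimizer = torch.optim.Adam(model.parameters(), lr=1e-4)

labeled = torch.rand(2, 1, 64, 64)                               # labeled slices (placeholder)
labels = torch.randint(0, 2, (2, 64, 64))
unlabeled_a = torch.rand(2, 1, 64, 64)                           # an unlabeled patch ...
unlabeled_b = unlabeled_a + 0.01 * torch.randn_like(unlabeled_a)  # ... and a perturbed view

seg_logits, _ = model(labeled)
_, emb_a = model(unlabeled_a)
_, emb_b = model(unlabeled_b)

supervised = F.cross_entropy(seg_logits, labels)
embedding = F.mse_loss(emb_a, emb_b)     # pull embeddings of corresponding samples together
loss = supervised + 0.1 * embedding
optimizer.zero_grad()
loss.backward()
optimizer.step()
```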
X-ray in-depth decomposition: Revealing the latent structures
X-ray is the most readily available imaging modality and has a broad range of applications that spans from diagnosis to intra-operative guidance in cardiac, orthopedic, and trauma procedures. Proper interpretation of the hidden and obscured anatomy in X-ray images remains a challenge and often requires a high radiation dose and imaging from several perspectives. In this work, we aim at decomposing the conventional X-ray image into d X-ray components of independent, non-overlapping, clipped sub-volumes that separate rigid structures into distinct layers, leaving all deformable organs in one layer, such that the sum resembles the original input. Our proposed model is validated on 6 clinical datasets (∼7200 X-ray images) in addition to 615 real chest X-ray images. Despite the challenging aspects of modeling such a highly ill-posed problem, exciting and encouraging results are obtained, paving the path for further contributions in this direction.
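The central constraint — the predicted layers must sum back to the input X-ray, with per-layer supervision available when simulated decompositions exist — can be sketched as follows. The tiny network, the choice of d, and the L1 losses are placeholders rather than the architecture used in the paper.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class XRayDecomposer(nn.Module):
    """Predict d layer images whose sum should resemble the input X-ray (illustrative)."""
    def __init__(self, d=3):
        super().__init__()
        self.net = nn.Sequential(
            nn.Conv2d(1, 32, 3, padding=1), nn.ReLU(),
            nn.Conv2d(32, 32, 3, padding=1), nn.ReLU(),
            nn.Conv2d(32, d, 1), nn.Sigmoid(),
        )

    def forward(self, x):
        return self.net(x)   # (B, d, H, W): one channel per layer

def decomposition_loss(layers, x, gt_layers=None):
    # the predicted layers must add up to the original image ...
    loss = F.l1_loss(layers.sum(dim=1, keepdim=True), x)
    # ... and, when simulated ground-truth layers exist (e.g. from CT projections),
    # each predicted layer can be supervised directly
    if gt_layers is not None:
        loss = loss + F.l1_loss(layers, gt_layers)
    return loss

model = XRayDecomposer(d=3)
x = torch.rand(2, 1, 128, 128)           # placeholder X-ray batch
loss = decomposition_loss(model(x), x)
loss.backward()
```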
X-Ray PoseNet: 6 DoF pose estimation for mobile X-Ray devices
Precise reconstruction of 3D volumes from X-ray projections requires precisely pre-calibrated systems, where accurate knowledge of the system's geometric parameters is available in advance. However, when dealing with mobile X-ray devices, such calibration parameters are unknown. Joint estimation of the system's calibration parameters and 3D reconstruction is a heavily unconstrained problem, especially when the projections are arbitrary. In the industrial applications that we target here, nominal CAD models of the object to be reconstructed are usually available. We rely on this prior information and employ Deep Learning to learn the mapping between simulated X-ray projections and their poses. Moreover, we introduce a reconstruction loss in addition to the pose loss to further improve the reconstruction quality. Finally, we demonstrate the generalization capabilities of our method in the case where poses can be learned on instances of objects belonging to the same class, allowing pose estimation of unseen objects from the same category and thus eliminating the need for the actual CAD model. We performed an exhaustive evaluation demonstrating the quality of our results on both synthetic and real data.