TLDR: This research systematically evaluates deep learning models and explainable AI (xAI) techniques for automated region-of-interest (ROI) detection in knee MRI scans, focusing on meniscus injuries. It compares supervised models like ResNet50, InceptionV3, and Vision Transformers, alongside self-supervised U-Net variants. The study found that ResNet50 consistently excelled in classification and ROI identification, outperforming transformer-based models on the MRNet dataset. Grad-CAM was identified as the most effective xAI method for providing clinically meaningful explanations. The findings suggest that CNN-based transfer learning is currently the most effective approach for knee MRI classification given existing data constraints, while highlighting the potential for future advancements with larger datasets and transformer models.
Magnetic Resonance Imaging (MRI) is a vital tool for diagnosing knee injuries, but the traditional method of manually reviewing MRI scans is both time-consuming and can vary significantly between different medical professionals. This manual process can lead to inefficiencies, especially when identifying specific areas of interest (ROIs) like meniscus injuries, which are crucial for accurate diagnosis.
To address these challenges, a recent study systematically evaluated various deep learning models combined with explainable AI (xAI) techniques. The goal was to automate the detection of ROIs in knee MRI scans, specifically focusing on meniscus injuries, and to ensure that the AI’s decisions are transparent and clinically meaningful. The researchers aimed to find the best combination of AI models and xAI methods that could reliably assist in diagnostic decision-making in a clinical setting.
Exploring Different AI Approaches
The study investigated several deep learning architectures, including well-known models like ResNet50, InceptionV3, and Vision Transformers (ViT), as well as different versions of U-Net models enhanced with multi-layer perceptron (MLP) classifiers. These models were tested using both supervised learning (where the AI learns from labeled data) and self-supervised approaches (where the AI learns patterns without explicit labels).
To make the AI’s decisions understandable, the researchers integrated xAI methods such as Grad-CAM and Saliency Maps. These techniques generate visual heatmaps that highlight the specific regions in the MRI scan that the AI model focused on when making a prediction. This helps clinicians understand why a model made a particular diagnosis.
The study utilized the MRNet dataset from Stanford University Medical Center, which contains knee MRI data from over a thousand patients. This dataset includes scans with varying numbers of slices and provides labels for abnormalities, ACL tears, and meniscal tears, making it suitable for training and evaluating diagnostic AI systems.
Key Findings and Model Performance
The results showed that among the supervised classification models, ResNet50 consistently performed the best for diagnosing meniscus tears and identifying ROIs. It achieved an AUC (Area Under the Receiver Operating Characteristic Curve) of 0.8184 and an accuracy of 0.74. InceptionV3 and Vision Transformers, despite their advanced capabilities, did not perform as well, likely because they require much larger datasets or more specific pre-training to reach their full potential, which was not available in this study’s context.
For self-supervised models, a custom U-Net trained for image reconstruction showed excellent fidelity, indicating it learned robust structural features from the MRI scans. When this U-Net was combined with an MLP classifier for diagnosis, it demonstrated that features learned during reconstruction could be repurposed for diagnostic predictions, though its classification accuracy was lower than ResNet50’s.
Regarding explainable AI, Grad-CAM proved to be the most effective method for generating clear and clinically meaningful explanations. It consistently highlighted relevant regions in the MRI scans, helping to validate the model’s decisions.
Also Read:
- Advanced AI for Precise Pathological Liver Segmentation
- Advancing Breast Cancer Imaging with AI-Synthesized MRI
Implications for Future Diagnostics
The study concluded that for knee MRI classification with current data limitations, CNN-based transfer learning, particularly with ResNet50, is the most practical and effective approach. While transformer-based models and hybrid U-Net approaches show promise, they currently face challenges with smaller medical datasets. This research provides valuable insights for AI developers and clinicians, suggesting that AI-assisted tools can significantly reduce diagnostic workload and improve accuracy by transparently highlighting diagnostically relevant regions in MRI scans. For more details, you can refer to the full research paper: A Systematic Study of Deep Learning Models and xAI Methods for Region-of-Interest Detection in MRI Scans.
Future work will involve more extensive experiments, especially with larger-scale pre-trained Vision Transformers, to explore if these models can eventually surpass CNN-based methods in this specialized medical imaging domain.


