TLDR: A new method combines Global Class Activation Probabilistic Mapping (GCAPM) and SafeML to make AI skin lesion classification safer and more trustworthy. GCAPM provides detailed, multi-class explanations of AI decisions, while SafeML monitors for data shifts and flags uncertain diagnoses for human review, significantly reducing misdiagnosis risk and improving reliability in medical settings.
Artificial intelligence (AI) models are becoming increasingly accurate in classifying skin lesions, with some even outperforming dermatologists. However, a significant hurdle to their widespread adoption in medical practice is a lack of trust. Beyond just high accuracy, medical professionals require trustworthy and understandable diagnoses. Existing methods for explaining AI decisions, such as LIME and CAM, often have reliability issues, with LIME being inconsistent and CAM-based methods typically overlooking insights from all possible classes.
To address these critical limitations, researchers have introduced a novel approach combining Global Class Activation Probabilistic Mapping (GCAPM) with SafeML. This innovative framework aims to enhance both the explainability and safety of skin lesion classification models, ultimately improving patient safety by reducing the risk of misdiagnosis.
The core of the new method, GCAPM, works by analyzing the activation probability maps of all potential classes at a pixel level. Unlike traditional methods that only highlight regions relevant to the predicted class, GCAPM provides a unified visualization of the diagnostic process. This means it can show if the model is also paying attention to features associated with other classes, even if it makes an incorrect prediction. This comprehensive view helps clinicians understand the model’s reasoning more deeply and identify potential areas of uncertainty or misdirection.
Complementing GCAPM is SafeML, a technique designed to monitor machine learning model performance during deployment. SafeML enhances the detection of false diagnoses and issues warnings to doctors and patients when necessary. It does this by monitoring for “data drift”—situations where the operational data the model encounters significantly differs from its training data. By establishing thresholds based on GCAPM metrics during development, SafeML can flag abnormal predictions at runtime, even without immediate access to ground truth labels, thereby improving diagnostic reliability and patient safety.
The methodology involves a two-stage process: an offline stage where the relationship between segmentation outputs and explainability metrics is analyzed, and a runtime stage that uses this knowledge to detect anomalies and ensure reliable predictions. For evaluating the quality of explanations, the researchers introduced specific metrics: “Attribute Sensitivity” (measuring how well the model focused on lesion areas) and “Attribute False Positive Rate” (measuring how much the model mistakenly focused on non-lesion areas). These metrics provide a more nuanced understanding of the model’s attention than standard segmentation metrics.
Furthermore, a “selective prediction” framework was developed using a meta-classifier. This meta-classifier takes the model’s prediction probabilities, along with the Attribute Sensitivity and Attribute False Positive Rate, to determine if the original diagnosis is accurate. If the meta-classifier flags a prediction as uncertain or potentially incorrect, it triggers human intervention, preventing the system from issuing unchecked results to users.
The effectiveness of this combined approach was validated using widely adopted ISIC datasets (2017 and 2019) and modern deep learning architectures like MobileNetV2 and Vision Transformers. To simulate real-world data shifts, the researchers also tested the models with artificially blurred images at various intensity levels.
The experimental results were promising. GCAPM was shown to effectively highlight instances where the model attended to different classes within a lesion, patterns often missed by conventional methods. This deeper insight into the model’s focus mechanisms significantly enhanced explainability and reduced diagnostic risk. A strong correlation was observed between the attribute metrics (Att Sensitivity and Att FPR) and prediction accuracy, suggesting these metrics are reliable indicators of a model’s trustworthiness.
Crucially, predictions falling within the confidence interval derived from these attribute metrics consistently showed higher diagnostic accuracy compared to predictions based solely on probability scores. Even under challenging conditions with simulated data contamination (up to 50% blurring), the selective predictor accurately identified correct predictions approximately 90% of the time and successfully flagged inaccurate predictions with over 75% accuracy in most cases. This demonstrates the framework’s robust ability to proactively detect and avoid high-risk misdiagnoses.
Also Read:
- CLARIFY: A New AI Framework for Accurate and Efficient Skin Condition Diagnosis
- Bridging the Data Gap: eSkinHealth Dataset for Neglected Tropical Skin Diseases in West Africa
In conclusion, this research presents a significant step towards making AI in medical diagnosis safer and more trustworthy. By providing more reliable explanations through GCAPM and actively monitoring for abnormal predictions with SafeML, the system empowers clinicians and patients with greater confidence in AI-assisted diagnoses. Future work will explore the framework’s fairness across diverse skin tones and its applicability to other AI domains, including Generative AI models. For more details, you can read the full research paper here.


