spot_img
HomeResearch & DevelopmentAdvancing Thyroid Cancer Recurrence Prediction with Bayesian Neural Networks...

Advancing Thyroid Cancer Recurrence Prediction with Bayesian Neural Networks and Explainable AI

TLDR: A new study introduces a comprehensive framework for classifying differentiated thyroid cancer (DTC) recurrence, integrating machine learning, Bayesian Neural Networks (BNN), and SHAP analysis. The BNN model with a Normal (0,10) prior, applied after feature selection, achieved the highest accuracy of 0.9870. This model not only provides highly accurate predictions but also quantifies uncertainty and offers clear interpretations of feature contributions, with ‘response to initial treatment’ identified as the most influential factor.

Thyroid cancer, particularly differentiated thyroid cancer (DTC), is a significant health concern, with recurrence being a major challenge for patients and clinicians. While survival rates for DTC are generally high, recurrence can occur in 10-30% of patients, necessitating ongoing surveillance and treatment. This highlights the critical need for accurate, interpretable, and uncertainty-aware models to predict and classify DTC recurrence.

A recent study introduces a comprehensive framework designed to improve the classification of DTC recurrence. The research utilized a dataset of 383 patients with 16 clinical and pathological variables, aiming to develop models that are not only precise but also provide insights into their predictions and quantify their confidence levels.

Initial Machine Learning Approaches

The study began by employing 11 different machine learning (ML) models on the complete dataset. Among these, the Support Vector Machines (SVM) model showed the highest accuracy, reaching 0.9481. To simplify the models and remove redundant information, a feature selection process was then carried out using the Boruta algorithm. This algorithm helped identify the most relevant variables for predicting recurrence. When the same ML models were applied to this reduced dataset, the Logistic Regression (LR) model emerged as the top performer, achieving an accuracy of 0.9611.

Addressing Uncertainty with Bayesian Neural Networks

Despite the high accuracies achieved by traditional ML models, a key limitation in clinical decision-making is their lack of uncertainty quantification. These models typically provide a single prediction without indicating how confident they are in that prediction. To overcome this, the researchers implemented Bayesian Neural Networks (BNN). BNNs are a type of neural network that incorporates Bayesian inference, allowing them to quantify uncertainties by learning distributions over model parameters rather than fixed values.

The study explored six different prior distributions for the BNN model, including Normal (0,1), Normal (0,10), Laplace (0,1), Cauchy (0,1), Cauchy (0,2.5), and Horseshoe (1). These varying priors helped assess how different assumptions about the model’s parameters affect its performance and uncertainty estimations. The BNN model with a Normal (0,10) prior distribution demonstrated superior performance, achieving accuracies of 0.9740 before feature selection and an impressive 0.9870 after feature selection. This indicates that the BNN, especially with a less restrictive prior, significantly enhances predictive performance and provides robust uncertainty quantification.

Interpreting Model Predictions with SHAP

Given that the BNN model with Normal (0,10) prior on the feature-selected dataset was identified as the best-performing model, the researchers further analyzed it using SHapley Additive exPlanations (SHAP) values. SHAP is a powerful tool in explainable artificial intelligence (XAI) that helps interpret how each feature contributes to a model’s predictions, thereby reducing the ‘black-box’ nature of complex models.

The SHAP analysis revealed that the ‘response to initial treatment’ was the most influential factor in determining DTC recurrence. Other significant contributors included ‘risk type’ and ‘cancer stage’. The analysis showed that patients with a poor response to initial treatments, higher risk types, and advanced cancer stages had an increased likelihood of recurrence. Interestingly, younger age was associated with a higher risk of recurrence, while older patients showed a lower or neutral effect. The impact of tumor stage and adenopathy was also observed, though their effects were more nuanced and sometimes associated with other variables.

Also Read:

Implications and Future Directions

This study provides a comprehensive framework for classifying differentiated thyroid cancer recurrence by integrating feature selection, machine learning models, uncertainty-aware Bayesian modeling, and SHAP analysis. The findings suggest that BNNs offer a more flexible and robust approach for clinical applications, providing not only accurate predictions but also crucial insights into the model’s confidence and the factors driving its decisions. This can lead to more informed and confident data-driven decision-making in managing DTC recurrence.

While promising, the study acknowledges limitations such as a relatively small sample size and data from a single clinical center, which may affect generalizability. Future research could expand on this by incorporating larger, more diverse datasets, conducting external validation, exploring deeper BNN architectures, and integrating domain-informed priors to further enhance the model’s robustness and applicability. For more detailed information, you can refer to the full research paper here.

Karthik Mehta
Karthik Mehtahttps://blogs.edgentiq.com
Karthik Mehta is a data journalist known for his data-rich, insightful coverage of AI news and developments. Armed with a degree in Data Science from IIT Bombay and years of newsroom experience, Karthik merges storytelling with metrics to surface deeper narratives in AI-related events. His writing cuts through hype, revealing the real-world impact of Generative AI on industries, policy, and society. You can reach him out at: [email protected]

- Advertisement -

spot_img

Gen AI News and Updates

spot_img

- Advertisement -