TLDR: This research introduces a Federated Learning framework for segmenting ischemic stroke lesions from MRI scans across multiple healthcare institutions. By collaboratively training a model without sharing raw patient data, the FedAvg aggregation rule achieved superior and more generalized performance compared to centralized and other federated methods. This approach effectively overcomes challenges posed by diverse patient demographics, scanner types, and expert annotations, providing robust segmentation capabilities even for institutions with limited data and training resources, all while preserving patient privacy.
Stroke is a critical global health issue, ranking as the second leading cause of death and the third leading cause of disability worldwide. Ischemic stroke, specifically, accounts for over 80% of all stroke cases, occurring when blood vessels in the brain become blocked. Accurate and timely identification and measurement of infarct volume—the area of dead tissue—are crucial for effective treatment and prognosis. Diffusion Resonance Imaging (DWI) and Apparent Diffusion Coefficient (ADC) maps are standard tools for this, allowing radiologists to distinguish between affected and healthy brain tissue.
However, analyzing stroke lesions is complex due to significant variations among patients, different scanner technologies used in hospitals, and even inconsistencies in how expert radiologists annotate lesions. Traditional computational methods designed to help localize and segment these lesions often learn patterns from data collected at a single institution. This limits their ability to generalize and accurately identify lesions with diverse shapes and characteristics across different hospitals. Furthermore, many clinical centers lack sufficient labeled data to train or adapt these specialized solutions effectively.
This research introduces a groundbreaking collaborative framework that leverages Federated Learning (FL) to segment ischemic stroke lesions in DWI sequences. Federated Learning is a privacy-preserving approach that allows multiple institutions to collaboratively train a shared model without directly sharing their sensitive patient data. Instead, only model updates or “knowledge” are shared, protecting patient privacy while enabling a more robust and generalized model.
Addressing Multi-Institution Challenges with Federated Learning
The study involved an extensive dataset of 2031 DWI and ADC neuroimages, emulating 14 distinct healthcare centers. These centers were categorized into “large centers” with sufficient data and computational capabilities for model training, and “limited centers” which served as out-of-distribution institutions to test the model’s generalization without additional training. This setup realistically mimics the diverse environments found in real-world healthcare.
The core idea is to overcome the “domain-shift” problem, where models trained on data from one hospital perform poorly when applied to data from another due to differences in equipment, patient demographics, and imaging protocols. By sharing knowledge from deep center-independent representations, the federated model learns from the collective experience of all participating institutions.
The researchers compared five different federated aggregation rules, including FedAvg, VanillaAvg, Beta Weighting, Softmax, and FedProx, against a centralized baseline model (trained on all combined data). The FedAvg model emerged as the top performer, achieving a Dice Score (DSC) of 0.71 ±0.24, an Absolute Volume Difference (AVD) of 5.29 ±22.74, an Absolute Lesion Difference (ALD) of 2.16 ±3.60, and a Lesion F1 Score (LF1) of 0.70 ±0.26 across all centers. Notably, FedAvg outperformed both the centralized approach and other federated rules.
Strong Generalization and Consistent Performance
A key finding was the FedAvg model’s remarkable generalization properties. It demonstrated uniform performance across different lesion categories (small, medium, and large) and maintained reliable performance in the out-of-distribution “limited centers.” In these limited centers, the FedAvg model achieved a DSC of 0.64 ±0.29 and an AVD of 4.44 ±8.74 without any additional training. This is particularly significant because it means smaller hospitals or those with limited data can benefit from a highly accurate segmentation model without needing to train it themselves, thus lowering barriers to advanced computational support.
The study highlights that federated learning can effectively address the variability inherent in ischemic stroke imaging, encompassing diverse patient demographics, scanner vendors, and lesion morphologies. The FedAvg approach not only achieved faster convergence during training but also consistently outperformed the centralized baseline in many metrics, especially for large centers and across different lesion sizes.
For instance, in large centers, FedAvg showed improved detection for small lesions and better delineation for large lesions. In limited centers, it surpassed the centralized baseline for small and medium lesion volumes. This consistent performance across various scenarios underscores the robustness and practical utility of the federated approach.
Also Read:
- Efficient Data Erasure: Machine Unlearning’s Role in Medical AI Privacy
- Adaptive Pruning for Federated Person Re-Identification
Future Implications
This research confirms Federated Learning as a promising and practical solution for ischemic stroke segmentation in real-world clinical settings. It offers a way to develop powerful, generalizable segmentation models while strictly adhering to patient privacy regulations like HIPAA and GDPR. The framework provides functional segmentation models to centers that lack the resources for extensive training or fine-tuning, democratizing access to advanced AI tools in healthcare.
The code supporting these results is available online, fostering transparency and further research in this critical area. You can find more details about this research paper here: Federative Ischemic Stroke Segmentation Research.


