TLDR: This research introduces an uncertainty-guided online ensemble method for machine learning models in fusion science, specifically for predicting Toroidal Field coil deflections at the DIII-D facility. It addresses the challenge of non-stationary data streams by continuously adapting models. The method uses Deep Gaussian Process Approximation (DGPA) for reliable uncertainty estimation to intelligently combine predictions from multiple learners. Online learning reduced prediction error by 80% compared to static models, and the uncertainty-guided ensemble further improved accuracy by 10% over standard online learning, providing robust and trustworthy predictions for critical fusion applications.
The quest for controlled nuclear fusion, a clean and virtually limitless energy source, relies heavily on advanced technologies and precise control. Next-generation fusion devices, such as tokamaks, generate vast amounts of data that present unique challenges for traditional machine learning models. One significant hurdle is the non-stationary nature of fusion data, meaning its characteristics change over time due to experimental evolution and machine wear-and-tear. This “data drift” causes conventional machine learning models, which assume stable data distributions, to lose accuracy and effectiveness.
A recent study addresses this critical issue by applying online learning techniques to fusion science, an area where it has been largely unexplored. The research focuses on predicting the deflection of Toroidal Field (TF) coils, also known as B-coils, at the DIII-D fusion facility. These coils are crucial for plasma confinement and are subjected to extreme electromagnetic forces, making their precise monitoring and prediction vital for operational safety and performance optimization.
The paper highlights that traditional physics-based models for fusion research are computationally expensive and not easily adaptable in real-time. This creates a significant opportunity for data-driven approaches, particularly those incorporating Machine Learning (ML). However, standard ML models struggle with out-of-distribution (OOD) samples—new data that significantly differs from the original training data—and often lack mechanisms to quantify the uncertainty of their predictions, which is essential for decision-makers in high-stakes environments like fusion facilities.
To overcome these limitations, the researchers propose an innovative uncertainty-guided online ensemble method. This approach leverages Deep Gaussian Process Approximation (DGPA) for calibrated uncertainty estimation. DGPA is a technique that combines the expressive power of deep neural networks with the distance-awareness of Gaussian Processes, allowing it to reliably estimate uncertainties, even for OOD data. These uncertainty values are then used to guide a meta-algorithm that combines predictions from an ensemble of learners, each trained on different historical data horizons.
The study demonstrates the profound impact of online learning. Compared to a static model that does not adapt to new data, the online learning model reduced prediction error by an impressive 80%. This significant improvement underscores the necessity of continuous adaptation for maintaining ML model performance in dynamic fusion environments. Furthermore, the online learning model improved uncertainty calibration by approximately 68%, providing more reliable confidence estimates for its predictions.
Building on the success of single-model online learning, the researchers further enhanced performance through online ensembles. They compared a “naive” online ensemble, which simply averages predictions, with their proposed uncertainty-guided online ensemble. The uncertainty-guided approach, which weights predictions based on their estimated reliability (lower uncertainty means higher weight), showed superior results. It reduced prediction errors by about 10% compared to standard single-model online learning, while the naive ensemble achieved about a 6% reduction. This indicates that using uncertainty to intelligently combine predictions leads to more accurate and robust outcomes.
The ensemble models were trained using different buffer sizes of historical data, allowing them to adapt to various types of data drifts, whether abrupt or gradual. This flexibility is crucial in fusion experiments where data characteristics can change due to maintenance schedules, experimental evolution, and equipment wear-and-tear. The research also emphasizes that while DGPA was used for uncertainty estimation due to its reliability, the ensemble framework is adaptable to any uncertainty quantification technique that can accurately correlate predicted uncertainty with actual error.
Also Read:
- Real-DRL: Bridging the Gap for Safe AI in Physical Systems
- Enhanced System Monitoring: Using Kullback–Leibler Divergence for Reliable Identification
In conclusion, this research presents a significant step towards enabling the long-term deployment of machine learning in fusion applications. By demonstrating the effectiveness of online learning and introducing a novel uncertainty-guided online ensemble method, the authors provide a framework for self-sustaining ML models that can continuously adapt to non-stationary data streams, offer reliable uncertainty-aware predictions, and ultimately contribute to the safe and efficient operation of next-generation fusion devices. You can read the full paper here.


