TLDR: IMAC is a novel framework designed to improve EEG signal classification by addressing data distribution shifts caused by heterogeneous recording setups. It formulates cross-domain alignment as a spatial time series imputation task, standardizing electrode configurations, decoupling temporal and spatial information, and using a channel-dependent masking and reconstruction strategy. This approach enables the model to learn robust, aligned signal representations, leading to state-of-the-art classification accuracy across various EEG datasets and scenarios, demonstrating strong resilience to real-world data variability.
Electroencephalogram (EEG) signals, which capture brain activity, are vital for developing brain-computer interfaces (BCI) used in areas like rehabilitation, emotion recognition, and diagnosing neurological disorders. However, classifying these signals accurately in real-world settings is challenging due to significant variations in how the data is collected. Differences in electrode configurations, acquisition protocols, and hardware can cause what are known as ‘data distribution shifts’ or ‘domain gaps’ between datasets, making it difficult for models trained on one dataset to perform well on another.
Introducing IMAC: A Novel Approach to EEG Alignment
A new framework called IMAC (IMpute And Classify) has been introduced to address these persistent challenges. Unlike traditional methods that focus on learning common features across different data sources, IMAC tackles the problem by directly modifying and correcting the EEG data itself. It redefines the alignment of cross-domain EEG data shifts as a ‘spatial time series imputation’ task, essentially learning to fill in missing or inconsistent spatial information in the brain signals.
How IMAC Works: A Three-Module Framework
IMAC’s innovative approach is built upon three core modules:
-
Spatial Topology Unification Module (STUM): EEG recordings often use different numbers and arrangements of electrodes. STUM standardizes these diverse electrode layouts by mapping them onto a unified 2D spatial representation. If some electrodes are missing in a particular setup, it uses a technique called radial basis function (RBF) interpolation to estimate and fill in those missing channels, ensuring a consistent 64-channel spatial topology across all datasets.
-
Temporal-Spatial Decomposition Module (TSDM): EEG signals are complex, with both temporal (time-based) dynamics and spatial (location-based) distributions. Directly reconstructing spatial information can be difficult due to this strong coupling. TSDM cleverly decouples these components, allowing the model to learn temporal patterns and spatial structures independently. It uses pre-learned ‘temporal pattern pools’ (representing trends, seasonality, and residuals) and a channel-dependent encoder to capture spatial correlations.
-
Channel-dependent Mask and Imputation Module (CMIM): This is where the core ‘imputation’ happens. CMIM simulates real-world data variability by randomly masking (hiding) certain channels in the EEG signal’s spatial representation. It then uses the surrounding contextual information to reconstruct these masked channels. This process teaches the model to robustly recover from missing data and align signals across different domains. The module uses a combination of ‘fidelity loss’ (ensuring accurate reconstruction) and ‘consistency loss’ (ensuring stable reconstructions under different masking patterns) to achieve high-quality imputation.
During inference (when the model is used for classification), IMAC uses its learned reconstruction patterns to deterministically fill in any missing channels, transforming the signals into a consistent format before feeding them into a classifier. This streamlined process ensures robust performance even with unseen channel configurations.
Also Read:
- Unlocking AI Potential with Astromorphic Transformers
- MissDDIM: A New Era of Fast and Consistent Data Imputation for Tabular Data
Superior Performance Across Diverse Datasets
The effectiveness of IMAC was rigorously evaluated across 10 publicly available EEG datasets covering various tasks, including Parkinson’s Disease detection, motor imagery, and emotion recognition. The results consistently demonstrated IMAC’s superior performance compared to state-of-the-art baseline methods in both cross-subject (different individuals) and cross-center (different recording facilities) validation scenarios.
Notably, IMAC achieved significant improvements in classification accuracy, especially on challenging datasets known for missing channels and inconsistent electrode configurations. For instance, on the BNCI dataset for motor imagery, IMAC surpassed the second-best method by a substantial margin, highlighting its ability to mitigate performance degradation caused by severe spatial distortions. The framework also showed strong robustness under simulated real-world distribution shifts, maintaining high signal integrity even with added noise or channel masks.
This research underscores the potential of incorporating spatial imputation as a fundamental strategy to address a wide array of adaptation challenges in EEG classification. For more in-depth information, you can refer to the full research paper available at https://arxiv.org/pdf/2508.03437.


