TLDR: A new AI foundation model uses a dual-encoder architecture and triplet-based contrastive learning to separate true physical signals from instrument distortions in structured time series data. Tested on simulated astronomical observations, it significantly outperforms traditional models in disentangling factors and improving prediction accuracy, especially with limited data, demonstrating strong generalization capabilities.
Observational data across various scientific and industrial fields often presents a fundamental challenge: the information we gather is a blend of the true underlying phenomena and systematic distortions introduced by the measurement instruments themselves. This entanglement makes it difficult for advanced artificial intelligence models, known as foundation models, to generalize effectively, especially when dealing with diverse or multi-instrument datasets.
In astronomy, for instance, vast amounts of data are collected, but the signals from celestial objects are frequently intertwined with effects from telescopes and sensors. This mixing limits how well AI models can interpret and apply what they learn, hindering their ability to make accurate predictions or discoveries.
A Novel Approach to Disentanglement
To address this, researchers have developed a new AI model called a ‘Causal Foundation Model’. This innovative model is specifically designed to explicitly separate the inherent physical properties of an observed system from the systematic effects caused by the measurement process. It draws inspiration from causal representation learning and employs a technique called structured contrastive learning.
The model utilizes a ‘dual-encoder’ architecture, meaning it has two independent processing units. One encoder focuses on extracting the underlying physical signal (e.g., the true variability of a star), while the other learns the unique characteristics of the instrument’s distortions. Both encoders process the same input data. To train this system, the model leverages naturally occurring ‘observational triplets’. These triplets consist of an ‘anchor’ observation, an observation of the ‘same star’ but with a different instrument, and an observation of a ‘different star’ but with the ‘same instrument’. This clever setup allows the model to learn what remains constant about a star regardless of the instrument, and what is consistent about an instrument regardless of the star it observes.
Once the separate physical and instrumental representations are learned, a decoder combines them multiplicatively to reconstruct the original input. During training, additional ‘contrastive objectives’ are applied to ensure that the stellar latent space (the model’s internal representation of the star) is invariant to instrument changes, and the instrumental latent space is invariant to changes in the observed star.
Testing and Results
To validate their approach, the researchers created a simulated dataset of time series observations. This dataset was designed to mimic the complexity of variable star light curves, similar to those observed by missions like NASA’s Transiting Exoplanet Survey Satellite (TESS). This controlled environment allowed for a precise evaluation of the model’s ability to disentangle physical and instrumental factors.
The results were highly encouraging. Analysis of the learned latent spaces showed that the stellar latent space strongly aligned with intrinsic physical properties of the stars and exhibited minimal clustering based on the instrument used. Conversely, the instrument latent space was well-structured by instrument configuration. This demonstrated the model’s success in separating these distinct factors.
Furthermore, when the disentangled representations were used for downstream prediction tasks, such as predicting a primary stellar parameter, the Causal Foundation Model significantly outperformed traditional foundation models that use a single, shared latent space. This performance advantage was particularly pronounced in ‘low-data regimes’, meaning the model could achieve comparable or better results with ten times less training data. This highlights the model’s ability to support key foundation model capabilities like few-shot generalization and efficient adaptation.
Also Read:
- Disentangling Multi-Scale Features for Enhanced Time Series Classification
- A Unified AI Model for Predicting Wireless Network Dynamics
Broader Implications
This research underscores the critical importance of embedding causal structure into foundation models. The methodology, while demonstrated in astronomy, has broad applicability to other domains where observational data conflates true signals with measurement distortions. This could include fields such as biomedicine, remote sensing, or climate forecasting, where similar structured relationships in data can be identified or constructed.
The full research paper provides more technical details and can be accessed at this link.


