TLDR: A novel method uses Kullback–Leibler divergence within the Kalman filter framework to automatically select the most plausible input–parameter–state estimation. By comparing prior and posterior probability distributions, the method identifies the best-performing identification among multiple runs with different initial parameter guesses, proving effective in linear, nonlinear, and limited information systems for improved system monitoring.
Understanding how complex systems behave, especially when we don’t know all the forces acting on them or their internal properties, is a significant challenge in engineering. Imagine trying to monitor the health of a bridge without knowing the exact traffic loads or the precise stiffness of its components. This is the core problem of “input–system–state identification,” where engineers aim to figure out the external forces (input), the system’s characteristics (parameters), and its current condition (state) simultaneously.
A major hurdle in this process is the uncertainty that arises from different initial guesses about the system’s parameters. When you start an identification algorithm with various initial assumptions, you often get different results, making it difficult to determine which outcome is the most accurate or “plausible.”
A Novel Approach Using Kullback–Leibler Divergence
A new method, developed by Marios Impraimakis from the University of Southampton, addresses this challenge by employing a statistical concept called Kullback–Leibler (K-L) divergence within the well-known Kalman filter framework. The K-L divergence essentially measures how one probability distribution differs from another. In simpler terms, it quantifies the “information gain” when moving from an initial belief (prior distribution) to an updated belief based on new data (posterior distribution).
Here’s how the method works:
- First, the Kalman filter (or its specialized versions like the Unscented Kalman Filter or Residual-based Kalman Filter) is run multiple times. Each run starts with a different set of initial guesses for the system’s parameters. This provides several potential estimations of the system’s input, parameters, and state.
- Next, for each of these estimations, the method compares the resulting “posterior” probability distributions (what the data tells us about the parameters) with the “prior” probability distributions (our initial assumptions about the parameters). This comparison is done using the Kullback–Leibler divergence.
- Finally, the identification run that yields the least Kullback–Leibler divergence is selected as having the most plausible results. The logic here is that a lower K-L divergence indicates less “surprise” or a smaller shift from the initial assumption to the data-informed result, suggesting a more consistent and reliable identification.
This approach is particularly powerful because it automatically selects the best-performing identification without human intervention, even when starting with different initial parameter guesses. It evaluates results in real-time and can directly compare multiple simultaneous identification attempts.
Also Read:
- Real-Time System Monitoring: A New Kalman Filter for Estimating Hidden Forces and Properties
- Boosting CLIP Model Performance with Kalman Filter Fine-Tuning for Enhanced Generalization
Broad Applications and Future Potential
The research demonstrates the effectiveness of this K-L divergence method across various scenarios. It has been successfully applied to linear systems (where relationships are straightforward), nonlinear systems (where relationships are more complex, like those involving Duffing oscillators), and even in “limited information” applications where only a few sensors are available. This versatility makes it a valuable tool for structural health monitoring and other system analysis tasks.
For a deeper dive into the mathematical details and experimental validations, you can read the full research paper available here.
The author emphasizes that while the method has been extensively validated through numerical investigations, future research will involve applying it to real-world experimental data to further solidify its practical utility. This method offers a significant step forward in accurately identifying and monitoring dynamic systems, even under challenging conditions of unknown inputs and limited sensor data.


