TLDR: This paper introduces a new method called CADY (CAusally-informed DYnamics) for training robot dynamics models. Instead of learning a single causal relationship, CADY learns a distribution of possible causal structures, which makes robot planning more robust and efficient. It achieves this by using a novel approach to estimate causal links and masking its internal representations. Experiments show CADY significantly reduces computational needs, improves resilience to noisy data, and adapts better to unexpected changes in robot dynamics, validated across various robot platforms in both simulation and real-world settings.
Robotics systems are intricate, with many components interacting in complex ways. For robots to perform tasks reliably, especially in unpredictable real-world environments, understanding these interactions is crucial. A powerful tool for this is the Structural Causal Model (SCM), which maps out how different parts of a robot system influence each other. SCMs offer a significant advantage: they help robots generalize better to new situations and become more robust, even if some sensor data is corrupted. This is because robotic systems often have “sparse interactions,” meaning one component usually only affects a few others directly.
However, figuring out the exact SCM for a robot is incredibly challenging. The number of possible causal structures grows astronomically with more variables, making it computationally expensive. Plus, sometimes multiple different causal structures can explain the same observed data, making it hard to pick just one “correct” model.
To overcome these hurdles, a new research paper introduces a novel approach: instead of trying to find a single causal graph, they propose learning a distribution over possible causal structures. This means the robot’s planning system can consider multiple plausible ways its components might interact, leading to more robust and reliable behavior.
The researchers made two key innovations. First, they realized that a robot’s dynamics (how it moves from one state to the next) can only flow forward in time. This allows them to significantly narrow down the search space for SCMs to a specific type called “bipartite Directed Acyclic Graphs” (DAGs). Second, they found an efficient way to estimate the parameters of this causal structure distribution using a technique called Integrated Gradients (IG), which helps determine how much each input feature contributes to an output.
Based on this, they developed a model called CAusally-informed DYnamics (CADY). CADY uses a probabilistic neural network architecture that incorporates these sampled causal structures to “mask” its internal representations. This masking process essentially tells the model which parts of its internal knowledge are relevant for predicting specific outcomes, based on the learned causal relationships. This makes the dynamics model more robust to unexpected disturbances and changes.
The training of CADY involves two steps: first, a “contribution model” learns from the robot’s experience, and then this model is used to estimate the causal structure distribution. Finally, the main CADY dynamics model is trained using masks sampled from this distribution. When the robot is deployed, the learned causal relationships are preserved, making its predictions more reliable.
For planning, CADY integrates with existing control strategies like Model Predictive Control (MPC). By sampling causal structures at each step, the planner can account for the uncertainty about the true underlying causal relationships, making its action selection more robust. It also naturally incorporates other types of uncertainty, like unexpected disturbances.
Also Read:
- Enhancing World Models for Robotics Through Novelty Detection
- Unpacking Shortcut Learning in Robot Policies: Why Dataset Structure Matters for Generalization
Key Findings and Advantages
CADY was validated through extensive experiments using various robots, including manipulators and mobile robots, in both simulations and real-world scenarios. Their findings are impressive:
Lower Computational Requirements: CADY significantly reduces the computational cost and the number of parameters needed compared to other methods. This makes it highly suitable for real-time operation on robots with limited processing power. For example, in some tasks, it required nearly 20 times fewer operations.
Increased Robustness: CADY showed superior resilience to common real-world challenges like missing sensor data or noisy inputs. When a sensor failed or data was corrupted, CADY’s performance degraded much less than other models. This is attributed to its causal structure, which prevents noise from spreading throughout the model.
Adaptability to Changes: The model demonstrated a remarkable ability to adapt to unexpected changes in the robot’s dynamics, such as varying terrain friction. Even after sudden “interventions” that altered the robot’s behavior, CADY maintained better prediction accuracy and adapted more quickly after fine-tuning. The model’s ability to reduce the influence of non-causal variables was a key factor here.
In conclusion, this research presents a significant step forward in robot learning and planning. By focusing on learning distributions of causal structures, the CADY model offers a more robust, efficient, and adaptable solution for robots operating in complex and uncertain environments. You can find the full research paper here: Learning Causal Structure Distributions for Robust Planning.


