TLDR: DiTree is a new framework that combines diffusion models with sampling-based planners to improve kinodynamic motion planning for robots. It uses learned diffusion policies to intelligently guide the robot’s search for collision-free, dynamically feasible paths, even in new, unseen environments. This hybrid approach results in significantly faster planning and higher success rates compared to traditional methods, while also offering theoretical guarantees and demonstrating real-world applicability.
Robots are becoming an increasingly common sight in various applications, from autonomous driving to complex industrial tasks. A critical challenge for these systems is kinodynamic motion planning (KMP), which involves computing collision-free paths that also respect the robot’s physical limitations and dynamics, such as its speed, acceleration, and turning radius. This is a complex problem, especially in high-dimensional spaces with non-linear dynamics.
Traditionally, this problem has been tackled using sampling-based planners (SBPs). These methods explore the robot’s environment by building a search tree, simulating random actions to find a path. While SBPs offer strong theoretical guarantees, like completeness (eventually finding a solution if one exists), they can be slow and inefficient, especially in cluttered or complex environments, because they rely on uninformed, random exploration.
More recently, learning-based approaches, such as diffusion planning, have emerged. These methods can generate trajectories much faster by learning from expert demonstrations. However, they often struggle with generalization, meaning they perform poorly in environments different from those they were trained on. Crucially, they also lack critical safety guarantees, such as ensuring collision avoidance, which is a major hurdle for deploying them on physical robots.
Introducing DiTree: A Hybrid Approach
A new framework called Diffusion Tree (DiTree) aims to bridge this gap by combining the strengths of both learning and search. Developed by Yaniv Hassidof, Tom Jurgenson, and Kiril Solovey, DiTree leverages diffusion policies (DPs) as ‘informed samplers’ to guide the state-space search within traditional SBPs. This innovative combination allows DiTree to generate provably-safe solutions quickly, even for complex robotic systems operating in entirely new environments.
The core idea behind DiTree is to use a diffusion model, trained on expert trajectories, to intelligently suggest actions for the robot. Instead of random actions, the diffusion policy proposes sequences of actions that are conditioned on the robot’s current state, a target state (either the goal or an exploration-guiding state), and local obstacle information. This ‘context-aware’ sampling helps the robot explore promising directions more efficiently.
One of DiTree’s key features is its ability to generalize. The diffusion policy is trained on a single environment, yet it can effectively plan in wildly different, unseen scenarios. This is achieved by representing the target state relative to the current state, making the planning agnostic to global coordinates, and by conditioning the policy on localized obstacle information rather than the entire scene. Furthermore, DiTree ensures collision avoidance by integrating collision checking directly into its search tree, a guarantee often missing in standalone learning-based methods.
Also Read:
- Advancing Multi-Robot Navigation with Discrete-Guided Diffusion
- Zero-Shot Navigation: How ZeST Helps Robots Map Unknown Environments
Performance and Real-World Impact
In comprehensive evaluations across 15 distinct scenarios and two robot types (a car and a quadruped ‘Ant’ robot), DiTree demonstrated significant improvements. It was, on average, three times faster than classical SBPs and achieved roughly 30% higher success rates. In some complex scenarios where classical SBPs completely failed, DiTree still managed to find solutions. Beyond just finding paths, DiTree also discovered 25-50% shorter trajectories compared to traditional methods, indicating its potential for generating more efficient solutions.
The researchers also conducted an ablation study, revealing that even a single diffusion iteration for action sampling yielded the best performance, highlighting that for tree-based search, faster, coarser samples are often more beneficial than high-quality, time-consuming ones. This suggests that the role of the generative model in DiTree is to prioritize promising search directions rather than to execute entire trajectories.
DiTree’s real-world applicability was demonstrated by deploying it on a physical car robot. In a sharp-corner turning scenario, DiTree consistently navigated without collisions, while a classical RRT planner encountered eight collisions out of ten trials. This success underscores the robustness and quality of the learned sampling, which inherently incorporates a safety margin.
While DiTree represents a significant step forward, the authors acknowledge limitations, including the assumption of full knowledge of obstacles, a 2D workspace, and reliance on pre-collected expert demonstrations. Future work aims to address these by incorporating partial environmental knowledge, extending to 3D environments, and exploring learning behaviors from scratch using reinforcement learning.
This work lays the groundwork for safer, more general, and data-driven motion planners in robotics. For more details, you can read the full research paper here.


