TLDR: IHiD (Intention-aware Hierarchical Diffusion model) is a novel unsupervised method for detecting anomalies in long-term trajectories. It uses a hierarchical approach, combining Inverse Q Learning for high-level intent evaluation (identifying unusual subgoal choices) and a diffusion model for low-level subtrajectory generation (detecting fine-grained deviations). This dual-layered system effectively captures the diversity of normal trajectories and achieves significant performance improvements over state-of-the-art baselines in real-world datasets like taxi and vessel traffic, particularly for complex anomalies like route-switching.
Long-term trajectory anomaly detection is a critical challenge in various real-world applications, from traffic management to maritime monitoring. These trajectories, which span extended periods and contain rich contextual information, are notoriously difficult to analyze due to their inherent diversity and complex spatial-temporal dependencies. Traditional methods often fall short, struggling to capture the full spectrum of normal behaviors and distinguish them from subtle or significant deviations.
A new research paper introduces an innovative solution: the Intention-aware Hierarchical Diffusion model (IHiD). This unsupervised method tackles the problem by simultaneously considering both the high-level intentions of agents (like a vehicle’s planned route) and the low-level details of their navigation (the actual path taken). The paper, authored by Chen Wang, Sarah Erfani, Tansu Alpcan, and Christopher Leckie from The University of Melbourne, proposes a two-tiered approach to identify anomalies more effectively than previous state-of-the-art techniques. You can read the full paper here: Intention-aware Hierarchical Diffusion Model for Long-Term Trajectory Anomaly Detection.
Understanding the IHiD Approach
The core idea behind IHiD is to mimic how humans break down complex tasks. A long journey, for instance, can be seen as a sequence of smaller goals (subgoals), each with a corresponding segment of travel (subtrajectory) to reach it. IHiD leverages this hierarchical structure with two main components:
1. High-Level Intent Evaluation (Inverse Q Learning): This part of the model focuses on the agent’s decision-making at a broader level. It uses a technique called Inverse Q Learning (IQL) to learn what constitutes a ‘normal’ sequence of subgoals. By analyzing typical trajectories, the IQL model develops an understanding of expected intentions. When a new trajectory is observed, IHiD assesses whether the agent’s chosen next subgoal aligns with these learned normal intentions. If the ‘Q-value’ (a measure of expected future reward) for a selected subgoal is unusually low, it signals a potential anomaly, suggesting an unexpected or ‘unintended’ high-level decision.
Imagine a vessel’s journey: normally, it might go from port A to waypoint B, then to port C. If it suddenly decides to go from port A to an unusual waypoint X, the high-level model would flag this as suspicious, even if the path to X itself is physically possible.
2. Low-Level Subtrajectory Generation (Diffusion Model): Once a subgoal selection is deemed normal, IHiD delves into the fine-grained details of the actual path taken to reach that subgoal. It employs a diffusion model, a powerful type of generative AI, to reconstruct the expected subtrajectory based on the given subgoal information. Diffusion models are excellent at capturing the diverse ways normal trajectories can unfold. By comparing the actual subtrajectory with the model’s reconstruction, IHiD calculates a ‘reconstruction error’. A large error indicates that the observed path deviates significantly from what’s considered normal for that specific subgoal, thus identifying a low-level anomaly, such as a ‘detour’.
Why IHiD Stands Out
This hierarchical design offers several key advantages:
- It’s the first known work to integrate Inverse Q Learning and diffusion models within a single framework for trajectory anomaly detection.
- It effectively maintains long-term spatial-temporal consistency by accounting for the uncertainty in agents’ intentions and using a subgoal-based diffusion model for subtrajectory generation.
- The method is robust enough to detect anomalies even in highly diverse datasets, where normal trajectories can vary significantly.
- Experiments show IHiD achieves substantial improvements in anomaly detection performance, with up to a 30.2% increase in F1 score over existing methods.
Real-World Performance
The researchers tested IHiD on two real-life datasets: the Chengdu taxi trajectory dataset and the AIS (Automatic Identification System) vessel traffic dataset. They evaluated its performance against various types of anomalies, including ‘big detours’ (significant deviations), ‘small detours’ (minor deviations between subgoals), and ‘route-switching’ anomalies (changing from one normal route to another).
IHiD consistently outperformed a wide range of baselines, including classic machine learning methods and advanced deep learning models. It showed particular strength in detecting challenging ‘route-switching’ anomalies, which often stump other methods. The study also highlighted the complementary nature of its two components: the high-level IQL model excels at spotting unusual subgoal choices, while the low-level diffusion model is crucial for catching subtle deviations within a subtrajectory.
Also Read:
- Advanced AI for Autonomous Driving: Learning Multiple Valid Actions in Urban Environments
- Smart Control: How AI Teams Learn Safely with a Hierarchical Approach
Looking Ahead
The IHiD model represents a significant step forward in long-term trajectory anomaly detection. By understanding both the ‘why’ (intentions) and the ‘how’ (specific paths) of agent movement, it provides a more comprehensive and accurate way to identify unusual behaviors in complex environments. Future work aims to enhance the model’s ability to generate even more realistic trajectories by incorporating physical and road network constraints, further bridging the gap between theoretical models and real-world applications.


