TLDR: HeraldLight is a novel traffic signal control system that uses a dual Large Language Model (LLM) architecture combined with a Herald Module. The Herald Module forecasts queue lengths and extracts real-time traffic data, enabling the LLM-Agent to make fine-grained, second-level signal adjustments. An LLM-Critic then refines these decisions, correcting errors and hallucinations. This approach significantly reduces average travel time (20.03%) and queue length (10.74%) compared to state-of-the-art methods, demonstrates strong scalability and transferability, and drastically cuts down on AI hallucinations, even performing well under extreme weather conditions.
Traffic congestion is a persistent challenge in urban environments, leading to wasted time, increased pollution, and frustration for commuters. Traditional methods for managing traffic signals often struggle to adapt to real-time changes, while even advanced AI solutions like reinforcement learning (RL) can lack flexibility and interpretability. Recent advancements in Large Language Models (LLMs) offer promising avenues for smarter traffic control, but they too face hurdles like fixed timing limitations and the potential for ‘hallucinations’ – inaccurate or inconsistent decisions.
Addressing these critical issues, researchers have introduced HeraldLight, an innovative dual LLM architecture designed for parallel, fine-grained traffic signal control. This system aims to enhance both the efficiency and reliability of urban traffic management by combining predictive intelligence with a robust decision-making framework.
How HeraldLight Works
HeraldLight operates on a unique dual LLM architecture, guided by a specialized component called the Herald Module. Here’s a breakdown of its key elements:
The Herald Module: This module acts as the system’s eyes and ears, continuously extracting real-time contextual information from traffic conditions. Crucially, it forecasts queue lengths for each traffic phase up to 40 seconds in advance. By understanding how traffic is likely to evolve, the Herald Module provides the foundational data needed for dynamic, second-level adjustments to signal timings. It learns scenario-specific dynamics, mapping queue lengths to release times to ensure precise control.
The Dual LLMs Architecture: HeraldLight employs two distinct LLMs working in tandem:
- LLM-Agent: This is the primary decision-maker. Using the forecasts and insights provided by the Herald Module, the LLM-Agent makes fine-grained decisions about which traffic phase should be active and for how long.
- LLM-Critic: To combat the common issue of hallucinations in LLMs, a second, more powerful LLM (instantiated with a model like ChatGPT) acts as a critic. It evaluates the outputs of the LLM-Agent, identifies potential errors or inconsistencies, and proposes corrections. These refined outputs are then used in a score-based fine-tuning process, continuously improving the LLM-Agent’s accuracy and robustness. This iterative interaction between the agent and critic systematically enhances the system’s reasoning capabilities.
This collaborative agent-critic approach allows HeraldLight to achieve dynamic traffic signal control with second-level precision, adapting to fluctuating traffic conditions in real-time.
Also Read:
- A Centralized AI Approach to Adaptive Traffic Control
- Optimizing Urban Traffic Flow with Single-Agent Reinforcement Learning
Performance and Impact
The effectiveness of HeraldLight was rigorously tested using the CityFlow simulator on real-world datasets from 224 intersections across Jinan, Hangzhou, and New York. The results demonstrate significant improvements over existing state-of-the-art methods:
- Reduced Travel Time: HeraldLight achieved a remarkable 20.03% reduction in average travel time across all tested scenarios.
- Shorter Queue Lengths: It also led to a 10.74% reduction in average queue length on the Jinan and Hangzhou networks.
- Scalability: The system proved highly scalable, performing exceptionally well on the large-scale New York network, which features 196 intersections.
- Transferability: HeraldLight showed strong transferability, meaning models trained in one scenario could be effectively applied to others with minimal performance degradation.
- Hallucination Mitigation: The dual LLM architecture drastically reduced hallucination errors, dropping from 9.23% to a mere 0.163% in challenging scenarios, indicating more stable and reliable decision-making.
- Extreme Weather Robustness: Even under simulated extreme weather conditions (reduced acceleration and speed), HeraldLight demonstrated greater adaptability and less performance degradation compared to other leading methods.
The research highlights that fixed-time strategies are insufficient for complex traffic, and that coupling LLM reasoning with a predictive module like Herald significantly boosts control effectiveness. The source code for HeraldLight is publicly available on GitHub, fostering further research and development. You can find the full research paper here: HeraldLight Research Paper.
In conclusion, HeraldLight represents a significant leap forward in intelligent transportation systems, offering a more adaptive, efficient, and reliable solution for managing urban traffic signals. Its ability to make fine-grained, dynamic adjustments while mitigating common AI pitfalls paves the way for smoother, less congested city driving experiences.


