TLDR: This research introduces a dual-loop multi-agent system for 6G networks that leverages Large Language Models (LLMs) to significantly enhance task planning and execution efficiency. By decomposing complex tasks into smaller, parallel subtasks handled by collaborating agents on edge and terminal devices, the system effectively overcomes inherent LLM limitations like resource constraints and improves performance. The framework, validated through a case study in urban emergency response, demonstrates superior success rates and reduced latency compared to existing methods, paving the way for more intelligent and autonomous 6G services.
The future of wireless communication, 6th generation (6G) networks, promises an era of ubiquitous computing resources and native intelligence. This advanced environment is ideal for integrating artificial intelligence, particularly Large Language Models (LLMs), into intelligent services through an agent framework. These LLM-enabled agents, equipped with planning capabilities and auxiliary modules, can autonomously understand diverse environments and user intentions, then plan and execute actions accordingly.
However, individual network devices often have limited resources, which can significantly hinder the efficient operation of complex LLM-enabled agents, especially when they need to call multiple tools. This challenge highlights a critical need for efficient collaboration across different levels of network devices.
Introducing the Dual-Loop Edge-Terminal Collaboration Framework
To address these limitations, researchers have proposed a novel framework and method for an LLM-enabled multi-agent system (MAS) that features dual-loop terminal-edge collaborations within 6G networks. This system is designed to enhance both task planning capabilities and execution efficiency.
The framework operates with two main loops:
-
The Outer Loop: This loop involves iterative collaborations between a ‘global agent’ and multiple ‘sub-agents’ deployed on edge servers and terminal devices. The global agent is responsible for breaking down complex tasks into smaller, parallel sub-tasks. These sub-tasks are then distributed among the sub-agents, significantly enhancing the overall planning capability by simplifying individual agent responsibilities.
-
The Inner Loop: Within this loop, sub-agents, each with a dedicated role, continuously reason, execute, and replan their assigned sub-tasks. A key innovation here is the incorporation of parallel tool calling generation, along with intelligent offloading strategies, to dramatically improve execution efficiency. This means multiple tools can be used simultaneously, and their execution can be shifted to the most suitable device (edge or terminal) based on resource availability and task requirements.
How the System Works: Key Components
The proposed dual-loop multi-agent system integrates several crucial components:
-
Multimodal Perception: Agents are designed to perceive and understand information from various sources, such as video, audio, and text. This includes not only raw data but also the ability to recognize user intentions and scene semantics, which is vital for providing dynamic, on-demand services in 6G.
-
Memory Module: This component stores interaction contexts, 6G network knowledge, and past action experiences. It’s divided into short-term and long-term memory, non-parametric (raw data) and parametric (fine-tuned model parameters) memory, and declarative (factual knowledge) and procedural (skills/strategies) memory. This comprehensive memory system allows agents to learn, adapt, and self-evolve.
-
Planning Module: The core of the agent, responsible for task planning. The dual-loop structure, enhanced by an LLMCompiler, ensures that tasks are efficiently decomposed and executed. The outer loop handles the broad task breakdown, while the inner loop, using a Directed Acyclic Graph (DAG) for tool dependencies, manages the parallel execution and dynamic replanning of sub-tasks.
-
Scheduling Module: After planning, this module intelligently offloads the execution of various tools (classic algorithms, machine learning models, or even LLMs) to the most appropriate computing resources across heterogeneous terminals and edge devices. This ensures optimal resource utilization and task execution efficiency.
Real-World Validation: Urban Safety Governance
To demonstrate its effectiveness, the framework was tested in an urban emergency response scenario, such as detecting fires or traffic accidents. In this case study, the system showed improved task planning capability and execution efficiency. For instance, a global agent would decompose an emergency response into sub-tasks for specialized agents like a Video-Agent (to analyze footage), a Meteorology-Agent (to assess weather impact), a Map-Agent (to locate emergency services), a Keyframe-Agent (to extract critical visual details), and a Report-Agent (to compile a comprehensive report).
Experimental results showed that this dual-loop system significantly outperformed classical agent schemes like ReAct and LLMCompiler in terms of success rate for complex tasks and reduced execution latency, especially as the number of required tool calls increased. This is largely due to its ability to decompose tasks and execute tool calls in parallel.
Also Read:
- Structuring Intelligence: Language Models Crafting Hierarchical Learning Environments for AI Agents
- Making Sense of AI Actions: TalkToAgent’s Approach to Explaining Reinforcement Learning
Looking Ahead: Challenges and Future Directions
While promising, the deployment of LLM-enabled agents in 6G networks still faces challenges. These include the need for more flexible on-device deployment of LLMs, improving reasoning abilities and context window management for complex tasks, mitigating communication overhead and ensuring security, and addressing the issue of ‘hallucination’ (where LLMs generate factually incorrect outputs).
This research provides valuable insights into how LLM-enabled multi-agent systems with dual-loop edge-terminal collaboration can accelerate the advent of the 6G era, offering a robust framework for intelligent and autonomous network services. For more technical details, you can refer to the full research paper here.


