TL;DR: ASL360 is an AI-driven system that uses deep reinforcement learning to optimize 360-degree video streaming for mobile VR users in drone-assisted 5G networks. It intelligently schedules base and enhancement video layers, manages multiple buffers, and dynamically adapts to network conditions to maximize user experience. The system significantly improves video quality, reduces rebuffering, and minimizes quality fluctuations compared to existing methods, offering a smoother and higher-quality immersive video experience.
Immersive virtual reality (VR) experiences, particularly those involving 360-degree video, are rapidly gaining popularity. However, delivering high-quality, uninterrupted 360-degree video to mobile VR users presents significant challenges for current wireless networks. The sheer volume of data, coupled with dynamic network conditions and the need for a seamless user experience, often leads to frustrating issues like video stalls and inconsistent quality. Addressing these hurdles, a new research paper introduces ASL360, an innovative AI-enabled system designed to revolutionize adaptive streaming of layered 360-degree video over drone-assisted wireless networks.
Understanding the Challenge of 360-Degree Video
Traditional cellular networks often struggle to meet the stringent demands of 360-degree video streaming. These videos require massive bandwidth, and users' head movements mean that only a portion of the panoramic view (the 'viewport') is actively watched at any given moment. Efficiently delivering only the relevant, high-quality content while maintaining smooth playback is a complex task. Existing solutions often fall short, leading to a poor Quality of Experience (QoE) characterized by rebuffering events (stalls) and noticeable fluctuations in video quality.
To overcome these limitations, the integration of Unmanned Aerial Vehicles (UAVs), or drones, into existing network infrastructure has emerged as a promising solution. These UAVs can act as mobile base stations, augmenting conventional macro base stations (MBSs) with high-capacity millimeter-wave (mmWave) communication links, thereby enhancing data rates and improving network management.
Introducing ASL360: A Smart Solution
ASL360, proposed by Alireza Mohammadhosseini, Jacob Chakareski, and Nicholas Mastronarde, is an adaptive deep reinforcement learning-based scheduler. Its primary goal is to maximize the overall Quality of Experience (QoE) for mobile VR users by intelligently managing video streaming in a UAV-assisted 5G wireless network. The system leverages the power of Artificial Intelligence to make smart, real-time decisions about how and when to deliver video content.
How ASL360 Works
The core of ASL360 lies in its sophisticated approach to video encoding and delivery. 360-degree videos are broken down into ‘dependent layers’ and ‘segmented tiles’. This means the video has a ‘Base Layer’ (BL) which provides a fundamental, low-quality version of the entire panoramic frame, and ‘Enhancement Layers’ (ELs) which add higher quality details, specifically for the user’s current viewport. By only sending high-quality segments for the part of the video the user is actually looking at, ASL360 conserves bandwidth and improves efficiency.
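To make the bandwidth saving concrete, here is a minimal sketch of viewport-adaptive layered delivery. The per-tile bitrates and tile counts are purely illustrative assumptions, not figures from the paper: every tile receives the Base Layer, while only the tiles inside the predicted viewport also receive an Enhancement Layer.

```python
# Illustrative sketch of layered, tiled delivery (hypothetical bitrates, not from the paper).
BL_BITRATE_PER_TILE = 0.5   # Mbps per tile for the Base Layer (assumed)
EL_BITRATE_PER_TILE = 2.0   # Mbps per tile for the Enhancement Layer (assumed)

def segment_bitrate(total_tiles: int, viewport_tiles: int) -> float:
    """Bitrate for one segment: BL everywhere, EL only inside the viewport."""
    bl = total_tiles * BL_BITRATE_PER_TILE       # BL for every tile (stall-free fallback)
    el = viewport_tiles * EL_BITRATE_PER_TILE    # EL only where the user is looking
    return bl + el

def full_quality_bitrate(total_tiles: int) -> float:
    """Naive baseline: ship BL + EL for every tile of the panorama."""
    return total_tiles * (BL_BITRATE_PER_TILE + EL_BITRATE_PER_TILE)

adaptive = segment_bitrate(total_tiles=24, viewport_tiles=6)  # 24*0.5 + 6*2.0 = 24.0 Mbps
naive = full_quality_bitrate(24)                              # 24*2.5 = 60.0 Mbps
print(f"viewport-adaptive: {adaptive} Mbps vs full-quality: {naive} Mbps")
```

With these assumed numbers, delivering high quality only in a 6-of-24-tile viewport cuts the segment bitrate by more than half, which is the efficiency the layered-tile design is after.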
Users in the ASL360 system utilize two separate buffers: one for the Base Layer segments and another for the Enhancement Layer segments. This dual-buffer system allows for a delicate balance: the Base Layer buffer ensures continuous playback even during network fluctuations, while the Enhancement Layer buffer allows for quick adaptation to changes in the user’s viewport, ensuring the highest quality where it matters most.
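The dual-buffer dynamic can be sketched as follows. This is my own simplified model, not the authors' code: playback always drains the Base Layer buffer, the Enhancement Layer buffer drains in lockstep when it has matching content, and a stall is charged only when the Base Layer buffer runs dry.

```python
# Simplified dual-buffer playback model (an assumption-laden sketch, not ASL360 itself).
from dataclasses import dataclass

@dataclass
class DualBuffer:
    bl_sec: float = 0.0     # seconds of Base Layer video buffered
    el_sec: float = 0.0     # seconds of Enhancement Layer video buffered
    stall_sec: float = 0.0  # accumulated rebuffering time

    def download(self, layer: str, seg_sec: float) -> None:
        """Add a downloaded segment of `seg_sec` seconds to the chosen buffer."""
        if layer == "BL":
            self.bl_sec += seg_sec
        else:
            self.el_sec += seg_sec

    def play(self, dt: float) -> None:
        """Advance playback by dt seconds; stall whenever the BL buffer is empty."""
        played = min(dt, self.bl_sec)                  # can only play what BL covers
        self.stall_sec += dt - played                  # BL underrun => rebuffering
        self.bl_sec -= played
        self.el_sec = max(0.0, self.el_sec - played)   # EL drains alongside playback
```

For example, downloading 4 seconds of Base Layer and then trying to play 5 seconds leaves 1 second of stall, which is exactly the event the scheduler tries to avoid by keeping the BL buffer topped up.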
The system models the scheduling decision as a Constrained Markov Decision Process (CMDP), where an AI agent learns to select between downloading Base or Enhancement layers. It uses a policy gradient-based method called Proximal Policy Optimization (PPO) to find the best strategy. Crucially, ASL360 includes a dynamic adjustment mechanism for its cost components, allowing it to adaptively prioritize video quality, buffer occupancy, and quality changes based on real-time network conditions and streaming session demands. This means if the network is struggling, it might prioritize keeping the Base Layer buffer full to prevent stalls, but if conditions are good, it will focus on delivering high-quality Enhancement Layers.
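The dynamic cost-weighting idea can be illustrated with a toy QoE reward. The specific weight-update rule below (scaling the buffer penalty by an "urgency" term as the Base Layer buffer empties) is my own hypothetical choice, meant only to show how the trade-off between quality, buffer occupancy, and quality changes can shift with network conditions.

```python
# Toy QoE reward with dynamically adjusted cost weights (illustrative sketch only;
# the weights, target buffer level, and urgency rule are assumptions, not the paper's).

def qoe_reward(quality: float, prev_quality: float, bl_buffer_sec: float,
               w_quality: float = 1.0, w_buffer: float = 1.0,
               w_switch: float = 0.5, bl_target_sec: float = 8.0) -> float:
    # Urgency grows from 0 (buffer at target) to 1 (buffer empty), so a nearly
    # empty BL buffer makes refilling it dominate the reward; a full buffer lets
    # quality and smoothness dominate instead.
    urgency = max(0.0, 1.0 - bl_buffer_sec / bl_target_sec)
    buffer_cost = (w_buffer + urgency) * max(0.0, bl_target_sec - bl_buffer_sec)
    switch_cost = w_switch * abs(quality - prev_quality)   # penalize quality jumps
    return w_quality * quality - buffer_cost - switch_cost

# Same video quality, different buffer states: the near-empty buffer is penalized
# much more heavily, steering the agent toward Base Layer downloads.
print(qoe_reward(40.0, 40.0, bl_buffer_sec=8.0))  # buffer at target
print(qoe_reward(40.0, 40.0, bl_buffer_sec=0.0))  # buffer empty
```

In a PPO setup like the one described above, a reward of this shape is what the policy gradient would optimize; the agent's action (download BL vs. an EL tier) changes the next state's buffer levels and quality, and the shifting weights steer it between stall avoidance and quality maximization.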
Impressive Results and What They Mean
Extensive simulations, using realistic 5G mmWave network traces and 8K 360-degree video sequences, demonstrated ASL360's superior performance compared to existing methods. The results are compelling:
- ASL360 achieved approximately 2 dB higher average video quality (measured by PSNR) compared to competitive baseline methods like the Threshold-based method and Pensieve (another reinforcement learning-based solution). This translates to a noticeably sharper and more detailed visual experience for users.
- It delivered an impressive 80% lower average rebuffering time. This means significantly fewer frustrating interruptions and a much smoother playback experience.
- The system also showed 57% lower video quality variation, ensuring a more consistent and stable visual quality throughout the streaming session, avoiding jarring shifts between high and low quality.
These improvements highlight the effectiveness of ASL360’s layered and adaptive approach in enhancing the Quality of Experience for immersive video streaming, especially in dynamic and challenging network environments. The system’s ability to dynamically balance quality, rebuffering, and smoothness is a game-changer for mobile VR.
The Road Ahead
The researchers plan to further enhance ASL360 by integrating computing resources to jointly manage both network and computational resource allocation, aiming for even greater QoE in immersive video services. Future work will also explore the scalability of ASL360 to multiple users and investigate resource allocation algorithms that ensure fairness among all VR users. For more technical details, refer to the full research paper.