spot_img
HomeResearch & DevelopmentSmart Scaling for Stream Processing on Edge Devices

Smart Scaling for Stream Processing on Edge Devices

TLDR: A new platform (MUDAP) and an AI agent (RASK) have been developed to enhance the performance of stream processing services on resource-limited edge devices. Unlike traditional autoscalers that only adjust computing resources, MUDAP and RASK enable multi-dimensional scaling, allowing adjustments to both resource levels (CPU, RAM) and service-specific parameters (like data quality or model size). RASK efficiently learns how to optimize these parameters, leading to 28% fewer Service Level Objective (SLO) violations compared to existing methods, especially during high demand, with minimal overhead.

Edge devices are becoming increasingly vital for services that require quick responses and high reliability, such as those found in autonomous vehicles or disaster response systems. However, these devices often have limited computing resources, which can make it challenging for stream processing services to consistently meet their performance goals, known as Service Level Objectives (SLOs).

Traditional methods for automatically adjusting resources, or ‘autoscaling,’ typically focus only on increasing or decreasing computing power like CPU or memory. This approach falls short on edge devices where resources are scarce and services might need more nuanced adjustments.

Introducing MUDAP: A Multi-Dimensional Autoscaling Platform

To tackle these limitations, researchers have introduced a new approach called the Multi-Dimensional Autoscaling Platform (MUDAP). MUDAP is designed to support very precise, ‘vertical’ scaling across two key areas: service-level parameters and resource-level parameters. This means that beyond just adjusting CPU or RAM, MUDAP can also scale service-specific aspects. For example, a video processing service could dynamically adjust its data quality (like video resolution) or the size of its underlying machine learning model to better manage its workload and resource consumption.

RASK: The Intelligent Scaling Agent

At the heart of MUDAP’s optimization capabilities is a scaling agent called Regression Analysis of Structural Knowledge (RASK). RASK is an intelligent system that continuously learns about the processing environment. It builds a continuous regression model, which is essentially a mathematical representation of how different scaling actions impact service performance. This model allows RASK to efficiently explore various options and infer the best possible scaling actions to optimize performance across multiple competing services on a single edge device.

The research highlights that RASK is remarkably efficient, capable of building an accurate regression model in just 20 iterations, which translates to observing about 200 seconds of processing. This quick learning ability is crucial for dynamic edge environments.

Addressing Key Challenges in Edge Computing

This new platform and agent directly address several core challenges:

  • SLO Fulfillment on Resource-Constrained Devices: By enabling service-aware vertical scaling, MUDAP allows for trade-offs, such as reducing data quality slightly to maintain essential throughput, which is vital when resources are limited.
  • Multi-Dimensional Autoscaling: The system is tailored to individual services, understanding that different services respond differently to scaling actions. RASK learns these impacts, even in complex scenarios.
  • Adapting to Dynamic Workloads: Edge devices often experience fluctuating data loads. RASK ensures that SLOs are met by dynamically adjusting resources and service parameters, avoiding over-provisioning while maintaining performance during peak demands.

Performance and Impact

The effectiveness of MUDAP and RASK was rigorously tested against existing autoscalers, including the Kubernetes Vertical Pod Autoscaler (VPA) and a reinforcement learning agent. The evaluation involved scaling up to nine services on a single edge device under various dynamic request patterns.

The results were compelling: RASK consistently sustained the highest request loads with 28% fewer SLO violations compared to the baseline autoscalers. This significant improvement was achieved by intelligently adding more ‘elasticity dimensions’ – meaning RASK could adjust more aspects of the services and resources. Furthermore, RASK introduced minimal CPU overhead, demonstrating its efficiency in resource-sensitive environments.

The study also found that increasing the number of elasticity dimensions (e.g., adjusting CPU, data quality, and model size) directly led to higher overall SLO fulfillment. While the complexity of the optimization problem increases with more services, caching previous decisions helped maintain stable performance and even improved SLO fulfillment in some cases.

Also Read:

The Future of Edge Device Management

This research paves the way for more sophisticated and efficient management of stream processing services on edge devices. By moving beyond traditional resource-only scaling, MUDAP and RASK offer a flexible, multi-dimensional approach that can ensure critical services maintain their performance even in the most challenging, resource-constrained environments. For more details, you can refer to the full research paper here.

Karthik Mehta
Karthik Mehtahttps://blogs.edgentiq.com
Karthik Mehta is a data journalist known for his data-rich, insightful coverage of AI news and developments. Armed with a degree in Data Science from IIT Bombay and years of newsroom experience, Karthik merges storytelling with metrics to surface deeper narratives in AI-related events. His writing cuts through hype, revealing the real-world impact of Generative AI on industries, policy, and society. You can reach him out at: [email protected]

- Advertisement -

spot_img

Gen AI News and Updates

spot_img

- Advertisement -