
Optimizing AI Energy Use in 5G Networks: A DeepRx Study on Efficiency and Knowledge Distillation

TLDR: This research paper investigates the energy consumption of AI/ML models, focusing on DeepRx, a deep learning receiver for 5G networks. It evaluates energy use during both training and inference, showing that inference dominates the energy budget of a deployed model. The study identifies key factors influencing energy consumption, such as hardware choice and specific model components. A core contribution is the application of knowledge distillation (KD) to create smaller, more energy-efficient DeepRx "student" models that retain the performance of larger "teacher" models. The results demonstrate that KD effectively reduces energy consumption while improving performance, achieving a lower error rate than models trained from scratch.

In an era of rapid technological advancement, the telecommunications industry is increasingly integrating Artificial Intelligence (AI) and Machine Learning (ML) to enhance efficiency and user experience. However, this progress comes with a significant challenge: substantial energy consumption. AI/ML models contribute to a notable portion of global energy use, and for telecom operators, energy costs can account for a large percentage of their operating expenses. This highlights a critical need to balance technological innovation with energy efficiency and environmental sustainability.

A recent study, titled “Energy Efficiency in AI for 5G and Beyond: A DeepRx Case Study”, delves into this challenge by focusing on DeepRx, a deep learning receiver based on a fully convolutional ResNet architecture. The research evaluates DeepRx’s energy consumption and explores methods to reduce it, particularly through a technique called knowledge distillation.

Understanding Energy Consumption in AI Models

The study emphasizes that the total energy consumption of an ML model includes both computation and memory operations. While computational energy is related to the number of operations, memory energy, often overlooked, can be even more significant, influenced by how data is managed and reused. The researchers used various approaches to estimate and measure energy use, including metrics based on operations and hardware specifications, and software tools like Intel Power Gadget, Experiment-Impact-Tracker, and CodeCarbon. These tools leverage built-in processor features to track energy consumption.
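The measurement idea behind tools like Intel Power Gadget or CodeCarbon is to read processor energy counters while a workload runs. As a dependency-free sketch of the same concept, the snippet below approximates energy as wall-clock time multiplied by an assumed average power draw; the power figure and the helper name are illustrative assumptions, not values from the study.

```python
import time

ASSUMED_AVG_POWER_WATTS = 45.0  # hypothetical average CPU package power


def estimate_energy_joules(workload):
    """Energy ~= elapsed time x assumed average power: a crude proxy for
    what counter-based tools (e.g. CodeCarbon) read from the hardware."""
    start = time.perf_counter()
    workload()
    elapsed_s = time.perf_counter() - start
    return elapsed_s * ASSUMED_AVG_POWER_WATTS


# Toy stand-in for one inference pass of a model.
energy_j = estimate_energy_joules(lambda: sum(i * i for i in range(200_000)))
print(f"~{energy_j:.4f} J under the assumed power draw")
```

Real measurement tools additionally separate package, DRAM, and GPU domains, which matters given the study's point that memory energy can rival computational energy.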

A key finding from the energy assessment of DeepRx was that energy usage varies significantly with the hardware. High-performance general-purpose processors, such as the Intel Xeon, consume more energy, while specialized AI accelerators, such as the Google Coral Edge TPU, are more energy-efficient. The study also provided a detailed breakdown of DeepRx’s energy consumption, identifying the central residual blocks (5, 6, and 7) as the most energy-intensive. This insight is crucial for targeting optimization efforts.

Furthermore, the research compared the energy consumed during the training and inference phases of DeepRx. While training requires substantial energy, the study found that for a deployed model like DeepRx, which performs many inferences per second, the cumulative energy of ongoing inference quickly surpasses the total energy used for training. The primary energy expenditure of a deployed AI model therefore occurs during its operational inference phase.
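The break-even point is simple arithmetic: divide the one-off training energy by the per-inference energy. The numbers below are illustrative assumptions, not figures from the paper; they only show why a receiver running continuously crosses the break-even point quickly.

```python
# All values are hypothetical, for illustration only.
TRAINING_ENERGY_KWH = 50.0        # assumed one-time training cost
ENERGY_PER_INFERENCE_KWH = 1e-6   # assumed cost of a single inference
INFERENCES_PER_SECOND = 1000      # a deployed receiver runs continuously

# Number of inferences after which cumulative inference energy
# exceeds the total training energy.
break_even_inferences = TRAINING_ENERGY_KWH / ENERGY_PER_INFERENCE_KWH
break_even_hours = break_even_inferences / INFERENCES_PER_SECOND / 3600

print(f"break-even after {break_even_inferences:.0f} inferences "
      f"(~{break_even_hours:.1f} hours of operation)")
```

Under these assumed numbers the deployment overtakes its training cost in well under a day, which is the intuition behind prioritizing inference efficiency.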

Enhancing Energy Efficiency with Knowledge Distillation

To address the high energy consumption, especially during inference, the study applied knowledge distillation (KD). KD is a technique where a smaller, more energy-efficient “student” model learns from a larger, more complex “teacher” model. The student model aims to replicate the performance of the teacher but with reduced computational demands due to its smaller size.

The process involved several steps. First, the researchers selected an optimal student model size that minimizes energy use without severely degrading performance; while smaller models naturally lose accuracy, KD helps mitigate this. Second, choosing the right teacher size was crucial: a teacher that is too small may not provide enough useful knowledge, while an excessively large teacher may be too complex for the student to learn from effectively. For DeepRx, a 30 TFLOPs teacher model proved most effective for an 11 TFLOPs student model.
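Under the common assumption that energy per inference scales roughly with compute on fixed hardware, the TFLOP figures above give a quick estimate of the student's relative cost (the proportionality itself is an assumption; the study measured actual energy):

```python
# TFLOP figures from the study; the compute-proportional energy
# assumption is ours, for a back-of-envelope comparison only.
TEACHER_TFLOPS = 30.0
STUDENT_TFLOPS = 11.0

ratio = STUDENT_TFLOPS / TEACHER_TFLOPS
print(f"student needs ~{ratio:.0%} of the teacher's compute per inference")
```

So the distilled student runs at roughly a third of the teacher's compute, which is where the inference-phase energy savings come from.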

Finally, fine-tuning KD hyperparameters, such as the alpha coefficient (balancing student’s own loss against distillation loss) and the temperature parameter (which “softens” the teacher’s probability distribution to provide more nuanced information), was essential for optimal results. The study demonstrated that models trained with KD achieved a notably lower error floor across various signal-to-interference and noise ratio (SINR) levels compared to models trained from scratch. This indicates that KD is highly effective in achieving energy-efficient AI solutions while maintaining or even improving performance.
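The roles of alpha and temperature can be made concrete with a generic Hinton-style distillation loss; this is a common formulation, not the exact loss used in the DeepRx study, and all function names here are illustrative:

```python
import numpy as np


def softmax(z, T=1.0):
    z = z / T
    z = z - z.max(axis=-1, keepdims=True)  # numerical stability
    e = np.exp(z)
    return e / e.sum(axis=-1, keepdims=True)


def kd_loss(student_logits, teacher_logits, target_idx, alpha=0.5, T=2.0):
    """alpha balances the hard-label loss against the distillation loss;
    T > 1 softens the teacher's distribution so near-miss classes carry
    more information. Generic sketch, not the study's exact objective."""
    # Hard loss: cross-entropy against the true label.
    hard_loss = -np.log(softmax(student_logits)[target_idx] + 1e-12)
    # Soft loss: KL divergence between temperature-softened distributions,
    # scaled by T^2 to keep gradient magnitudes comparable.
    p_t = softmax(teacher_logits, T)
    p_s = softmax(student_logits, T)
    soft_loss = np.sum(p_t * (np.log(p_t + 1e-12) - np.log(p_s + 1e-12))) * T * T
    return alpha * hard_loss + (1.0 - alpha) * soft_loss


loss = kd_loss(np.array([2.0, 0.5, 0.1]), np.array([1.8, 0.7, 0.2]), target_idx=0)
print(f"combined KD loss: {loss:.4f}")
```

Raising T spreads the teacher's probability mass across classes, while alpha closer to 1 weights the student's own labeled loss more heavily; the study tuned both for the DeepRx student.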


Conclusion

This research provides valuable insights into measuring and improving the energy efficiency of AI models, using DeepRX as a practical case study. It highlights the importance of selecting appropriate processors and focusing optimization efforts on the most energy-consuming components. By successfully applying knowledge distillation, the study demonstrated a significant improvement in DeepRX’s performance for a given model size, achieving a 4 dB gain for a specific Bit Error Rate. This work paves the way for more sustainable and cost-effective AI deployments in telecommunications and beyond.

Nikhil Patel (https://blogs.edgentiq.com)
Nikhil Patel is a tech analyst and AI news reporter who brings a practitioner's perspective to every article. With prior experience working at an AI startup, he decodes the business mechanics behind product innovations, funding trends, and partnerships in the GenAI space. Nikhil's insights are sharp, forward-looking, and trusted by insiders and newcomers alike. You can reach him at: [email protected]
