Boosting PIM Performance: A Co-Design Approach to Mitigate Voltage Drops

TLDR: AIM is a novel software and hardware co-design approach to mitigate IR-drop issues in high-performance Processing-in-Memory (PIM) chips. It introduces metrics like Rtog and HR to correlate workloads with IR-drop. Software methods, Lower Hamming Rate (LHR) and Weight Distribution Shift (WDS), optimize data to reduce IR-drop potential with minimal accuracy loss. Hardware component, IR-Booster, dynamically adjusts voltage-frequency pairs based on HR and real-time monitoring, ensuring reliability. An HR-aware task mapping strategy further optimizes performance for mixed workloads. The system achieves up to 69.2% IR-drop mitigation, 2.29x energy efficiency improvement, and 1.152x speedup on a 7nm 256-TOPS PIM chip.

High-performance computing is constantly pushing the boundaries of what’s possible, and a technology called Processing-in-Memory (PIM) is at the forefront. PIM seamlessly integrates computing capabilities directly within memory units, offering impressive gains in computing density, energy efficiency, and precision. This approach is particularly promising for demanding applications like artificial intelligence.

However, as PIM chips become more powerful and operate at higher speeds, they face a significant challenge known as IR-drop. Imagine a complex city with many buildings drawing power simultaneously; if the power lines aren’t robust enough, some buildings might experience a dip in voltage. Similarly, in a PIM chip, the simultaneous operation of numerous computing units can cause the supply voltage to drop, leading to performance degradation and even reliability issues. Traditional methods to combat IR-drop, often involving complex circuit-level adjustments, are resource-intensive and can compromise the chip’s overall power, performance, and area (PPA).

To address this critical issue, researchers have introduced a novel approach called AIM: Software and Hardware Co-design for Architecture-level IR-drop Mitigation in High-performance PIM. This comprehensive solution tackles IR-drop from a system-wide perspective, combining intelligent software optimizations with dynamic hardware adjustments. You can read the full research paper for more technical details here: AIM: Software and Hardware Co-design for Architecture-level IR-drop Mitigation in High-performance PIM.

Understanding the Problem: Workloads and IR-drop

A key insight behind AIM is recognizing the direct link between the tasks a PIM chip performs (its workload) and the severity of IR-drop. Unlike general-purpose processors, PIM architectures often handle predictable and regular workloads, primarily neural networks. This predictability creates an opportunity for architecture-level optimization.

AIM introduces two crucial metrics to quantify this relationship: Rtog (instantaneous toggle rate) and HR (Hamming Rate). Rtog measures the rapid switching of bits within a PIM bank, which directly correlates with dynamic current draw and thus IR-drop. HR, derived from Rtog, simplifies this by focusing on the average ‘1’ bits in the pre-loaded memory data (weights). Crucially, HR is independent of the input data, making it an optimizable metric for software-based mitigation.

Software Innovations: Optimizing for Lower IR-drop

AIM’s software component focuses on reducing the Hamming Rate (HR) of the data stored in memory, thereby lowering the potential for IR-drop. It achieves this through two main methods:

Lower Hamming Rate (LHR): This technique integrates a regularization term into the quantization process, which is how neural network weights are converted into a more compact, lower-precision format. LHR encourages weights to align with values that naturally have lower HR, such as 0 or 8 in an 8-bit system, without significantly impacting the accuracy of the neural network. This is like subtly nudging the data to be more ‘IR-drop friendly’ during its creation.
Weight Distribution Shift (WDS): Building on LHR, WDS further refines the weight distribution. It shifts the entire distribution of weights towards positive values, concentrating them around smaller positive numbers that inherently have lower HR. To maintain computational accuracy, a clever hardware mechanism called a ‘shift compensator’ is used to correct for this shift after calculations, ensuring the final results are correct without adding delays to critical operations.

Hardware Intelligence: Dynamic Adaptation

While software optimizations prepare the data, AIM’s hardware component, IR-Booster, provides real-time, dynamic adjustments to mitigate IR-drop during operation. PIM chips are often divided into ‘Macro Groups’ that share power supplies and operate at the same frequency.

IR-Booster V-f Pair Adjustment: IR-Booster extends traditional Dynamic Voltage and Frequency Scaling (DVFS) by offering more granular control. Instead of just adjusting voltage and frequency based on general workload, it uses the pre-determined HR information for each Macro Group to select optimal voltage-frequency (V-f) pairs. This allows for lower voltages or higher frequencies (or both) when the HR is low, significantly improving energy efficiency and performance. It operates in ‘sprint mode’ for maximum throughput or ‘low-power mode’ for energy savings.
IRFailure-driven Recomputing: To ensure reliability, IR-Booster includes an ‘IR Monitor’ that constantly checks for voltage drops. If an IR-drop violation (IRFailure) occurs, the system quickly adjusts the V-f pair to a safer level and initiates a recomputation for the affected tasks. This mechanism ensures that even aggressive optimizations don’t compromise the chip’s functional integrity.

Co-optimization: HR-aware Task Mapping

To maximize the benefits of both software and hardware, AIM introduces HR-aware task mapping. In complex scenarios where multiple tasks with varying HR values run concurrently, simply assigning them randomly can lead to inefficiencies. HR-aware task mapping intelligently assigns tasks to PIM macros, considering their HR values. This prevents tasks with very different IR-drop characteristics from interfering with each other, ensuring that Macro Groups can operate at their most efficient V-f pairs. This optimization is crucial for maintaining high performance and energy efficiency across diverse workloads.

Also Read:

Impressive Results

Evaluations on a 7nm 256-TOPS PIM chip design demonstrate the significant impact of AIM. The system achieved up to 69.2% IR-drop mitigation within a macro. This translates to a remarkable 2.29 times improvement in energy efficiency and a 1.152 times speedup in overall chip performance. The study also showed that while IR-Booster provides the primary benefits for transformer-based AI models (due to their dynamic nature), LHR and WDS are particularly effective for convolution-based networks where weights can be pre-optimized.

AIM represents a significant step forward in designing high-performance PIM chips. By intelligently co-designing software and hardware, it effectively addresses the critical challenge of IR-drop, paving the way for more powerful, efficient, and reliable AI accelerators.

Financial Sector Fortifies Against Surging AI-Powered Scams

Deloitte’s 2025 Outlook: Navigating Escalating AI Challenges in Human Capital

Salesforce Study Reveals Data Quality is Pivotal for Employee Trust in AI Adoption

Top Executives Sidestep Company AI Guidelines, Fueling Shadow AI Risks

Intel’s Evolving IP Strategy: A Calculated Shift Towards Core AI Innovation

Generative AI Prompts Increased Workforce Surveillance in Indian IT Sector

Financial Sector Fortifies Against Surging AI-Powered Scams

Deloitte’s 2025 Outlook: Navigating Escalating AI Challenges in Human Capital

Salesforce Study Reveals Data Quality is Pivotal for Employee Trust in AI Adoption

Top Executives Sidestep Company AI Guidelines, Fueling Shadow AI Risks

Intel’s Evolving IP Strategy: A Calculated Shift Towards Core AI Innovation

Generative AI Prompts Increased Workforce Surveillance in Indian IT Sector

Boosting PIM Performance: A Co-Design Approach to Mitigate Voltage Drops

Understanding the Problem: Workloads and IR-drop

Software Innovations: Optimizing for Lower IR-drop

Hardware Intelligence: Dynamic Adaptation

Co-optimization: HR-aware Task Mapping

Impressive Results

Gen AI News and Updates

Peking University Researchers Unveil Analog Chip Boosting AI Data Centers by Up to 1,000-Fold

Intel’s Evolving IP Strategy: A Calculated Shift Towards Core AI Innovation

A New Era for Spiking Neural Networks: Hyperdimensional Decoding Boosts Accuracy and Efficiency

Boosting Business Efficiency: A New AI and Big Data Model for Process Optimization

AlphaCast: A New Approach to Time Series Prediction Through Human-AI Collaboration

New Graph Neural Networks Improve Reasoning in Assumption-Based Argumentation

Enhancing AI Reasoning: How Recursive Refinement and Multi-Agent Systems Improve Language Model Performance

ARGUS: A Proactive Framework for Enhancing Autonomous Driving Safety

Generative AI Powers Next-Gen Autonomous Emergency Response

OR-R1: Advancing Automated Optimization with Smart, Data-Efficient AI

Enhancing GUI Agents with Memory: A New Framework for History-Aware Reasoning

ProBench: A Deeper Look into How We Evaluate AI Agents for Mobile Apps

Enhancing Large Language Model Reasoning with Concise Outputs

Ensuring Trust in Autonomous AI: A Two-Layered Monitoring Approach for Agentic Systems

MedFuse: A Multiplicative Approach to Understanding Irregular Clinical Time Series Data

HyperD: A New Framework for More Accurate and Robust Traffic Predictions

Beyond Training: Researchers Propose ‘Model Raising’ for AI with Intrinsic Values

Bridging the Divide: Why AI Needs a Qualitative Revolution

Language Models Enhance Safety Certificate Synthesis for Dynamic Systems

Subscribe to get the latest news and updates