Streamlining Chiplet Design with Multi-Agent AI Guidance

TLDR: MAHL is a novel framework that uses a multi-agent Large Language Model (LLM) system to automate and optimize the hierarchical design of chiplets for AI workloads. It addresses common LLM limitations like flattened designs, high validation costs, and imprecise parameter optimization through six collaborative agents. The framework significantly improves design generation accuracy and achieves competitive Power, Performance, and Area (PPA) results compared to conventional LLMs and expert-based methods.

As artificial intelligence workloads continue to grow in size and complexity, the challenge of designing efficient computing hardware becomes increasingly significant. Traditional chip design methods struggle with the high dimensionality of modern AI algorithms, encompassing various computing cores, array sizes, and memory hierarchies. While machine learning has already begun to assist in agile chip design, the recent advancements in Large Language Models (LLMs) offer a promising avenue for further innovation, particularly in the realm of 2.5D integration – an advanced packaging technique that helps reduce area overhead and development costs for integrated circuits.

However, relying solely on LLMs for chiplet design presents several hurdles. These include the tendency of LLMs to generate ‘flattened’ designs (where all code is in one block, lacking modularity), high validation costs due to inaccurate code, and difficulties in precisely optimizing design parameters. To tackle these challenges, researchers from the University of Minnesota – Twin Cities have introduced MAHL: Multi-Agent LLM-Guided Hierarchical Chiplet Design with Adaptive Debugging. This innovative framework aims to streamline the chiplet design process by leveraging a collaborative multi-agent system.

Introducing MAHL: A Collaborative AI Approach to Chiplet Design

MAHL is a hierarchical LLM-based framework designed to generate chiplet designs efficiently, with a focus on optimizing Power, Performance, and Area (PPA). It achieves this through the coordinated efforts of six specialized agents, each handling a critical aspect of the design workflow:

AI-Hardware Hierarchical Parser: This agent takes a user’s natural language input describing an AI algorithm and breaks it down into computational and interconnection modules. It then maps these to appropriate hardware modules, using LLMs and a library of existing components. If a component isn’t found, it can interact with a human to get the necessary specifications.
Hierarchical Module Description Generator: For mapped hardware modules, this agent retrieves hierarchical descriptions from a library. For new or unmapped modules, a duo-agent system (a generator LLM and an evaluator LLM) creates and refines structured descriptions, ensuring they meet format and completeness requirements.
Retrieval-Augmented Code Generator: This agent takes the hierarchical module descriptions and generates Hardware Description Language (HDL) code. It prioritizes code reuse by dynamically querying a Code Library for high-quality, pre-verified code snippets, only generating new code when necessary.
Diverseflow Validator: A crucial component for debugging, this agent integrates simulation and synthesis tools with a multi-round debugging strategy. It generates testbenches using LLMs and retrieves existing ones. To prevent getting stuck on a single error, it injects controlled ‘noise’ into prompts, encouraging diverse debugging attempts. A ‘Thinker’ LLM diagnoses issues, and a ‘Coder’ LLM applies fixes.
Multi-Granularity Design Space Explorer: Chiplet design involves a vast number of possible configurations. This agent combines the strengths of LLMs for coarse-grained exploration of the design space with analytical techniques for fine-grained refinement. It identifies optimal hardware parameters by analyzing PPA metrics and provides feedback for further optimization.
Configurator: For the final physical layout, this LLM-based agent automatically generates and revises configuration files for tools like OpenROAD, tuning parameters such as wire length, spacing, and chip size to ensure a successful layout.

Also Read:

Performance and Impact

Experiments demonstrate that MAHL significantly improves the accuracy of generating both simple RTL (Register-Transfer Level) designs and complex real-world chiplet designs. For simple RTL designs, the framework improves the average Pass@1 success rate by 44.67% compared to conventional methods. For real-world chiplet designs, evaluated by Pass@5, MAHL achieves a remarkable increase from 0 to 0.72 when compared to conventional LLMs under the best-case scenario. This means MAHL is far more successful at producing functional and correct designs.

Furthermore, when compared to state-of-the-art expert-based frameworks like CLARIE, MAHL achieves comparable or even superior PPA results for specific optimization objectives. For instance, MAHL-generated designs showed an average latency reduction of 16.08% in high-performance mode and an impressive 83.96% area reduction in compact-area mode compared to human-designed chiplets. While LLM-based generation tends to excel at primary design objectives, the research notes that there’s still room for improvement in achieving a comprehensive trade-off across all hardware design factors, such as power density.

The MAHL framework represents a significant step forward in automating chiplet design for AI applications. By decomposing complex tasks into manageable submodules and leveraging the power of multi-agent LLMs, it addresses key limitations of direct LLM application, paving the way for more efficient and optimized hardware development. You can find more details about this research in the paper available at arXiv.org.

Financial Sector Fortifies Against Surging AI-Powered Scams

Deloitte’s 2025 Outlook: Navigating Escalating AI Challenges in Human Capital

Salesforce Study Reveals Data Quality is Pivotal for Employee Trust in AI Adoption

Top Executives Sidestep Company AI Guidelines, Fueling Shadow AI Risks

Intel’s Evolving IP Strategy: A Calculated Shift Towards Core AI Innovation

Generative AI Prompts Increased Workforce Surveillance in Indian IT Sector

Financial Sector Fortifies Against Surging AI-Powered Scams

Deloitte’s 2025 Outlook: Navigating Escalating AI Challenges in Human Capital

Salesforce Study Reveals Data Quality is Pivotal for Employee Trust in AI Adoption

Top Executives Sidestep Company AI Guidelines, Fueling Shadow AI Risks

Intel’s Evolving IP Strategy: A Calculated Shift Towards Core AI Innovation

Generative AI Prompts Increased Workforce Surveillance in Indian IT Sector

Streamlining Chiplet Design with Multi-Agent AI Guidance

Introducing MAHL: A Collaborative AI Approach to Chiplet Design

Performance and Impact

Gen AI News and Updates

Baidu Unveils Next-Generation AI Accelerators and ERNIE 5.0 Model

NVIDIA Introduces $249 Jetson Orin Nano Super Developer Kit for Accessible Generative AI

Optimizing Neural Processing Units for Continual Learning with Microscaling

Boosting Business Efficiency: A New AI and Big Data Model for Process Optimization

AlphaCast: A New Approach to Time Series Prediction Through Human-AI Collaboration

New Graph Neural Networks Improve Reasoning in Assumption-Based Argumentation

Enhancing AI Reasoning: How Recursive Refinement and Multi-Agent Systems Improve Language Model Performance

ARGUS: A Proactive Framework for Enhancing Autonomous Driving Safety

Generative AI Powers Next-Gen Autonomous Emergency Response

OR-R1: Advancing Automated Optimization with Smart, Data-Efficient AI

Enhancing GUI Agents with Memory: A New Framework for History-Aware Reasoning

ProBench: A Deeper Look into How We Evaluate AI Agents for Mobile Apps

Enhancing Large Language Model Reasoning with Concise Outputs

Ensuring Trust in Autonomous AI: A Two-Layered Monitoring Approach for Agentic Systems

MedFuse: A Multiplicative Approach to Understanding Irregular Clinical Time Series Data

HyperD: A New Framework for More Accurate and Robust Traffic Predictions

Beyond Training: Researchers Propose ‘Model Raising’ for AI with Intrinsic Values

Bridging the Divide: Why AI Needs a Qualitative Revolution

Language Models Enhance Safety Certificate Synthesis for Dynamic Systems

Subscribe to get the latest news and updates