MoEcho: Exposing User Privacy Risks in Mixture-of-Experts AI

TLDR: MoEcho is a new framework that demonstrates how the efficient Mixture-of-Experts (MoE) architecture in large AI models (LLMs, VLMs) can inadvertently leak sensitive user data. By monitoring subtle hardware execution patterns (side-channels) on CPUs and GPUs, attackers can infer private information from user prompts, reconstruct model responses, and even deduce visual attributes or reconstruct images, highlighting a critical need for enhanced privacy safeguards in modern AI systems.

Modern artificial intelligence, especially large language models (LLMs), has made incredible strides, but this progress often comes with a trade-off: computational efficiency. To tackle this, many advanced AI systems now use a design called Mixture-of-Experts (MoE). While MoE models are great at balancing performance and cost by selectively activating specialized subnetworks, new research reveals a significant privacy vulnerability inherent in their design.

A new study introduces ‘MoEcho,’ a framework that uncovers how the dynamic way MoE models operate can inadvertently expose sensitive user information. Imagine an AI model processing your private health query or analyzing a personal image. MoEcho demonstrates that the specific ‘experts’ activated within the MoE architecture leave behind unique digital ‘footprints’ that can be tracked and exploited by attackers.

The Hidden Footprints of AI

The core of the MoEcho attack lies in observing these ‘footprints’ during two key phases of an AI model’s operation:

Expert Load: When an AI model first processes an entire input (like a long prompt), it distributes parts of that input among its various experts. The number of times each expert is used creates a unique ‘load’ pattern.
Expert Sequence: As the AI model generates its response token by token, it activates a specific sequence of experts for each new piece of information. This sequence can be highly indicative of the output being generated.

These patterns, while seemingly innocuous, are highly correlated with the user’s input and the model’s output, making them a goldmine for privacy breaches.

How Attackers Listen In: Side-Channel Exploitation

MoEcho leverages what are known as ‘side-channel attacks.’ Instead of directly hacking the AI’s code, these attacks observe subtle, indirect information leaked during the system’s execution. The researchers developed four novel architectural side-channels:

On CPUs: They used ‘Cache Occupancy Channels’ to measure how much each expert uses the processor’s temporary memory (cache), inferring the ‘expert load.’ For the ‘expert sequence,’ they employed ‘Pageout+Reload’ attacks, which involve monitoring how quickly expert-specific memory pages are reloaded after being temporarily removed from memory.
On GPUs: For graphics processing units, ‘Performance Counters’ were used to track the number of computational threads each expert utilized, revealing the ‘expert load.’ To capture the ‘expert sequence,’ a ‘TLB Evict+Reload’ attack was devised, which monitors the Translation Lookaside Buffer (TLB), a special cache for memory addresses.

These techniques allow an attacker, even if co-located on the same physical machine (e.g., in a cloud environment), to stealthily gather information about which experts are active and for how long.

Four Ways Your Privacy Can Be Compromised

Based on these leaked execution patterns, MoEcho proposes four powerful privacy attacks:

Prompt Inference Attack (PIA): This attack aims to uncover sensitive attributes from user inputs, such as a patient’s illness, age, or gender from a healthcare query. For example, the study achieved a 99.8% success rate in inferring private patient inputs in healthcare records using templated inputs.
Response Reconstruction Attack (RRA): Here, the attacker can reconstruct the AI model’s entire output, word for word. This could expose sensitive information from revised emails or personal documents. The research showed a 92.8% success rate in reconstructing LLM responses.
Visual Inference Attack (VIA): For AI models that process images (Vision-Language Models or VLMs), this attack can deduce visual attributes like facial features or even identify individuals from input images.
Visual Reconstruction Attack (VRA): This advanced attack can reconstruct parts of an input image, even if the user only provided a masked version, by using the leaked expert load information to guide a generative model.

The researchers evaluated MoEcho on several open-source MoE models, including DeepSeek-V2 Lite and DeepSeek-VL2, demonstrating the widespread applicability and effectiveness of these attacks in real-world scenarios.

Also Read:

Protecting AI’s Future

The findings from MoEcho highlight a critical security and privacy threat in the rapidly evolving landscape of AI. As MoE architectures become more common for their efficiency, it’s crucial to implement safeguards. Potential mitigations include:

Robust Training: Introducing randomness or differential privacy during the training of MoE routers to obscure expert decisions.
Secure Deployment: Randomizing the execution order of experts or distributing them across multiple devices to make tracking harder.
General Side-Channel Defenses: Adapting existing defenses like balancing computation to hide load variations, restricting access to sensitive system tools, and rigorously isolating shared hardware resources.

This groundbreaking work is the first to conduct a run-time, architecture-level security analysis of the popular MoE structure, urging developers and users to consider these privacy implications. For more in-depth technical details, you can read the full research paper here.

Financial Sector Fortifies Against Surging AI-Powered Scams

Deloitte’s 2025 Outlook: Navigating Escalating AI Challenges in Human Capital

Salesforce Study Reveals Data Quality is Pivotal for Employee Trust in AI Adoption

Top Executives Sidestep Company AI Guidelines, Fueling Shadow AI Risks

Intel’s Evolving IP Strategy: A Calculated Shift Towards Core AI Innovation

Generative AI Prompts Increased Workforce Surveillance in Indian IT Sector

Financial Sector Fortifies Against Surging AI-Powered Scams

Deloitte’s 2025 Outlook: Navigating Escalating AI Challenges in Human Capital

Salesforce Study Reveals Data Quality is Pivotal for Employee Trust in AI Adoption

Top Executives Sidestep Company AI Guidelines, Fueling Shadow AI Risks

Intel’s Evolving IP Strategy: A Calculated Shift Towards Core AI Innovation

Generative AI Prompts Increased Workforce Surveillance in Indian IT Sector

MoEcho: Exposing User Privacy Risks in Mixture-of-Experts AI

The Hidden Footprints of AI

How Attackers Listen In: Side-Channel Exploitation

Four Ways Your Privacy Can Be Compromised

Protecting AI’s Future

Gen AI News and Updates

Boosting Business Efficiency: A New AI and Big Data Model for Process Optimization

New Graph Neural Networks Improve Reasoning in Assumption-Based Argumentation

MedFuse: A Multiplicative Approach to Understanding Irregular Clinical Time Series Data

Boosting Business Efficiency: A New AI and Big Data Model for Process Optimization

AlphaCast: A New Approach to Time Series Prediction Through Human-AI Collaboration

New Graph Neural Networks Improve Reasoning in Assumption-Based Argumentation

Enhancing AI Reasoning: How Recursive Refinement and Multi-Agent Systems Improve Language Model Performance

ARGUS: A Proactive Framework for Enhancing Autonomous Driving Safety

Generative AI Powers Next-Gen Autonomous Emergency Response

OR-R1: Advancing Automated Optimization with Smart, Data-Efficient AI

Enhancing GUI Agents with Memory: A New Framework for History-Aware Reasoning

ProBench: A Deeper Look into How We Evaluate AI Agents for Mobile Apps

Enhancing Large Language Model Reasoning with Concise Outputs

Ensuring Trust in Autonomous AI: A Two-Layered Monitoring Approach for Agentic Systems

MedFuse: A Multiplicative Approach to Understanding Irregular Clinical Time Series Data

HyperD: A New Framework for More Accurate and Robust Traffic Predictions

Beyond Training: Researchers Propose ‘Model Raising’ for AI with Intrinsic Values

Bridging the Divide: Why AI Needs a Qualitative Revolution

Language Models Enhance Safety Certificate Synthesis for Dynamic Systems

Subscribe to get the latest news and updates