New Research Reveals How Adversarial Audio Can Control and Disrupt AI Voice Assistants

TLDR: A new paper demonstrates that audio-based large language models (ALLMs) like Qwen2-Audio are vulnerable to “adversarial noise.” Attackers can craft subtle audio perturbations to either force ALLMs to perform specific malicious actions (like deleting calendar events) or degrade their performance, even in real-world scenarios by playing sounds over the air. While some defenses exist, attackers can adapt, highlighting a significant security risk for AI voice assistants.

A recent research paper titled “Attacker’s Noise Can Manipulate Your Audio-based LLM in the Real World” sheds light on critical vulnerabilities in audio-based large language models (ALLMs), such as Qwen2-Audio. Authored by Vinu Sankar Sadasivan, Soheil Feizi, Rajiv Mathews, and Lun Wang, this study reveals how subtle, crafted audio disturbances can be used to manipulate these advanced AI systems, posing a significant threat in real-world scenarios.

The paper identifies two main types of attacks: targeted and untargeted. In a targeted attack, an adversary creates stealthy audio perturbations designed to force an ALLM to perform specific, predefined actions. For instance, an attacker could play background noise that tricks an AI assistant into waking up (e.g., by interpreting the noise as “Hey Qwen”) or even executing harmful commands like “delete my calendar events” or “send money to X.” The researchers demonstrated a 100% success rate in these targeted attacks across various experimental settings, highlighting the ease with which these models can be exploited.

Untargeted attacks, on the other hand, aim to degrade the ALLM’s overall utility. By injecting adversarial noise during a user’s interaction, the attacker can significantly impair the model’s ability to understand and respond correctly. This could lead to a voice assistant misinterpreting commands or generating nonsensical replies, making the service unreliable for innocent users. The study showed that adversarial noise caused much more extreme degradation in speech recognition performance compared to random noise.

A crucial aspect of this research is its focus on real-world applicability. The authors explain that these adversarial sounds can be transmitted over the air, impacting other users in public spaces without their knowledge. To achieve this, they incorporated audio augmentation techniques like translation, additive noise, and SpecAugment during the attack optimization process. SpecAugment, in particular, proved vital for making the attacks robust against real-world distortions like microphone interference and ambient noise, enabling successful manipulation even when played through a speaker and recorded by another device.

The research also explored the transferability of these attacks, finding that they remain effective even when the ALLM is given different system instructions, including those designed to make it ignore background noise. This suggests that custom instructions or basic filtering might not be sufficient to protect against these sophisticated attacks.

While the paper highlights a significant security concern, it also investigates potential defensive measures. Simple input audio augmentations, such as sample rate modification and noise reduction, were tested. However, the most effective defense identified was neural audio compression, specifically EnCodec. This technique showed nearly 100% success in countering the attacks. Despite this, the authors caution that attackers could adapt their methods to bypass these defenses if they are aware of them, emphasizing the need for continuous development of more robust and adaptive security protocols.

Also Read:

In conclusion, this paper underscores a critical vulnerability in audio-based LLMs, demonstrating how adversarial audio can be used for both targeted malicious actions and general degradation of service in real-world scenarios. The findings call for increased caution in deploying and open-sourcing such models and stress the urgent need for advanced, adaptive defenses to protect users from these emerging threats. For more details, you can read the full research paper here.

Financial Sector Fortifies Against Surging AI-Powered Scams

Deloitte’s 2025 Outlook: Navigating Escalating AI Challenges in Human Capital

Salesforce Study Reveals Data Quality is Pivotal for Employee Trust in AI Adoption

Top Executives Sidestep Company AI Guidelines, Fueling Shadow AI Risks

Intel’s Evolving IP Strategy: A Calculated Shift Towards Core AI Innovation

Generative AI Prompts Increased Workforce Surveillance in Indian IT Sector

Financial Sector Fortifies Against Surging AI-Powered Scams

Deloitte’s 2025 Outlook: Navigating Escalating AI Challenges in Human Capital

Salesforce Study Reveals Data Quality is Pivotal for Employee Trust in AI Adoption

Top Executives Sidestep Company AI Guidelines, Fueling Shadow AI Risks

Intel’s Evolving IP Strategy: A Calculated Shift Towards Core AI Innovation

Generative AI Prompts Increased Workforce Surveillance in Indian IT Sector

New Research Reveals How Adversarial Audio Can Control and Disrupt AI Voice Assistants

Gen AI News and Updates

OneShield Achieves Landmark Registration Under Cloud Security Alliance AI Controls Matrix, Setting New Industry Standard

TrojAI Unveils Defend for MCP to Bolster Security for AI Agent Workflows

OpenAI Unveils ‘Friendlier’ GPT-5.1 for ChatGPT, Emphasizing Enhanced User Experience and Adaptive Intelligence

Boosting Business Efficiency: A New AI and Big Data Model for Process Optimization

AlphaCast: A New Approach to Time Series Prediction Through Human-AI Collaboration

New Graph Neural Networks Improve Reasoning in Assumption-Based Argumentation

Enhancing AI Reasoning: How Recursive Refinement and Multi-Agent Systems Improve Language Model Performance

ARGUS: A Proactive Framework for Enhancing Autonomous Driving Safety

Generative AI Powers Next-Gen Autonomous Emergency Response

OR-R1: Advancing Automated Optimization with Smart, Data-Efficient AI

Enhancing GUI Agents with Memory: A New Framework for History-Aware Reasoning

ProBench: A Deeper Look into How We Evaluate AI Agents for Mobile Apps

Enhancing Large Language Model Reasoning with Concise Outputs

Ensuring Trust in Autonomous AI: A Two-Layered Monitoring Approach for Agentic Systems

MedFuse: A Multiplicative Approach to Understanding Irregular Clinical Time Series Data

HyperD: A New Framework for More Accurate and Robust Traffic Predictions

Beyond Training: Researchers Propose ‘Model Raising’ for AI with Intrinsic Values

Bridging the Divide: Why AI Needs a Qualitative Revolution

Language Models Enhance Safety Certificate Synthesis for Dynamic Systems

Subscribe to get the latest news and updates