TLDR: A new research paper introduces a method to create “imperceptible” adversarial attacks on tabular data (like spreadsheets). Unlike attacks on images, tabular data is tricky due to mixed data types. The researchers use a Variational Autoencoder (VAE) to generate these attacks in a “latent space,” ensuring the altered data looks statistically normal and isn’t easily detected as an anomaly, while still fooling machine learning models. This approach offers a more practical and realistic way to test the robustness of AI systems handling real-world data.
In the rapidly evolving world of artificial intelligence, understanding and mitigating vulnerabilities is crucial. One significant area of concern is ‘adversarial attacks,’ where subtle, often imperceptible changes are made to data to trick machine learning models. While these attacks are well-studied in areas like image recognition, applying them to tabular data – the kind found in spreadsheets, databases, and financial records – presents unique challenges.
A new research paper, titled “Crafting Imperceptible On-Manifold Adversarial Attacks for Tabular Data,” by Zhipeng He, Alexander Stevens, Chun Ouyang, Johannes De Smedt, Alistair Barros, and Catarina Moreira, delves into these challenges and proposes an innovative solution. The core problem with tabular data is its heterogeneous nature, combining different types of information like numbers and categories. Unlike images, where a few pixel changes might still look similar to the human eye, small alterations in tabular data can drastically change its meaning or make it look obviously fake.
The Problem with Traditional Attacks
Traditional adversarial attack methods often rely on mathematical constraints (like ‘ℓp-norms’) that work well for continuous data like images. However, when applied to tabular data, these methods tend to produce ‘outlier’ examples – data points that deviate significantly from the original data’s statistical patterns. Imagine changing a person’s age to an unrealistic number or altering a categorical feature in a way that doesn’t make sense in the real world. Such changes make the adversarial examples easily detectable, limiting their practical use in exposing real-world AI vulnerabilities.
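To see why this goes wrong, here is a minimal FGSM-style sketch (assuming PyTorch, a differentiable model, and one-hot-encoded categorical columns; the function name and epsilon value are illustrative, not from the paper). Nudging every feature by ±ε turns crisp 0/1 categorical encodings into fractional values and pushes numerical features off their natural scale – exactly the kind of statistical outlier an anomaly detector flags.

```python
import torch
import torch.nn.functional as F

def fgsm_tabular(model, x, label, epsilon=0.1):
    """Illustrative l_inf (FGSM-style) attack applied naively to a tabular row.
    Every feature, including one-hot categorical columns, gets nudged by
    +/- epsilon, which easily produces implausible, out-of-distribution values."""
    x = x.clone().detach().requires_grad_(True)
    loss = F.cross_entropy(model(x), label)   # loss we want the attack to increase
    loss.backward()
    return (x + epsilon * x.grad.sign()).detach()  # e.g. a one-hot entry becomes 0.9 or 0.1
```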
The researchers emphasize the concept of ‘imperceptibility,’ meaning the adversarial example should be statistically indistinguishable from the original data distribution. This is where traditional methods fall short, as they often create data that looks ‘out-of-distribution,’ making the attack obvious.
A Novel Approach: Latent Space Perturbations with VAEs
To overcome these limitations, the paper proposes a novel framework that uses a Variational Autoencoder (VAE). A VAE is a type of neural network capable of learning a compressed, continuous representation of data, known as a ‘latent space.’ The key innovation here is to generate adversarial examples by making subtle changes not to the original data directly, but within this learned latent space.
The VAE designed by the authors is specifically tailored for tabular data. It can handle both numerical and categorical features by transforming them into a unified, continuous latent manifold. This means that when a perturbation is applied in this latent space, the VAE’s decoder can reconstruct an adversarial example that remains statistically consistent with the original data distribution. In simpler terms, the altered data looks ‘normal’ and ‘on-manifold’ – it preserves the inherent statistical patterns and relationships of the original dataset.
The VAE architecture includes an encoder (to map input data to the latent space), a decoder (to reconstruct data from the latent space), and a classification head (to help the VAE learn a latent space that separates different classes of data effectively). This joint training ensures that the latent space is not only compact but also semantically meaningful, allowing for targeted yet imperceptible perturbations.
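To make that architecture concrete, here is a minimal PyTorch sketch of what such a tabular VAE could look like. The layer sizes, head structure, and names (`TabularVAE`, `num_head`, `cat_heads`, and so on) are illustrative assumptions, not the authors’ implementation: categorical features enter one-hot encoded, numerical features are reconstructed with a regression head, each categorical feature gets its own softmax head, and a small classifier operates directly on the latent code.

```python
import torch
import torch.nn as nn

class TabularVAE(nn.Module):
    """Sketch of a VAE for mixed-type tabular data: numerical features are
    reconstructed with a regression head, categorical features with per-feature
    logit heads, and a classifier encourages a class-separable latent space."""

    def __init__(self, num_numerical, categorical_cardinalities, latent_dim=8, n_classes=2):
        super().__init__()
        input_dim = num_numerical + sum(categorical_cardinalities)  # one-hot encoded input
        self.encoder = nn.Sequential(nn.Linear(input_dim, 64), nn.ReLU())
        self.mu = nn.Linear(64, latent_dim)       # mean of q(z|x)
        self.logvar = nn.Linear(64, latent_dim)   # log-variance of q(z|x)
        self.decoder = nn.Sequential(nn.Linear(latent_dim, 64), nn.ReLU())
        self.num_head = nn.Linear(64, num_numerical)  # numerical reconstruction
        self.cat_heads = nn.ModuleList(                # one logit head per categorical feature
            [nn.Linear(64, c) for c in categorical_cardinalities])
        self.classifier = nn.Linear(latent_dim, n_classes)  # classification head on z

    def encode(self, x):
        h = self.encoder(x)
        return self.mu(h), self.logvar(h)

    def reparameterize(self, mu, logvar):
        std = torch.exp(0.5 * logvar)
        return mu + std * torch.randn_like(std)    # sample z ~ q(z|x)

    def decode(self, z):
        h = self.decoder(z)
        return self.num_head(h), [head(h) for head in self.cat_heads]

    def forward(self, x):
        mu, logvar = self.encode(x)
        z = self.reparameterize(mu, logvar)
        num_out, cat_logits = self.decode(z)
        return num_out, cat_logits, self.classifier(z), mu, logvar
```

Training would combine the usual VAE reconstruction and KL terms with a classification loss on the latent code, which is what pushes the latent space to be both compact and class-separable.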
How the Attacks Work
The process involves taking an original data point, encoding it into its latent representation, adding a small, optimized ‘noise’ or perturbation to this latent representation, and then decoding it back into a new, adversarial data point. This perturbation is carefully calculated to cause a machine learning model to misclassify the data, while ensuring the altered data remains within the statistical boundaries of the original dataset. This approach ensures that the adversarial examples are both effective at fooling models and practically undetectable as anomalies.
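A rough sketch of that loop, reusing the `TabularVAE` sketch above, might look like the following. Again, this is illustrative rather than the paper’s exact procedure: the optimizer, step count, and the ℓ∞ clamp on the latent shift are assumptions, and the way the decoded parts are reassembled into a model input depends on how the target model was trained.

```python
import torch
import torch.nn.functional as F

def latent_space_attack(vae, target_model, x, label, steps=100, lr=0.05, eps=0.5):
    """Sketch of a latent-space attack: optimise a small perturbation of x's
    latent code so the decoded example is misclassified by target_model."""
    mu, _ = vae.encode(x)
    mu = mu.detach()                                   # freeze the clean latent code
    delta = torch.zeros_like(mu, requires_grad=True)   # perturbation in latent space
    optimizer = torch.optim.Adam([delta], lr=lr)

    for _ in range(steps):
        num_out, cat_logits = vae.decode(mu + delta)
        # reassemble a model input from the decoded numerical and categorical parts
        x_adv = torch.cat([num_out] + [F.softmax(l, dim=-1) for l in cat_logits], dim=-1)
        loss = -F.cross_entropy(target_model(x_adv), label)  # maximise classification loss
        optimizer.zero_grad()
        loss.backward()
        optimizer.step()
        with torch.no_grad():
            delta.clamp_(-eps, eps)                    # keep the latent shift small

    with torch.no_grad():
        num_out, cat_logits = vae.decode(mu + delta)
        return torch.cat([num_out] + [F.softmax(l, dim=-1) for l in cat_logits], dim=-1)
```

Because the perturbation lives in the latent space and the decoder only produces outputs it has learned from real data, the resulting example tends to stay on the data manifold rather than drifting into obviously implausible feature combinations.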
Key Findings and Practical Implications
The researchers conducted extensive evaluations across six diverse datasets and three different machine learning models (Multi-Layer Perceptron, Soft Decision Tree, and TabTransformer). Their findings highlight several important points:
- Superior Reconstruction: The VAE, especially when trained with a classification loss, demonstrated excellent ability to reconstruct mixed-type tabular data, preserving its predictive performance.
- Imperceptibility is Key: While traditional attacks often achieved high success rates in fooling models, they frequently produced data that was statistically an outlier, making them easily detectable. The VAE-based method, however, consistently achieved significantly lower outlier rates, meaning the adversarial examples remained ‘in-distribution.’
- In-Distribution Success Rate (IDSR): The paper introduces a new metric, IDSR, which combines attack effectiveness with imperceptibility. The proposed VAE-based method consistently achieved the best or second-best IDSR across various scenarios, demonstrating its practical utility (a sketch of how such a metric might be computed follows this list).
- Sparsity Control: The study also explored how to control the number of features modified in an attack. While explicit methods for sparsity control showed limited improvement, the VAE’s inherent ability to learn compact representations naturally led to more targeted and minimal perturbations.
- VAE vs. GAN: The research validated the choice of VAE over Generative Adversarial Networks (GANs) for this task, showing that VAEs offer superior reconstruction fidelity, especially for categorical features, which is crucial for generating realistic tabular data.
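The paper’s exact IDSR definition is not reproduced here, but one plausible reading (an assumption for illustration) is the fraction of attacked examples that both flip the model’s prediction and pass an outlier or anomaly check:

```python
import numpy as np

def in_distribution_success_rate(model_preds, true_labels, outlier_flags):
    """Illustrative IDSR-style metric: the fraction of attacked examples that
    both fool the model AND are not flagged as outliers. The paper's exact
    definition may differ; this is a sketch."""
    fooled = np.asarray(model_preds) != np.asarray(true_labels)  # attack changed the prediction
    in_dist = ~np.asarray(outlier_flags, dtype=bool)             # example passes an outlier check
    return float(np.mean(fooled & in_dist))

# Example: 5 adversarial examples, 4 fool the model, but one of those is an outlier
print(in_distribution_success_rate([1, 1, 0, 1, 1], [0, 0, 0, 0, 0],
                                    [False, True, False, False, False]))  # 0.6
```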
In conclusion, this research underscores the importance of generating ‘on-manifold’ adversarial examples for tabular data. By leveraging the power of VAEs to perturb data in a learned latent space, the authors provide a robust and practical framework for creating imperceptible adversarial attacks. This work is vital for understanding and improving the robustness of machine learning systems that handle real-world tabular information.
The source code for this research is openly available, and further details can be found in the full research paper.


