Uncovering Hidden Privacy Risks: Entity-Level Membership Inference in Large Language Models

TLDR: This paper introduces EL-MIA, a new framework to quantify privacy risks in Large Language Models by detecting if specific sensitive entities (like names or phone numbers) were part of their training data. Traditional methods focus on entire documents, but EL-MIA targets finer-grained information. The researchers developed a benchmark dataset and proposed new attack methods that significantly outperform existing techniques, revealing that LLMs are vulnerable to entity-level privacy breaches, especially for low-entropy data types. The study also explores how model size, training epochs, and attribute types influence this susceptibility.

Large Language Models (LLMs) have become incredibly powerful, excelling at a vast array of language tasks. However, their widespread use also brings significant privacy concerns, particularly regarding the memorization of sensitive information, known as Personally Identifiable Information (PII), from their training data. This memorization can lead to serious privacy breaches, regulatory compliance issues, and risks in real-world applications.

Traditional methods for detecting such privacy risks, called Membership Inference Attacks (MIAs), typically focus on determining whether an entire document or a long sequence of text was part of a model’s training dataset. While these methods have made progress, they often fall short when it comes to identifying risks at a much finer, more granular level – specifically, the entity level. This means they struggle to tell if individual sensitive attributes, such as a person’s name, date of birth, address, or phone number, were included in the training data.

Introducing EL-MIA: A New Approach to Privacy Auditing

A recent research paper, EL-MIA: Quantifying Membership Inference Risks of Sensitive Entities in LLMs, proposes a novel framework called “EL-MIA” (Entity-Level Membership Inference Attack) to address this gap. Developed by Ali Satvaty, Suzan Verberne, and Fatih Turkmen from the University of Groningen and Leiden University, EL-MIA aims to audit the vulnerability of specific sensitive entities within LLM training data. This approach allows for more precise privacy risk assessments and supports the development of stronger defenses.

The core idea behind EL-MIA is to determine if a particular entity (e.g., “John Doe”) was part of the training data, given a partial or redacted sentence (e.g., “<full name> teaches at the University of New York.”). This is crucial because attackers often seek to confirm the presence of specific sensitive fields rather than entire text samples.

Building a Benchmark for Entity-Level Risks

To evaluate EL-MIA, the researchers constructed a new benchmark dataset. This dataset is built upon the existing AI4Privacy dataset, which contains a rich variety of annotated PII. For each sensitive entity, two versions of a sentence are created: a “Member” version with the original PII value from the dataset, and a “Non-member” version where the PII is replaced by a randomly selected value of the same type. This setup ensures that any differences in model behavior can be attributed solely to the specific PII under test.

Novel Attack Methods Outperform Existing Techniques

The paper systematically compares existing MIA techniques against two newly proposed methods designed specifically for entity-level attacks:

Reference-set normalization: This method scores how much a sensitive entity candidate is favored by the model compared to a matched set of plausible alternatives. It uses log-likelihood ratios to make this determination.
Enhancing signal with suffix scoring: This simple heuristic further sharpens membership signals by focusing only on the tokens immediately following the candidate entity, masking out the prefix. This helps to reduce noise from generic context and highlight memorized local continuations.

The experimental results, using Pythia models of various scales (from 160M to 6.9B parameters), clearly demonstrate that the proposed reference-set attacks, especially the suffix variant, consistently outperform all existing MIA methods. This highlights the limitations of general MIA methods when applied to the fine-grained entity-level threat model.

Key Findings and Insights

The research provides several important insights into entity-level memorization:

Vulnerability by Entity Type: The most vulnerable categories of sensitive entities are often small, low-entropy, and single-token domains like gender (e.g., “male,” “female”), cardinal directions, or currency codes. This is because models can assign sharply peaked probabilities to a limited number of options, making membership cues easier to detect.
Effect of Model Size and Training: While larger models generally show increased susceptibility, the rate of increase is size-dependent. Interestingly, mid-sized models (around 1B parameters) can sometimes be more susceptible in earlier training epochs, suggesting a complex interplay between model capacity, overfitting, and memorization dynamics. Susceptibility generally increases with more training epochs across all model sizes.
Token Count and Entities: The length of prefix tokens (text before the sensitive attribute) positively contributes to attack success, especially for larger models after the first epoch. The number of sensitive entities in a sample also plays a role, with complex samples initially benefiting from privacy but larger models eventually memorizing them more easily after extensive training.

Also Read:

Implications and Future Directions

The EL-MIA framework and its findings underscore the significant vulnerability of LLMs to entity-level privacy breaches. Even after just one epoch of training, LLMs exhibit substantial susceptibility under properly designed attacks. The researchers have made their benchmark and trained model checkpoints publicly available, providing a valuable resource for the community to develop new attack strategies and defense mechanisms.

This work serves as a crucial call for standardized evaluation of PII exposure, which is a realistic and pressing threat in operational LLM deployments. Future research will focus on expanding the threat model to different access levels, exploring variations of auxiliary data, developing further attack methods, and, critically, proposing robust defense strategies against these sophisticated entity-level privacy attacks.

Financial Sector Fortifies Against Surging AI-Powered Scams

Deloitte’s 2025 Outlook: Navigating Escalating AI Challenges in Human Capital

Salesforce Study Reveals Data Quality is Pivotal for Employee Trust in AI Adoption

Top Executives Sidestep Company AI Guidelines, Fueling Shadow AI Risks

Intel’s Evolving IP Strategy: A Calculated Shift Towards Core AI Innovation

Generative AI Prompts Increased Workforce Surveillance in Indian IT Sector

Financial Sector Fortifies Against Surging AI-Powered Scams

Deloitte’s 2025 Outlook: Navigating Escalating AI Challenges in Human Capital

Salesforce Study Reveals Data Quality is Pivotal for Employee Trust in AI Adoption

Top Executives Sidestep Company AI Guidelines, Fueling Shadow AI Risks

Intel’s Evolving IP Strategy: A Calculated Shift Towards Core AI Innovation

Generative AI Prompts Increased Workforce Surveillance in Indian IT Sector

Uncovering Hidden Privacy Risks: Entity-Level Membership Inference in Large Language Models

Introducing EL-MIA: A New Approach to Privacy Auditing

Building a Benchmark for Entity-Level Risks

Novel Attack Methods Outperform Existing Techniques

Key Findings and Insights

Implications and Future Directions

Gen AI News and Updates

Vesl AI Recognized for AI Infrastructure Innovation with ASOCIO Digital Summit Award

AT&T Unleashes Agentic AI Across Business Operations for Enhanced Efficiency and Innovation

Visier Unveils Model Context Protocol (MCP) for AI Agents to Govern People Data Across Enterprises

Boosting Business Efficiency: A New AI and Big Data Model for Process Optimization

AlphaCast: A New Approach to Time Series Prediction Through Human-AI Collaboration

New Graph Neural Networks Improve Reasoning in Assumption-Based Argumentation

Enhancing AI Reasoning: How Recursive Refinement and Multi-Agent Systems Improve Language Model Performance

ARGUS: A Proactive Framework for Enhancing Autonomous Driving Safety

Generative AI Powers Next-Gen Autonomous Emergency Response

OR-R1: Advancing Automated Optimization with Smart, Data-Efficient AI

Enhancing GUI Agents with Memory: A New Framework for History-Aware Reasoning

ProBench: A Deeper Look into How We Evaluate AI Agents for Mobile Apps

Enhancing Large Language Model Reasoning with Concise Outputs

Ensuring Trust in Autonomous AI: A Two-Layered Monitoring Approach for Agentic Systems

MedFuse: A Multiplicative Approach to Understanding Irregular Clinical Time Series Data

HyperD: A New Framework for More Accurate and Robust Traffic Predictions

Beyond Training: Researchers Propose ‘Model Raising’ for AI with Intrinsic Values

Bridging the Divide: Why AI Needs a Qualitative Revolution

Language Models Enhance Safety Certificate Synthesis for Dynamic Systems

Subscribe to get the latest news and updates