TLDR: A groundbreaking new tool named ‘Unmarker’ has emerged, posing a significant challenge to existing AI image watermarking techniques. Developed by researchers, Unmarker is described as the first practical universal attack capable of effectively erasing both traditional and advanced semantic watermarks embedded in AI-generated images, raising concerns about the future of deepfake detection.
The integrity of AI-generated content, particularly in the face of rising deepfake concerns, is under scrutiny following the development of a novel attack tool called ‘Unmarker.’ This new research, slated to be presented at the prestigious IEEE S&P 2025 conference, reveals a method capable of universally circumventing defensive watermarking techniques designed to identify AI-generated images.
Defensive watermarking has been a key strategy for Generative AI (GenAI) providers to embed hidden fingerprints within their images, enabling subsequent detection of deepfakes. However, Unmarker, as detailed in the research paper ‘UnMarker: A Universal Attack on Defensive Image Watermarking,’ fundamentally disrupts this defense mechanism.
Unlike previous attacks, Unmarker operates without requiring detector feedback, specific knowledge of the watermarking scheme, or complex denoising pipelines. Its effectiveness stems from an in-depth analysis of watermarking paradigms, which revealed that robust schemes typically construct their watermarks within the spectral amplitudes of an image. Unmarker exploits this by employing two novel adversarial optimizations designed to disrupt these spectral amplitudes, thereby erasing the embedded watermarks.
Evaluations against state-of-the-art watermarking schemes have demonstrated Unmarker’s formidable capabilities. It not only defeats traditional watermarking methods while maintaining superior image quality compared to existing attacks but also successfully breaks semantic watermarks. Semantic watermarks, which alter an image’s underlying structure and were considered the future of defensive watermarking, saw their best detection rates plummet to as low as 43% when subjected to Unmarker, effectively rendering them useless.
Also Read:
- Researchers Unveil Zero-Click Prompt Injection Vulnerabilities in AI Agents at Black Hat Conference
- Ethical Storm Brews Over AI-Generated Deepfakes: Calls for Stricter Regulation Intensify
This development has significant implications for the ongoing battle against malicious uses of GenAI, such as the creation of deepfakes for political smear campaigns or non-consensual pornography. The researchers behind Unmarker suggest that their findings indicate defensive watermarking may not be a viable long-term defense against deepfakes, urging the scientific community to actively explore alternative detection and defense strategies.


