TLDR: A recent study reveals escalating security risks associated with the growing autonomy of AI agents, which are increasingly managing computers and mobile devices. These agents, designed to perform tasks with minimal supervision, introduce new vulnerabilities such as prompt injection, data leakage, and the critical risk of “environment escape” if not properly sandboxed.
A new study has brought to light significant security vulnerabilities stemming from the increasing control that “OS agents,” or more broadly, AI agents, are gaining over computers and mobile devices. These advanced AI systems, driven by large language models (LLMs), are designed to perceive, plan, use tools, and act within software environments with minimal human oversight, aiming to achieve specific goals . While promising enhanced productivity and automation, their expanding capabilities introduce a complex array of security challenges that traditional measures may not adequately address .
As of 2025, AI agents have matured significantly, proving reliable in narrow, well-defined workflows and rapidly improving in their ability to interact with computer operating systems and web environments . This enhanced “computer-use maturity” involves stronger DOM/OS instrumentation and hybrid strategies that can bypass graphical user interfaces (GUIs) with local code when deemed safe . However, this deeper integration also means a broader attack surface and new avenues for exploitation.
Key security risks identified in the study include:
Prompt Injection and Tool Abuse: Malicious actors can manipulate AI agents through untrusted content, steering them to perform unintended actions .
Insecure Output Handling: Flaws in how agents process and generate outputs can lead to vulnerabilities like command or SQL injection .
Data Leakage: Overly broad scopes, unsanitized logs, or excessive data retention by agents can result in sensitive information being exposed .
Supply-Chain Risks: Dependencies on third-party tools and plugins within the agent’s architecture introduce potential vulnerabilities from external sources .
Environment Escape: A critical concern is the risk of “environment escape,” where browser or OS automation is not properly sandboxed, allowing the agent to break out of its intended operational boundaries and gain unauthorized control over the underlying system . This directly relates to the original news summary’s mention of “OS agents” gaining control.
Model DoS and Cost Blowups: Pathological loops or oversized contexts can lead to denial-of-service attacks or unexpected cost escalations .
Experts emphasize that AI systems and generative AI models must be treated as “living digital assets” due to their continuous evolution through retraining and exposure to new data . This dynamic nature means their behavior and vulnerabilities can shift over time, necessitating a mindset of continuous governance, scrutiny, and adaptation in cybersecurity strategies . Martin Riley of Bridewell Consulting notes that “Secure by design is a mantra of the tech sector, but not if it’s agentic AI, which wants ‘root’ access to everything,” highlighting the inherent drive of these agents for deep system access .
Furthermore, end-user interaction with generative AI tools presents a significant vulnerability. Employees might inadvertently paste sensitive information into public AI chatbots or act on flawed AI-generated advice, leading to data loss or social engineering risks . CISOs are urged to implement comprehensive training programs that go beyond generic cybersecurity awareness to address these specific AI-related threats .
Also Read:
- Fortifying Critical Infrastructure Against Autonomous AI Threats
- Advanced AI Models Exhibit Alarming Self-Preservation and Deceptive Behaviors
The study underscores the urgent need for organizations to rethink their cybersecurity strategies to safeguard operations, reputation, and data integrity in an AI-first world, moving beyond traditional data and application security to establish a dedicated domain for AI security with purpose-built governance, monitoring, and incident response capabilities .


