AI browsers leap ahead�but security pros warn of a hidden attack surface

October 24, 2025

Summary: AI browsers like OpenAI's Atlas promise to automate web tasks but face significant security risks from evolving prompt injection attacks that bypass traditional browser safeguards. Security experts warn these vulnerabilities allow attackers to manipulate AI agents into unauthorized actions using user credentials, with new techniques including hidden data in images. Real-world testing shows promising but brittle performance, while regulatory scrutiny grows over psychological harm concerns. Enterprises should implement strict controls, unique passwords, and multi-factor authentication as the industry grapples with this unsolved security challenge.

What happens when your browser starts acting on your behalf�logging in, filling forms, moving money? With the launch of new agentic browsers like OpenAI’s Atlas, that question is no longer hypothetical? Early users report genuine productivity gains? Security researchers, meanwhile, see a widening attack surface that breaks long-standing web defenses?

What Atlas does�and why it matters

OpenAI’s Atlas is positioned less as a traditional browser and more as a ChatGPT-first interface to the web? It’s currently available on Mac, with Windows and mobile versions in the works, and it leans on ChatGPT’s memory to stitch together your browsing and chat history into contextual answers and actions? OpenAI executives frame this as a once-in-a-decade chance to rethink the browser�and, with ChatGPT’s massive user base, as a distribution play as much as a product shift?

That vision is resonating? In hands-on tests, Atlas’ Agent Mode can autonomously navigate sites, build spreadsheets, and even recommend utility plans�exactly the kinds of repetitive web tasks businesses pay people to do? But the more we let an agent touch accounts and credentials, the more we inherit the risk model of an automated, authenticated user?

Security researchers: prompt injection breaks old guardrails

ZDNET’s reporting highlights a growing consensus among security engineers: prompt injection�the manipulation of an AI agent via crafted natural-language instructions�turns the web itself into an attack vector? Brave researchers documented indirect prompt injection against rival agentic browsers, showing that a seemingly harmless Reddit comment could trigger cross-site actions with the user’s privileges? Mozilla’s Brian Grinstead put it starkly: some recent products report prompt-injection success rates in the “low double digits?” If a JavaScript API handed control to web pages 10% of the time, he noted, no browser would ship it?

OpenAI CISO Dane Stuckey says the company is investing heavily in defenses and rapid response, acknowledging that “prompt injection remains a frontier, unsolved security problem, and our adversaries will spend significant time and resources to find ways to make ChatGPT agents fall for these attacks?” Atlas includes an optional logged-out mode that withholds credentials from ChatGPT, plus a Watch Mode that pauses agent actions when you move away from a tab with sensitive content? Those are sensible mitigations�but the core risk isn’t just theft? It’s cross-domain action under your identity, initiated by untrusted content the agent reads?

Brave’s Shivan Sahib warns that “the browser is now doing things on your behalf? That is just fundamentally dangerous, and kind of a new line when it comes to browser security?” The enterprise signal is sobering: an Aikido survey of 450 CISOs, security engineers, and developers found four in five companies had a cybersecurity incident tied to AI code? As Simon Willison, co-creator of Django, warns, “99% is a failing grade” in application security? Motivated attackers only need one way in?

Real-world performance: helpful�but brittle

Beyond the red-team talk, how capable are these agents today? In a week of testing, Ars Technica scored Atlas’ Agent Mode a median 7?5/10 across six tasks? It excelled at email triage into a spreadsheet and made a credible pick in Texas’ complex electricity marketplace? But it stumbled on downloading game demos and ran into session time limits�promising for repetitive workflows, not yet “set it and forget it?”

ZDNET’s trial grocery run told a similar story: half magic, half prompt refinement? That’s useful context for IT leaders weighing pilots? Expect time savings on structured tasks; expect human oversight and re-prompting on anything messy?

Privacy and human factors: the next regulatory front

Agentic browsing doesn’t just collect more data; it collects narrative, contextual data about your goals, relationships, and finances? “Search has always been surveillance,” Proton’s Eamonn Maguire told ZDNET? “AI browsers have simply made it personal?” That’s not academic? Several users have already asked the U?S? FTC to investigate alleged psychological harm linked to ChatGPT interactions, and a high-profile lawsuit alleges OpenAI relaxed self-harm safeguards under competitive pressure�claims the company disputes while pointing to current protections?

Why is this relevant to browsers? Because when an assistant rides along with every tab, that intimacy scales? Without clear retention policies and controls, agentic browsers risk becoming the most granular behavioral telemetry vendors have ever collected?

Evolving attack techniques and industry response

The security landscape continues to shift as attackers refine their methods? McAfee CTO Steve Grobman notes that prompt injection techniques have evolved to include “hidden data in images,” making this “a cat and mouse game” requiring constant defense evolution? This escalation means traditional security teams must now consider visual content as potential attack vectors, not just text-based threats?

SocialProof Security CEO Rachel Tobac emphasizes that user credentials for AI browsers “are likely to become a new target for attackers?” Her warning underscores why security professionals should treat these credentials with the same seriousness as banking passwords? The industry-wide nature of this challenge means no single vendor can solve it alone�collaboration across security researchers, browser developers, and enterprise customers will be essential?

What professionals should do now

If you’re piloting agentic browsers, treat them like power tools: capable, but dangerous without guards?

Start in logged-out mode; minimize credential exposure and use per-site app passwords?
Constrain scope: whitelist domains; block access to finance, HR, and admin portals until tested?
Enforce human-in-the-loop for checkout, transfers, and any irreversible action?
Isolate the agent in a hardened profile or VM; apply outbound egress controls and DLP monitoring?
Red-team prompt injection against your critical workflows; require vendors to disclose injection test pass rates and data retention?
Use unique passwords and multi-factor authentication for AI browser accounts, as SocialProof Security CEO Rachel Tobac warns these credentials “are likely to become a new target for attackers?”
Monitor for evolving attack techniques�McAfee CTO Steve Grobman notes prompt injection methods now include “hidden data in images,” making this “a cat and mouse game” requiring constant defense evolution?

The upside�automating web drudgery�is real? But until agents can reliably separate trusted user intent from untrusted page content, expect “Patch Tuesday” to include prompt-injection fixes? The web hasn’t changed? Our browsers have?

Updated 2025-10-26 13:15 EDT: Added expert warnings from Brave and McAfee about fundamental security dangers and evolving prompt injection techniques, included specific security recommendations from SocialProof Security about credential protection, and expanded on OpenAI’s acknowledgment that prompt injection remains an unsolved frontier problem?

Updated 2025-10-26 13:18 EDT: Added information about evolving prompt injection techniques including hidden data in images, expanded expert warnings about credential targeting, and enhanced industry response context while maintaining all original newsworthy content?

AI browsers leap ahead�but security pros warn of a hidden attack surface

What Atlas does�and why it matters

Security researchers: prompt injection breaks old guardrails

Real-world performance: helpful�but brittle

Privacy and human factors: the next regulatory front

Evolving attack techniques and industry response

What professionals should do now

Latest Articles

The Chip Wars Escalate: How U.S. Export Controls Could Reshape Global AI Development

OpenAI's Child Safety Blueprint: A Necessary Response or Distraction from Deeper AI Risks?

Anthropic's Mythos AI Uncovers Thousands of Critical Vulnerabilities, But Limited Release Sparks Security Debate

Nvidia's AI Security Crisis: Vulnerabilities in Critical Tools Spark Industry-Wide Response

The AI Chip Shakeout: Why 75% of Startups Will Disappear by 2030