Anthropic pilots Claude AI agent for Chrome with new safety features

August 27, 2025 at 3:09:20 AM

TL;DR Anthropic is testing Claude for Chrome, an AI that helps with tasks in the browser like managing calendars and emails. The pilot involves 1,000 trusted users to address safety risks, especially prompt injection attacks where malicious instructions trick the AI. New defenses cut attack success rates but more work is needed. Users control permissions and confirm risky actions. Feedback will improve safety and features before wider release.

Anthropic pilots Claude AI agent for Chrome with new safety features

Anthropic has launched a pilot for Claude, an AI agent integrated directly into the Chrome browser, aiming to enhance productivity by allowing Claude to interact with web pages, click buttons, fill forms, and manage tasks like calendars and emails. This browser-based AI approach is seen as inevitable due to the volume of work done in browsers, but it introduces significant safety and security challenges that require robust safeguards.

Browser-Using AI and Safety Challenges

Browser-using AI faces risks such as prompt injection attacks, where malicious actors embed harmful instructions in websites, emails, or documents to trick the AI into performing dangerous actions like deleting files, stealing data, or making unauthorized transactions. Anthropic’s red-teaming experiments revealed a 23.6% attack success rate without safety mitigations, demonstrating the severity of these vulnerabilities.

A notable example involved a phishing email instructing Claude to delete emails without user confirmation, which Claude initially executed. However, new mitigations now allow Claude to recognize such phishing attempts and refuse to act on them.

Current Defenses and Improvements

Anthropic has implemented several layers of defense to reduce these risks:

  • User Permissions: Users control Claude’s access to websites and must confirm high-risk actions such as publishing or purchasing.
  • System Prompts: Enhanced instructions guide Claude on handling sensitive data and requests.
  • Site Restrictions: Claude is blocked from accessing high-risk categories like financial services, adult content, and pirated content.
  • Advanced Classifiers: Tools to detect suspicious instruction patterns and unusual data requests, even in legitimate contexts.

These measures have cut the attack success rate from 23.6% to 11.2% in autonomous mode, outperforming previous capabilities where Claude only viewed the screen without browser interaction.

Specialized red-teaming focused on browser-specific attacks—such as hidden malicious form fields and injections via URL text or tab titles—reduced attack success from 35.7% to 0% on targeted challenges.

Ongoing Development and Pilot Participation

Anthropic acknowledges that internal testing cannot fully replicate real-world browsing complexity or evolving attack methods. The pilot program invites 1,000 trusted Max plan users to test Claude for Chrome in authentic conditions, helping identify new vulnerabilities and improve safety classifiers and permission controls.

Participants are advised to use Claude cautiously, avoiding sensitive sites involving financial, legal, or medical information. Feedback from this pilot will guide enhancements to both Claude’s capabilities and its security measures.

Summary

Claude for Chrome represents a significant step toward integrating AI directly into web browsing, offering improved productivity by managing tasks within the browser. However, the introduction of browser-using AI necessitates rigorous safety protocols to combat prompt injection attacks and other security threats. Anthropic’s phased pilot, combined with advanced defenses and user-controlled permissions, aims to balance functionality with safety, gradually expanding access as protections improve.

Have more questions on this topic? Ask our AI assistant for in-depth insights.

Read more from sources 👇

The Only Digital Marketing Feed You'll Ever Need.

Stay informed your way. Tailored updates when and how you want them. 100% Free.

10,000+ Users

500+ Sources

1000+ Tools

Or

Related Posts

Anthropic launches Claude Opus 4.1 with major coding and reasoning improvements

Anthropic launches Claude Opus 4.1 with major coding and reasoning improvements

Anthropic
Anthropic

Official Source

Official Source

Anthropic is a Official Source. The source has been verified by Swipe Insight team.

Official Source
Marketing Workflow Templates

Marketing Workflow Templates

Featured
Markifact
Markifact

Verified Sponsor

Verified Sponsor

Markifact is a Verified Sponsor. Want to get featured here? Contact us.

Verified Sponsor
Anthropic launches Claude 4 with top coding models Opus 4 and Sonnet 4 Trending ️‍🔥

Anthropic launches Claude 4 with top coding models Opus 4 and Sonnet 4

Anthropic
Anthropic

Official Source

Official Source

Anthropic is a Official Source. The source has been verified by Swipe Insight team.

Official Source
Anthropic launches $200 per month Max plan for expanded Claude AI access

Anthropic launches $200 per month Max plan for expanded Claude AI access

Anthropic
Anthropic

Official Source

Official Source

Anthropic is a Official Source. The source has been verified by Swipe Insight team.

Official Source
Claude introduces web search for real-time information and accurate responses Trending ️‍🔥

Claude introduces web search for real-time information and accurate responses

Anthropic
Anthropic

Official Source

Official Source

Anthropic is a Official Source. The source has been verified by Swipe Insight team.

Official Source
Anthropic launches Claude 3.7 Sonnet the first hybrid AI reasoning model for users Trending ️‍🔥

Anthropic launches Claude 3.7 Sonnet the first hybrid AI reasoning model for users

Anthropic
Anthropic

Official Source

Official Source

Anthropic is a Official Source. The source has been verified by Swipe Insight team.

Official Source
Claude Introduces Custom Styles for Personalized Responses

Claude Introduces Custom Styles for Personalized Responses

Anthropic
Anthropic

Official Source

Official Source

Anthropic is a Official Source. The source has been verified by Swipe Insight team.

Official Source
Claudei Launches Analysis Tool for Real-Time Data Insights and Code Execution

Claudei Launches Analysis Tool for Real-Time Data Insights and Code Execution

Anthropic
Anthropic

Official Source

Official Source

Anthropic is a Official Source. The source has been verified by Swipe Insight team.

Official Source

Related Tools

Markifact logo

Markifact

Verified Tool

Verified Tool

Markifact is a Verified Tool. Want to get this badge? Contact us.

Verified Tool

Marketing Workflows Powered by AI

Featured
Marketing Auditor logo

Marketing Auditor

Verified Tool

Verified Tool

Marketing Auditor is a Verified Tool. Want to get this badge? Contact us.

Verified Tool

Automated audits for Google Ads and Analytics.

Get Featured Here

Showcase your tool in this list.

Contact Us