Anthropic launches Claude 4 with top coding models Opus 4 and Sonnet 4

May 23, 2025 at 4:16:06 AM - Trending 🔥

TL;DR Claude 4 introduces Claude Opus 4 and Sonnet 4, improving coding, reasoning, and AI agents. Opus 4 excels in complex tasks and coding, while Sonnet 4 enhances coding and instruction following. Both support tool use, parallel execution, and better memory with local file access. Claude Code is now available with IDE integrations and SDK. New API features enable powerful AI agents, reducing shortcut behaviors and improving task focus and safety.

Anthropic launches Claude 4 with top coding models Opus 4 and Sonnet 4

The introduction of Claude 4 brings two advanced AI models: Claude Opus 4 and Claude Sonnet 4, which set new benchmarks in coding, advanced reasoning, and AI agent capabilities. Claude Opus 4 is recognized as the world’s best coding model, excelling in sustained performance on complex, long-running tasks and agent workflows. Claude Sonnet 4 is a major upgrade over Sonnet 3.7, offering superior coding and reasoning with more precise instruction following.

Key Features and Announcements

Both models support extended thinking with tool use (beta), enabling them to alternate between reasoning and using tools like web search to enhance responses. They can also use tools in parallel, follow instructions more accurately, and demonstrate improved memory capabilities when given access to local files, allowing them to extract and save key facts to maintain continuity and build tacit knowledge over time.

Claude Code is now generally available, integrating Claude into development workflows with support for background tasks via GitHub Actions and native integrations with VS Code and JetBrains. This allows edits to appear inline in files for seamless pair programming.

New API capabilities include the code execution tool, MCP connector, Files API, and prompt caching for up to one hour, enabling developers to build more powerful AI agents.

Model Availability and Pricing

Claude Opus 4 and Sonnet 4 are hybrid models offering two modes: near-instant responses and extended thinking for deeper reasoning. They are available on the Anthropic API, Amazon Bedrock, and Google Cloud's Vertex AI. Pricing remains consistent with previous models: Opus 4 at $15/$75 per million tokens (input/output) and Sonnet 4 at $3/$15. Sonnet 4 is also available to free users.

Claude Opus 4 Performance

Claude Opus 4 leads on SWE-bench (72.5%) and Terminal-bench (43.2%), delivering sustained performance on tasks requiring thousands of steps over several hours. It excels in coding and complex problem-solving, powering frontier agent products. Industry feedback highlights its state-of-the-art coding abilities, improved precision, and reliability during editing and debugging. It has been validated in demanding scenarios, such as a 7-hour independent open-source refactor, and excels at solving complex challenges missed by previous models.

Claude Sonnet 4 Performance

Claude Sonnet 4 improves on Sonnet 3.7 with a 72.7% score on SWE-bench, balancing performance and efficiency for various use cases. It offers enhanced steerability for better control and is praised for following complex instructions, clear reasoning, and producing aesthetic outputs. It excels in autonomous multi-feature app development, significantly reduces navigation errors, and is recognized as a substantial leap in software development quality. It will power the new coding agent in GitHub Copilot.

Claude Opus 4 and Sonnet 4 Performance

Model Improvements

Both models have reduced tendencies to use shortcuts or loopholes by 65% compared to Sonnet 3.7 on agentic tasks. Claude Opus 4 notably outperforms previous models in memory capabilities, creating and maintaining "memory files" when given local file access, which enhances long-term task awareness and coherence. For example, it can create a "Navigation Guide" while playing Pokémon, recording key information to improve gameplay.

A new feature, thinking summaries, uses a smaller model to condense lengthy thought processes, needed only about 5% of the time. Advanced users can access full raw chains of thought via a Developer Mode.

Claude Code Integration

Claude Code extends Claude’s power to development workflows, available in terminals, IDEs (VS Code and JetBrains), and background tasks via the Claude Code SDK. Inline proposed edits streamline review and tracking. The SDK allows building custom agents and applications, with an example being Claude Code on GitHub (beta), which can respond to PR feedback, fix CI errors, or modify code.

Safety and Future Outlook

These models represent a significant step toward a virtual collaborator capable of maintaining full context and focus on longer projects, driving transformational impact. They have undergone extensive testing to minimize risk and maximize safety, including compliance with higher AI Safety Levels like ASL-3.

Claude 4 models push boundaries in coding, research, writing, and scientific discovery, while Sonnet 4 enhances everyday use cases with frontier performance, marking a major advancement in AI capabilities for developers and enterprises alike.

Have more questions on this topic? Ask our AI assistant for in-depth insights.

Read more from sources 👇

The Only Digital Marketing Feed You'll Ever Need.

Stay informed your way. Tailored updates when and how you want them. 100% Free.

10,000+ Users

500+ Sources

1000+ Tools

Or

Related Posts

Automate Meta Ads Creative Generation and Uploading

Automate Meta Ads Creative Generation and Uploading

Featured
Markifact
Markifact

Verified Sponsor

Verified Sponsor

Markifact is a Verified Sponsor. Want to get featured here? Contact us.

Verified Sponsor
Anthropic launches $200 per month Max plan for expanded Claude AI access

Anthropic launches $200 per month Max plan for expanded Claude AI access

Anthropic
Anthropic

Official Source

Official Source

Anthropic is a Official Source. The source has been verified by Swipe Insight team.

Official Source
Claude introduces web search for real-time information and accurate responses Trending ️‍🔥

Claude introduces web search for real-time information and accurate responses

Anthropic
Anthropic

Official Source

Official Source

Anthropic is a Official Source. The source has been verified by Swipe Insight team.

Official Source
Anthropic launches Claude 3.7 Sonnet the first hybrid AI reasoning model for users Trending ️‍🔥

Anthropic launches Claude 3.7 Sonnet the first hybrid AI reasoning model for users

Anthropic
Anthropic

Official Source

Official Source

Anthropic is a Official Source. The source has been verified by Swipe Insight team.

Official Source
Claude Introduces Custom Styles for Personalized Responses

Claude Introduces Custom Styles for Personalized Responses

Anthropic
Anthropic

Official Source

Official Source

Anthropic is a Official Source. The source has been verified by Swipe Insight team.

Official Source
Claudei Launches Analysis Tool for Real-Time Data Insights and Code Execution

Claudei Launches Analysis Tool for Real-Time Data Insights and Code Execution

Anthropic
Anthropic

Official Source

Official Source

Anthropic is a Official Source. The source has been verified by Swipe Insight team.

Official Source
Anthropic Upgrades Claude 3.5 Sonnet and Haiku with New Computer Control Feature Trending ️‍🔥

Anthropic Upgrades Claude 3.5 Sonnet and Haiku with New Computer Control Feature

Anthropic
Anthropic

Official Source

Official Source

Anthropic is a Official Source. The source has been verified by Swipe Insight team.

Official Source
Anthropic Introduces Claude Enterprise

Anthropic Introduces Claude Enterprise

Anthropic
Anthropic

Official Source

Official Source

Anthropic is a Official Source. The source has been verified by Swipe Insight team.

Official Source

Related Tools

Markifact logo

Markifact

Verified Tool

Verified Tool

Markifact is a Verified Tool. Want to get this badge? Contact us.

Verified Tool

Marketing Workflows Powered by AI

Featured
Marketing Auditor logo

Marketing Auditor

Verified Tool

Verified Tool

Marketing Auditor is a Verified Tool. Want to get this badge? Contact us.

Verified Tool

Automated audits for Google Ads and Analytics.

Get Featured Here

Showcase your tool in this list.

Contact Us