Anthropic has introduced Claude 3.7 Sonnet, a new frontier AI model designed to provide both immediate and thoughtful responses to user inquiries. This model is termed the first “hybrid AI reasoning model”, allowing users to activate its reasoning capabilities for varying durations of contemplation.
Claude 3.7 Sonnet is accessible via the Anthropic API, Amazon Bedrock, and Google Cloud's Vertex AI for developers. For business and consumer interactions, it is available on Claude.ai across web, iOS, and Android platforms. Pricing starts at $3 per million input tokens and $15 per million output tokens, with potential savings of up to 90% through prompt caching and 50% with batch processing.
Use Cases
Claude 3.7 Sonnet demonstrates advanced capabilities in understanding nuanced instructions, correcting mistakes, and generating insights from complex data. It supports various applications, including:
- Code Generation: Excelling in the software development lifecycle, it can handle tasks from planning to bug fixes and maintenance.
- Computer Use: Developers can integrate Claude to perform tasks like a human, interacting with screens and executing commands.
- Advanced Chatbots: Its enhanced reasoning and human-like tone make it suitable for chatbots that require data integration and action across systems.
- Knowledge Q&A: With a large context window, it effectively answers questions from extensive knowledge bases.
- Visual Data Extraction: Capable of extracting information from visuals, it is ideal for data analytics.
- Customer-facing Agents: It offers superior instruction following and reasoning for complex workflows.
- Content Generation and Analysis: Claude 3.7 Sonnet can create compelling content and analyze it deeply.
- Robotic Process Automation: It automates repetitive tasks with high instruction-following capabilities.
Benchmarks
Claude 3.7 Sonnet shows state-of-the-art performance in coding, vision, and reasoning tasks, particularly excelling in instruction-following and multimodal capabilities. It has outperformed previous models in benchmarks, including Pokémon gameplay tests.
Extensive testing and collaboration with external experts have been conducted to ensure Claude 3.7 Sonnet meets safety, security, and reliability standards. The safety card for this release outlines new safety results and addresses emerging risks associated with computer use and the benefits of reasoning models.