Models overview (Opus 4.7, Sonnet 4.6, Haiku 4.5, deprecation policy)
The Claude model family is a set of large language models from Anthropic, organized into three named tiers—Opus, Sonnet, and Haiku—each balancing capability, speed, and cost differently. Opus is the most powerful tier, designed for complex reasoning, long-horizon autonomous tasks, and demanding software engineering. Sonnet sits in the middle, offering a practical balance of intelligence and throughput suitable for most production workloads. Haiku is the fastest and most cost-efficient tier, built for real-time applications and high-volume processing where latency and cost matter most.
All current Claude models accept text and image inputs and produce text output. They support multilingual use and vision capabilities. Each model is identified by a string you pass in the API's `model` parameter, and starting with the Claude 4.6 generation, Anthropic moved to dateless model IDs (e.g., `claude-sonnet-4-6`) that act as pinned snapshots rather than rolling pointers.
Anthropic maintains a formal model lifecycle with four stages: Active (fully supported and recommended), Legacy (no longer updated, may be deprecated), Deprecated (still functional but not recommended, with a named replacement and scheduled retirement date), and Retired (API requests return an error). Anthropic commits to at least 60 days of notice before moving a model to Retired status.
When you’d use it
- ◆Real-time customer support chat — A SaaS company needs an AI assistant that responds to customer questions in under a second without running up large inference bills. Claude Haiku 4.5 handles the high request volume at low cost while still giving accurate, helpful answers.
- ◆Bulk document classification — A logistics firm processes thousands of shipping complaints daily and needs each one tagged with a department label (e.g., Billing, Delivery, Damage). Haiku 4.5 runs these classifications in parallel at a fraction of a cent per request.
- ◆Production coding assistant — An engineering team integrates an AI code reviewer into their CI pipeline. Claude Sonnet 4.6 reviews pull requests, suggests improvements, and explains changes at a speed and cost that fits continuous integration workflows.
- ◆Enterprise RAG over large document corpora — A legal team needs to ask questions across thousands of contracts. Claude Sonnet 4.6's 1M-token context window (generally available) lets the system ingest entire document histories in one request and perform comparative analysis without chunking.
- ◆Long-horizon autonomous software refactoring — A platform engineering team wants to migrate a legacy monolith to microservices. Claude Opus is tasked with analyzing hundreds of thousands of lines of code, identifying dependencies, and producing a step-by-step refactoring plan it can execute with minimal supervision.
What changed recently
- ◆2026-04-19 — Claude 3 Haiku (claude-3-haiku-20240307) reached hard retirement. All API requests to this model ID now return an error. Anthropic recommends migrating to Claude Haiku 4.5.
- ◆2026-02-19 — Claude Sonnet 3.7 was retired. API requests to that model ID now return an error.
- ◆2026-06-15 — Scheduled retirement date for Claude Sonnet 4 (claude-sonnet-4-20250514) and Claude Opus 4 (claude-opus-4-20250514) on the Claude API. Anthropic recommends migrating to Claude Sonnet 4.6 and Claude Opus 4.7 respectively before this date.
- ◆2026-03 — The 1M-token context window beta (previously gated behind the `context-1m-2025-08-07` header) was promoted to General Availability for Claude Sonnet 4.6 and Claude Opus 4.6 and later. The beta header is no longer required and the old beta for Sonnet 4.5 and Sonnet 4 has been retired.
This is the short version
The full chapter has three worked examples, the common pitfalls, and the workflow that makes it pay — plus the other 84 features, kept current.
Get Claude Master — $97 →