Extended thinking mode (in chat)
Extended thinking mode is a feature in Claude's chat interface that gives Claude extra time and computational space to reason through problems before delivering a final answer. Instead of switching to a different model, it allows the same model to work through a dedicated internal reasoning process — exploring multiple approaches, checking its own logic, and planning its response — before committing to an output.
When extended thinking is enabled, Claude generates 'thinking content blocks' that capture this internal reasoning. Users can see a collapsible 'Thinking' section above Claude's final response, showing a summary of the reasoning process. On Claude 4 models via the API, the full internal reasoning is encrypted for safety, and only a summary is returned. This transparency lets users understand how Claude arrived at an answer and verify the logic.
Extended thinking is best suited for tasks where accuracy and depth matter more than speed: complex math, multi-step coding problems, trade-off analysis, research synthesis, and long agentic workflows. It is not universally beneficial — simple or intuitive tasks may not improve and can even suffer from over-reasoning.
When you’d use it
- ◆Complex math and logic problems — A student or professional needs to solve a multi-step algebra problem, probability puzzle, or formal logic proof where intermediate steps must be correct for the final answer to be valid.
- ◆Code debugging and optimization — A developer has a slow or broken function and needs Claude to trace execution paths, identify root causes, and evaluate multiple fix strategies before recommending a solution.
- ◆Decision and trade-off analysis — A manager is weighing major business or personal decisions — such as choosing a technology stack, evaluating a job offer, or planning a product roadmap — and needs Claude to systematically weigh competing factors and second-order consequences.
- ◆Deep research and synthesis — A researcher or analyst needs Claude to synthesize information from multiple sources, identify contradictions, and produce a well-reasoned summary or recommendation rather than a surface-level overview.
- ◆Multi-step agentic workflows with tool use — An advanced user or developer is running a long workflow where Claude must use tools (web search, code execution, document analysis) across multiple turns, reasoning between each tool call to adapt its strategy based on results.
What changed recently
- ◆2025-02 — Extended thinking launched with Claude 3.7 Sonnet. Users on claude.ai gained the ability to see Claude's step-by-step reasoning in a visible, expandable 'Thinking' section for the first time. Claude 3.7 Sonnet could produce either near-instant responses or extended thinking responses.
- ◆2025-05 — Interleaved thinking released in public beta (API beta header: interleaved-thinking-2025-05-14). This allows Claude to generate thinking blocks between individual tool calls in agentic workflows, enabling more adaptive multi-step reasoning after receiving tool results.
- ◆2025-10 — The clear_thinking_20251015 context editing strategy introduced for API use. This strategy automatically removes older thinking blocks from previous conversation turns, helping manage context window size in long multi-turn conversations. Note: clearing thinking blocks invalidates prompt cache for those turns.
- ◆2025-late — Claude 4 model family launched with extended thinking support. The Messages API for Claude 4 models returns a summarized version of Claude's thinking rather than the full raw reasoning. Full internal thinking is encrypted in a signature field. Summarized thinking preserves reasoning quality benefits while preventing misuse.
This is the short version
The full chapter has three worked examples, the common pitfalls, and the workflow that makes it pay — plus the other 84 features, kept current.
Get Claude Master — $97 →