← ContentsClaude.ai · beginner

Image input / vision

Image input (also called vision) is Claude's ability to receive, read, and reason about static image files alongside text. When you attach a photo, screenshot, chart, diagram, or document image to a conversation, Claude analyzes the visual content and responds in natural language — describing what it sees, extracting text, interpreting data, comparing designs, or answering questions about the image. This is a one-way capability: Claude can understand images but cannot generate, edit, manipulate, or create them. Vision works across all current Claude models (Claude 3 and Claude 4 families) and is available on every plan tier — Free through Enterprise. You can include multiple images in a single turn; on claude.ai the limit is 20 images per turn (10 MB each), while the API supports up to 600 images per request subject to overall payload size limits. Practical applications range from simple tasks like reading a receipt or describing a photo, to intermediate work like interpreting financial charts or reviewing code screenshots, to advanced workflows like analyzing multi-page PDFs containing embedded charts, converting wireframe sketches into code, or powering document-processing pipelines in enterprise knowledge bases.

When you’d use it

  • Receipt and invoice data extractionA user photographs a crumpled restaurant receipt and asks Claude to pull out the date, vendor name, line items, subtotal, tax, and total for an expense report. Claude reads the image and returns a structured table ready to paste into a spreadsheet.
  • Chart and graph interpretationAn analyst uploads a screenshot of a stacked bar chart from a BI dashboard and asks Claude to summarize the key trends and call out which segment grew fastest quarter-over-quarter. Claude reads axis labels, bar heights, and legend colors to produce a written narrative.
  • Code screenshot reviewA developer shares a screenshot of an error-throwing Python function and asks Claude to identify bugs and suggest fixes. Because the code cannot be copy-pasted (it is in an image), vision allows Claude to read the text and reason about it as code.
  • Wireframe or mockup to code conversionA product designer uploads a hand-drawn wireframe sketch of a web dashboard and asks Claude to generate the corresponding React and Tailwind CSS code. Claude interprets layout, component placement, and hierarchy from the image to scaffold the frontend.
  • Document and form analysisAn operations team uploads scanned paper forms — insurance applications, onboarding documents, or compliance checklists — and asks Claude to extract structured data or verify that all required fields are filled in.

What changed recently

  • 2026-04-16Claude Opus 4.7 introduced high-resolution image support, raising the maximum image resolution to 2,576 pixels on the long edge (up from 1,568 px on prior models). High-resolution images on Opus 4.7 can consume up to approximately 3x more image tokens (~4,784 vs ~1,568 tokens per image).
  • 2026-04-17Claude Design launched in research preview for Pro, Max, Team, and Enterprise subscribers. Powered by Opus 4.7, it accepts image uploads (screenshots, designs, PDFs) and uses vision to power design generation at claude.ai/design.
  • 2025URL-based source blocks for images and PDFs were added to the Messages API. Developers can now reference images via direct URL in API requests instead of base64-encoding them, reducing request size and latency.
  • 2025Vision PDF integration improved: the current Claude engine triggers the vision encoder for charts and diagrams embedded within PDFs simultaneously with text extraction, rather than relying solely on the PDF text layer.
This is the short version

The full chapter has three worked examples, the common pitfalls, and the workflow that makes it pay — plus the other 84 features, kept current.

Get Claude Master — $97
CLAUDEMASTER
An independent publication.
Independent product. Not affiliated with or endorsed by Anthropic. “Claude” is a trademark of Anthropic, used here only to describe the subject of this manual.
© 2026 Claude Master — All rights reserved.