The Anthropic Claude models on Vertex AI offer fully managed and serverless models as APIs. To use a Claude model on Vertex AI, send a request directly to the Vertex AI API endpoint. Because the Anthropic Claude models use a managed API, there's no need to provision or manage infrastructure.
You can stream your Claude responses to reduce the end-user latency perception. A streamed response uses server-sent events (SSE) to incrementally stream the response.
You pay for Claude models as you use them (pay as you go), or you pay a fixed fee when using [provisioned throughput][pt]. For pay-as-you-go pricing, see Anthropic Claude models on the Vertex AI pricing page.
Available Claude models
The following models are available from Anthropic to use in Vertex AI. To access a Claude model, go to its Model Garden model card.
Anthropic's Claude models support Vertex AI request-response logging. Enable 30-day request-response logging of your prompt and completion activity to track any model misuse by your users. For more information, see Log requests and responses.
Claude Opus 4
Claude Opus 4 is Anthropic's most intelligent model and is state-of-the-art for coding and agent capabilities, especially agentic search. It excels for customers needing frontier intelligence:
- Advanced coding: Independently plan and execute complex development tasks end-to-end. It adapts to your style and maintains high code quality throughout.
- Long-horizon tasks and complex problem solving (virtual collaborator): Unlock new use cases that involves long-horizon tasks that require memory, sustained reasoning, and long chains of actions.
- AI agents: Enable agents to tackle complex, multi-step tasks that require peak accuracy.
- Agentic search and research: Connect to multiple data sources to synthesize comprehensive insights across repositories.
- Content creation: Create human-quality content with natural prose. Produce long-form creative content, technical documentation, marketing copy, and frontend design mockups.
- Memory and context management: Incorporates memory capabilities that allow it to effectively summarize and reference previous interactions.
Go to the Claude Opus 4 model card
Claude Sonnet 4
Claude Sonnet 4 balances impressive performance for coding with the right speed and cost for high-volume use cases:
- Coding: Handle everyday development tasks with enhanced performance—power code reviews, bug fixes, API integrations, and feature development with immediate feedback loops.
- AI Assistants: Power production-ready assistants for real-time applications—from customer support automation to operational workflows that require both intelligence and speed.
- Efficient research: Perform focused analysis across multiple data sources while maintaining fast response times. Ideal for rapid business intelligence, competitive analysis, and real-time decision support.
- Large-scale content: Generate and analyze content at scale with improved quality. Create customer communications, analyze user feedback, and produce marketing materials with the right balance of quality and throughput.
Go to the Claude Sonnet 4 model card
Claude 3.7 Sonnet
Claude 3.7 Sonnet is Anthropic's most intelligent model to date and the first Claude model to offer extended thinking—the ability to solve complex problems with careful, step-by-step reasoning. Claude 3.7 Sonnet is a single model where you can balance speed and quality by choosing between standard thinking for near-instant responses or extended thinking for advanced reasoning.
For more information about extended thinking, see Anthropic's documentation.
Claude 3.7 Sonnet is optimized for the following use cases:
- Agentic coding - Claude 3.7 Sonnet is state-of-the-art for agentic coding, and can complete tasks across the entire software development lifecycle—from initial planning to bug fixes, maintenance to large refactors. It offers strong performance in both planning and solving for complex coding tasks, making Claude 3.7 Sonnet an ideal choice to power end-to-end software development processes.
- Customer-facing agents - Claude 3.7 Sonnet offers superior instruction following, tool selection, error correction, and advanced reasoning for customer-facing agents and complex AI workflows.
- Computer use - Claude 3.7 Sonnet is our most accurate model for computer use, enabling developers to direct Claude to use computers the way people do.
- Content generation and analysis - Claude 3.7 Sonnet excels at writing and is able to understand nuance and tone in content to generate more compelling content and analyze content on a deeper level.
- Visual data extraction - With Claude 3.7 Sonnet's robust vision skills, it is the right choice for teams that want to extract raw data from visuals like charts or graphs as part of their AI workflow.
Go to the Claude 3.7 Sonnet model card
Claude 3.5 Sonnet v2
Claude 3.5 Sonnet v2 is a state-of-the-art model for real-world software engineering tasks and agentic capabilities. Claude 3.5 Sonnet v2 delivers these advancements at the same price and speed as Claude 3.5 Sonnet.
The upgraded Claude 3.5 Sonnet model is capable of interacting with tools that can manipulate a computer desktop environment. For more information, see the Anthropic documentation.
Claude 3.5 Sonnet is optimized for the following use cases:
- Agentic tasks and tool use - Claude 3.5 Sonnet offers superior instruction following, tool selection, error correction, and advanced reasoning for agentic workflows that require tool use.
- Coding - For software development tasks ranging from code migrations, code fixes, and translations, Claude 3.5 Sonnet offers strong performance in both planning and solving for complex coding tasks.
- Document Q&A - Claude 3.5 Sonnet combines strong context comprehension, advanced reasoning, and synthesis to deliver accurate and human-like responses.
- Visual data extraction - With Claude 3.5 Sonnet leading vision skills, Claude 3.5 Sonnet can extract raw data from visuals like charts or graphs as part of AI workflows.
- Content generation and analysis - Claude 3.5 Sonnet can understand nuance and tone in content, generating more compelling content and analyzing content on a deeper level.
Go to the Claude 3.5 Sonnet v2 model card
Claude 3.5 Haiku
Claude 3.5 Haiku, the next generation of Anthropic's fastest and most cost-effective model, is optimal for use cases where speed and affordability matter. It improves on its predecessor across every skill set. Claude 3.5 Haiku is optimized for the following use cases:
- Code completions - With its rapid response time and understanding of programming patterns, Claude 3.5 Haiku excels at providing quick, accurate code suggestions and completions in real-time development workflows.
- Interactive chat bots - Claude 3.5 Haiku's improved reasoning and natural conversation abilities make it ideal for creating responsive, engaging chatbots that can handle high volumes of user interactions efficiently.
- Data extraction and labeling - Leveraging its improved analysis skills, Claude 3.5 Haiku efficiently processes and categorizes data, making it useful for rapid data extraction and automated labeling tasks.
- Real-time content moderation - With strong reasoning skills and content understanding, Claude 3.5 Haiku provides fast, reliable content moderation for platforms that require immediate response times at scale.
Go to the Claude 3.5 Haiku model card
Claude 3 Opus
Anthropic's Claude 3 Opus is a powerful AI model with top-level performance on highly complex tasks. It can navigate open-ended prompts and sight-unseen scenarios with remarkable fluency and human-like understanding. Claude 3 Opus is optimized for the following use cases:
Task automation, such as interactive coding and planning, or running complex actions across APIs and databases.
Research and development tasks, such as research review, brainstorming and hypothesis generation, and product testing.
Strategy tasks, such as advanced analysis of charts and graphs, financials and market trends, and forecasting.
Vision tasks, such as processing images to return text output. Also, analysis of charts, graphs, technical diagrams, reports, and other visual content.
Go to the Claude 3 Opus model card
Claude 3 Haiku
Anthropic's Claude 3 Haiku is Anthropic's fastest vision and text model for near-instant responses to basic queries, meant for seamless AI experiences mimicking human interactions.
Live customer interactions and translations.
Content moderation to catch suspicious behavior or customer requests.
Cost-saving tasks, such as inventory management and knowledge extraction from unstructured data.
Vision tasks, such as processing images to return text output, analysis of charts, graphs, technical diagrams, reports, and other visual content.
Go to the Claude 3 Haiku model card
Claude 3.5 Sonnet
Anthropic's Claude 3.5 Sonnet outperforms Claude 3 Opus on a wide range of Anthropic's evaluations, with the speed and cost of Anthropic's mid-tier Claude 3 Sonnet. Claude 3.5 Sonnet is optimized for the following use cases:
Coding, such as writing, editing, and running code with sophisticated reasoning and troubleshooting capabilities.
Handle complex queries from customer support by understanding user context and orchestrating multi-step workflows.
Data science and analysis by navigating unstructured data and leveraging multiple tools to generate insights.
Visual processing, such as interpreting charts and graphs that require visual understanding.
Writing content with a more natural, human-like tone.
Go to the Claude 3.5 Sonnet model card
What's next
Learn how to use Anthropic's models.