Anthropic's Claude models

The Anthropic Claude models on Vertex AI offer fully managed and serverless models as APIs. To use a Claude model on Vertex AI, send a request directly to the Vertex AI API endpoint. Because the Anthropic Claude models use a managed API, there's no need to provision or manage infrastructure.

You can stream your Claude responses to reduce the end-user latency perception. A streamed response uses server-sent events (SSE) to incrementally stream the response.

You pay for Claude models as you use them (pay as you go), or you pay a fixed fee when using provisioned throughput. For pay-as-you-go pricing, see Anthropic Claude models on the Vertex AI pricing page.

Available Claude models

The following models are available from Anthropic to use in Vertex AI. To access a Claude model, go to its Model Garden model card.

Anthropic's Claude models support Vertex AI request-response logging. Enable 30-day request-response logging of your prompt and completion activity to track any model misuse by your users. For more information, see Log requests and responses.

Claude Sonnet 4.5

Claude Sonnet 4.5 is Anthropic's latest Sonnet-class model for powering real-world agents, with industry leading capabilities around coding, computer use, cybersecurity, and working with office files like spreadsheets.

Long-running agents: Power production-ready assistants for multi-step, real-time applications, from customer support automation to complex operational workflows that require peak accuracy, intelligence, and speed.
Coding: Handle everyday development tasks with enhanced performance - or plan and execute complex software projects spanning hours or days - with the ability to save, maintain, and reference information across multiple sessions.
Cybersecurity: Deploy agents that autonomously patch vulnerabilities before exploitation, shifting from reactive detection to proactive defense.
Financial analysis: Conduct entry-level financial analysis, deliver advanced predictive analysis, or preemptively develop intelligent risk management strategies that leverage best-in-class domain knowledge.
Computer use: Anthropic's most accurate model for computer use, enabling developers to direct the model to use computers the way people do.
Business tasks: Generate and edit office files like slides, documents, and spreadsheets with minimal input.
Research: Perform focused analysis across multiple data sources, turning expert analysis into final deliverables. Ideal for complex problem solving, rapid business intelligence, and real-time decision support.

Go to the Claude Sonnet 4.5 model card

Claude Opus 4.1

Claude Opus 4.1 is Anthropic's latest Opus-class model and an industry leader for coding and agent capabilities, especially agentic search. It excels for customers needing frontier intelligence:

AI agents: Enable AI agents to complete complex, multi-step tasks with precision and reliability.
Agentic search and analysis: Connect to multiple data sources to synthesize information and insights across different repositories.
Expert-level coding: Plan and execute complex coding tasks end-to-end, maintaining high-quality code that is consistent with your style.
Virtual collaboration: Use the sustained reasoning capabilities to unlock new use cases involving long-horizon tasks and long chains of actions.
Content creation: Generate content with human-quality, natural prose. Create long-form content, technical documentation, marketing copy, and front-end design mockups.
Long context and memory: Incorporates memory capabilities that allow it to effectively summarize and reference previous interactions.

Go to the Claude Opus 4.1 model card

Claude Haiku 4.5

Claude Haiku 4.5 delivers near-frontier performance for a wide range of use cases, and stands out as one of the best coding models in the world—with the right speed and cost to power free products and high-volume user experiences.

Free tier user experiences: Claude Haiku 4.5 delivers near-frontier performance at a cost and speed that makes free agent products and agentic use cases economically viable at scale.
Latency-sensitive experiences: Claude Haiku 4.5's speed is ideal for real-time applications like customer service agents and chatbots where response time is critical.
Coding sub-agents: Use Claude Haiku 4.5 to power sub-agents, enabling multi-agent systems that tackle complex refactors, migrations, and large feature builds with quality and speed.
Financial analysis: Use Claude Haiku 4.5 to monitor thousands of data streams—tracking regulatory changes, market signals, and portfolio risks to preemptively adapt compliance and trading systems at previously impossible scales.
Research sub-agents: Perform parallel analyses across multiple data sources while maintaining fast response times. Ideal for rapid business intelligence, competitive analysis, and real-time decision support.
Business tasks: Claude Haiku 4.5 is capable of producing and editing office files like slides, documents, and spreadsheets. It also better supports strategy and campaign planning, business analysis and brainstorming.

Go to the Claude Haiku 4.5 model card

Claude Opus 4

Claude Opus 4 is a state-of-the-art model for coding and agent capabilities, especially agentic search. It excels for customers needing frontier intelligence:

Advanced coding: Independently plan and execute complex development tasks end-to-end. It adapts to your style and maintains high code quality throughout.
Long-horizon tasks and complex problem solving (virtual collaborator): Unlock new use cases that involves long-horizon tasks that require memory, sustained reasoning, and long chains of actions.
AI agents: Enable agents to tackle complex, multi-step tasks that require peak accuracy.
Agentic search and research: Connect to multiple data sources to synthesize comprehensive insights across repositories.
Content creation: Create human-quality content with natural prose. Produce long-form creative content, technical documentation, marketing copy, and frontend design mockups.
Memory and context management: Incorporates memory capabilities that allow it to effectively summarize and reference previous interactions.

Go to the Claude Opus 4 model card

Claude Sonnet 4

Claude Sonnet 4 balances impressive performance for coding with the right speed and cost for high-volume use cases:

Coding: Handle everyday development tasks with enhanced performance—power code reviews, bug fixes, API integrations, and feature development with immediate feedback loops.
AI Assistants: Power production-ready assistants for real-time applications—from customer support automation to operational workflows that require both intelligence and speed.
Efficient research: Perform focused analysis across multiple data sources while maintaining fast response times. Ideal for rapid business intelligence, competitive analysis, and real-time decision support.
Large-scale content: Generate and analyze content at scale with improved quality. Create customer communications, analyze user feedback, and produce marketing materials with the right balance of quality and throughput.

Go to the Claude Sonnet 4 model card

Claude 3.7 Sonnet

Claude 3.7 Sonnet is Anthropic's most intelligent model to date and the first Claude model to offer extended thinking—the ability to solve complex problems with careful, step-by-step reasoning. Claude 3.7 Sonnet is a single model where you can balance speed and quality by choosing between standard thinking for near-instant responses or extended thinking for advanced reasoning.

For more information about extended thinking, see Anthropic's documentation.

Claude 3.7 Sonnet is optimized for the following use cases:

Agentic coding - Claude 3.7 Sonnet is state-of-the-art for agentic coding, and can complete tasks across the entire software development lifecycle—from initial planning to bug fixes, maintenance to large refactors. It offers strong performance in both planning and solving for complex coding tasks, making Claude 3.7 Sonnet an ideal choice to power end-to-end software development processes.
Customer-facing agents - Claude 3.7 Sonnet offers superior instruction following, tool selection, error correction, and advanced reasoning for customer-facing agents and complex AI workflows.
Computer use - Claude 3.7 Sonnet is our most accurate model for computer use, enabling developers to direct Claude to use computers the way people do.
Content generation and analysis - Claude 3.7 Sonnet excels at writing and is able to understand nuance and tone in content to generate more compelling content and analyze content on a deeper level.
Visual data extraction - With Claude 3.7 Sonnet's robust vision skills, it is the right choice for teams that want to extract raw data from visuals like charts or graphs as part of their AI workflow.

Go to the Claude 3.7 Sonnet model card

Claude 3.5 Haiku

Claude 3.5 Haiku is optimized for use cases where speed and affordability matter. It improves on its predecessor across every skill set. Claude 3.5 Haiku is optimized for the following use cases:

Code completions - With its rapid response time and understanding of programming patterns, Claude 3.5 Haiku excels at providing quick, accurate code suggestions and completions in real-time development workflows.
Interactive chat bots - Claude 3.5 Haiku's improved reasoning and natural conversation abilities make it ideal for creating responsive, engaging chatbots that can handle high volumes of user interactions efficiently.
Data extraction and labeling - Leveraging its improved analysis skills, Claude 3.5 Haiku efficiently processes and categorizes data, making it useful for rapid data extraction and automated labeling tasks.
Real-time content moderation - With strong reasoning skills and content understanding, Claude 3.5 Haiku provides fast, reliable content moderation for platforms that require immediate response times at scale.

Go to the Claude 3.5 Haiku model card

Claude 3 Haiku

Anthropic's Claude 3 Haiku is Anthropic's fastest vision and text model for near-instant responses to basic queries, meant for seamless AI experiences mimicking human interactions.

Live customer interactions and translations.
Content moderation to catch suspicious behavior or customer requests.
Cost-saving tasks, such as inventory management and knowledge extraction from unstructured data.
Vision tasks, such as processing images to return text output, analysis of charts, graphs, technical diagrams, reports, and other visual content.

Go to the Claude 3 Haiku model card

What's next

Learn how to use Anthropic's models.