Claude 3.5 Haiku

Claude 3.5 Haiku, the next generation of Anthropic's fastest and most cost-effective model, is optimal for use cases where speed and affordability matter.

Try in Vertex AI View model card in Model Garden

Model ID claude-3-5-haiku@20241022
Launch stage GA
Supported inputs & outputs
  • Inputs:
    Text, Code, Images
  • Outputs:
    Text
Token limits
  • Maximum input tokens: 200,000
  • Maximum output tokens: 8,000
Capabilities
Usage types
Technical specifications
Images
  • Limitation and specifications: See Vision in Anthropic's documentation
Documents
  • Limitation and specifications: See PDF support in Anthropic's documentation
Knowledge cutoff date July 2024
Versions
  • claude-3-5-haiku@20241022
    • Launch stage: Generally available
    • Release date: October 22, 2024
Supported regions

Model availability

(Includes fixed quota & Provisioned Throughput)

  • United States
    • us-east5
  • Europe
    • europe-west1

ML processing

  • United States
    • Multi-region
  • Europe
    • Multi-region
Quota limits

us-east5:

  • QPM: 80
  • TPM: 350,000 (input and output)
  • Context length: 200,000

europe-west1:

  • QPM: 90
  • TPM: 400,000 (input and output)
  • Context length: 200,000

Pricing See Pricing.