Claude 3 Haiku

Anthropic's fastest vision and text model for near-instant responses to basic queries, meant for seamless AI experiences mimicking human interactions.

View model card in Model Garden

Model ID claude-3-haiku@20240307
Launch stage GA
Supported inputs & outputs
  • Inputs:
    Text, Code, Images
  • Outputs:
    Text
Token limits
  • Maximum input tokens: 200,000
  • Maximum output tokens: 8,000
Capabilities
Usage types
Technical specifications
Images
  • Limitation and specifications: See Vision in Anthropic's documentation
Documents
  • Limitation and specifications: See PDF support in Anthropic's documentation
Knowledge cutoff date August 2023
Versions
  • claude-3-haiku@20240307
    • Launch stage: Generally available
    • Release date: March 19, 2024
Supported regions

Model availability

(Includes fixed quota & Provisioned Throughput)

  • United States
    • us-east5
  • Europe
    • europe-west1
  • Asia pacific
    • asia-southeast1

ML processing

  • United States
    • Multi-region
  • Europe
    • Multi-region
  • Asia pacific
    • asia-southeast1
Quota limits

us-east5:

  • QPM: 245
  • TPM: 600,000 (input and output)
  • Context length: 200,000

europe-west1:

  • QPM: 75
  • TPM: 181,000 (input and output)
  • Context length: 200,000

asia-southeast1:

  • QPM: 70
  • TPM: 174,000 (input and output)
  • Context length: 200,000

Pricing See Pricing.