Gemini 2.0 Flash

Gemini 2.0 Flash delivers next-gen features and improved capabilities, including superior speed, built-in tool use, multimodal generation, and a 1M token context window.

Try in Vertex AI View model card in Model Garden (Preview) Deploy example app

Note: To use the "Deploy example app" feature, you need a Google Cloud project with billing and Vertex AI API enabled.
Model ID gemini-2.0-flash
Supported inputs & outputs
  • Inputs:
    Text, Code, Images, Audio, Video
  • Outputs:
    Text, Audio (private preview), Images (private preview)
Token limits
  • Maximum input tokens: 1,048,576 (without Live API), 32,768 (with Live API)
  • Maximum output tokens: 8,192
Capabilities
Usage types
Technical specifications
Images
  • Maximum images per prompt: 3,000
  • Maximum image size: 7 MB
  • Maximum tokens per minute (TPM) per project:
    • High/Medium/Default media resolution:
      • US/Asia: 40 M
      • EU: 10 M
    • Low media resolution:
      • US/Asia: 10 M
      • EU: 3 M
  • Supported MIME types:
    image/png, image/jpeg, image/webp
Documents
  • Maximum number of files per prompt: 3,000
  • Maximum number of pages per file: 1,000
  • Maximum file size per file: 50 MB
  • Maximum tokens per minute (TPM) per project1:
    • High/Medium/Default media resolution:
      • US/Asia: 3.3 M
      • EU: 3.3 M
    • Low media resolution:
      • US/Asia: 179 K
      • EU: 45 K
  • Supported MIME types:
    application/pdf, text/plain
Video
  • Maximum video length (with audio): Approximately 45 minutes
  • Maximum video length (without audio): Approximately 1 hour
  • Maximum number of videos per prompt: 10
  • Maximum tokens per minute (TPM):
    • High/Medium/Default media resolution:
      • US/Asia: 37.9 M
      • EU: 9.5 M
    • Low media resolution:
      • US/Asia: 1 G
      • EU: 2.5 M
  • Supported MIME types:
    video/x-flv, video/quicktime, video/mpeg, video/mpegs, video/mpg, video/mp4, video/webm, video/wmv, video/3gpp
Audio
  • Maximum audio length per prompt: Appropximately 8.4 hours, or up to 1 million tokens
  • Maximum number of audio files per prompt: 1
  • Speech understanding for: Audio summarization, transcription, and translation
  • Maximum tokens per minute (TPM):
    • US/Asia: 1.7 M
    • EU: 0.4 M
  • Supported MIME types:
    audio/x-aac, audio/flac, audio/mp3, audio/m4a, audio/mpeg, audio/mpga, audio/mp4, audio/opus, audio/pcm, audio/wav, audio/webm
Parameter defaults
  • Temperature: 0-2
  • topP: 0.95
  • topK: 64 (fixed)
  • candidateCount: 1-8
Knowledge cutoff date June 2024
Versions
  • gemini-2.0-flash-live-preview-04-09
    • Launch stage: Public preview
    • Release date: April 9, 2025
    • Discontinuation date: April 9, 2026
  • gemini-2.0-flash-001
    • Launch stage: Generally available
    • Release date: February 5, 2025
    • Discontinuation date: February 5, 2026
Supported regions

Model availability

(Includes dynamic shared quota & Provisioned Throughput)

  • Global
    • global
  • United States
    • us-central1
    • us-east1
    • us-east4
    • us-east5
    • us-south1
    • us-west1
    • us-west4
  • Europe
    • europe-central2
    • europe-north1
    • europe-southwest1
    • europe-west1
    • europe-west4
    • europe-west8
    • europe-west9

ML processing

  • United States
    • Multi-region
  • Europe
    • Multi-region
See Data residency for more information.
Security controls
Online prediction
  • Data residency (at rest) Supported
  • Customer-managed encryption keys (CMEK) Supported
  • VPC Service Controls Supported
  • Access Transparency (AXT) Supported
Batch prediction
  • Data residency (at rest) Supported
  • Customer-managed encryption keys (CMEK) Not supported
  • VPC Service Controls Supported
  • Access Transparency (AXT) Not supported
Tuning
  • Data residency (at rest) Supported
  • Customer-managed encryption keys (CMEK) Supported
  • VPC Service Controls Supported
  • Access Transparency (AXT) Not supported
See Security controls for more information.
Pricing See Pricing.