Gemini 2.0 Flash

Gemini 2.0 Flash delivers next-gen features and improved capabilities, including superior speed, built-in tool use, multimodal generation, and a 1M token context window.

Try in Vertex AI View model card in Model Garden (Preview) Deploy example app

Property Description
Model ID gemini-2.0-flash
Supported inputs & outputs
Inputs
Text, Code, Images, Audio, Video
Outputs
Text, Audio (private preview), Images (private preview)
Token limits
Maximum input tokens 1,048,576 (without Live API), 32,768 (with Live API)
Maximum output tokens 8,192
Capabilities
Usage types
Technical specifications
Images
  • Maximum images per prompt: 3,000
  • Maximum image size: 7 MB
  • Supported MIME types:
    image/png, image/jpeg, image/webp
Documents
  • Maximum number of files per prompt: 3,000
  • Maximum number of pages per file: 1,000
  • Maximum file size per file: 50 MB
  • Supported MIME types:
    application/pdf, text/plain
Video
  • Maximum video length (with audio): Approximately 45 minutes
  • Maximum video length (without audio): Approximately 1 hour
  • Maximum number of videos per prompt: 10
  • Supported MIME types:
    video/x-flv, video/quicktime, video/mpeg, video/mpegs, video/mpgs, video/mpg, video/mp4, video/webm, video/wmv, video/3gpp
Audio
  • Maximum audio length per prompt: Appropximately 8.4 hours, or up to 1 million tokens
  • Maximum number of audio files per prompt: 1
  • Speech understanding for: Audio summarization, transcription, and translation
  • Supported MIME types:
    audio/x-aac, audio/flac, audio/mp3, audio/m4a, audio/mpeg, audio/mpga, audio/mp4, audio/opus, audio/pcm, audio/wav, audio/webm
Parameter defaults
  • Temperature: 0-2
  • topP: 0.95
  • topK: 64
  • candidateCount: 1-8
Knowledge cutoff date June 2024
Versions
  • gemini-2.0-flash-live-preview-04-09
    • Launch stage: Public preview
    • Release date: April 9, 2025
    • Discontinuation date: April 9, 2026
  • gemini-2.0-flash-001
    • Launch stage: Generally available
    • Release date: February 5, 2025
    • Discontinuation date: February 5, 2026
Supported regions

Model availability

(Includes dynamic shared quota & Provisioned Throughput)

  • Global
    • global
  • United States
    • us-central1
    • us-east1
    • us-east4
    • us-east5
    • us-south1
    • us-west1
    • us-west4
  • Europe
    • europe-central2
    • europe-north1
    • europe-southwest1
    • europe-west1
    • europe-west4
    • europe-west8
    • europe-west9

ML processing

  • United States
    • Multi-region
  • Europe
    • Multi-region
See Data residency for more information.
Security controls
Online prediction
  • Data residency (at rest) Supported
  • Customer-managed encryption keys (CMEK) Supported
  • VPC Service Controls Supported
  • Access Transparency (AXT) Supported
Batch prediction
  • Data residency (at rest) Supported
  • Customer-managed encryption keys (CMEK) Not supported
  • VPC Service Controls Supported
  • Access Transparency (AXT) Not supported
Tuning
  • Data residency (at rest) Supported
  • Customer-managed encryption keys (CMEK) Supported
  • VPC Service Controls Supported
  • Access Transparency (AXT) Not supported
See Security controls for more information.
Pricing See Pricing.

Get started with Gemini 2.0 Flash

You can try Gemini 2.0 Flash in the Google Cloud console, or run our sample application:

Try in Vertex AI (Preview) Deploy example app