This page provides an overview of Gemini 2.0 Flash-Lite, our fastest and most cost-efficient Flash model. It's an upgrade path for 1.5 Flash users who want better quality for the same price and speed. This page covers the following topics:
Try in Vertex AI View in Model Garden (Preview) Deploy example app
Model availability (Includes dynamic shared quota & Provisioned Throughput) ML processing
Model ID
gemini-2.0-flash-lite
Supported inputs & outputs
Token limits
Capabilities
Usage types
Input size limit
500 MB
Technical specifications
Images
image/png
,
image/jpeg
,
image/webp
Documents
application/pdf
,
text/plain
Video
video/x-flv
,
video/quicktime
,
video/mpeg
,
video/mpegs
,
video/mpg
,
video/mp4
,
video/webm
,
video/wmv
,
video/3gpp
Audio
audio/x-aac
,
audio/flac
,
audio/mp3
,
audio/m4a
,
audio/mpeg
,
audio/mpga
,
audio/mp4
,
audio/opus
,
audio/pcm
,
audio/wav
,
audio/webm
Parameter defaults
Supported regions
See Data residency for more information.
Knowledge cutoff date
June 2024
Versions
gemini-2.0-flash-lite-001
Security controls
Online prediction
Batch prediction
Tuning
See Security controls for more information.
Pricing
See Pricing.
Gemini 2.0 Flash-Lite
Except as otherwise noted, the content of this page is licensed under the Creative Commons Attribution 4.0 License, and code samples are licensed under the Apache 2.0 License. For details, see the Google Developers Site Policies. Java is a registered trademark of Oracle and/or its affiliates.
Last updated 2025-08-18 UTC.