This document provides an overview of Gemini 2.5 Flash-Lite, our most balanced Gemini model, optimized for low-latency use cases. Key features include: For even more detailed technical information on Gemini 2.5 Flash-Lite (such as
performance benchmarks, information on our training datasets, efforts on
sustainability, intended usage and limitations, and our approach to ethics and
safety), see our technical
report
on our Gemini 2.5 models.
Try in Vertex AI (Preview) Deploy example app
Model availability ML processing
Model ID
gemini-2.5-flash-lite
Supported inputs & outputs
Token limits
Capabilities
Usage types
Input size limit
500 MB
Technical specifications
Images
image/png
,
image/jpeg
,
image/webp
Documents
application/pdf
,
text/plain
Video
video/x-flv
,
video/quicktime
,
video/mpeg
,
video/mpegs
,
video/mpg
,
video/mp4
,
video/webm
,
video/wmv
,
video/3gpp
Audio
audio/x-aac
,
audio/flac
,
audio/mp3
,
audio/m4a
,
audio/mpeg
,
audio/mpga
,
audio/mp4
,
audio/opus
,
audio/pcm
,
audio/wav
,
audio/webm
Parameter defaults
Supported regions
See Data residency for more information.
Knowledge cutoff date
January 2025
Versions
gemini-2.5-flash-lite
gemini-2.5-flash-lite-preview-06-17
Security controls
See Security controls for more information.
Pricing
See Pricing.
Gemini 2.5 Flash-Lite
Except as otherwise noted, the content of this page is licensed under the Creative Commons Attribution 4.0 License, and code samples are licensed under the Apache 2.0 License. For details, see the Google Developers Site Policies. Java is a registered trademark of Oracle and/or its affiliates.
Last updated 2025-08-26 UTC.