Gemini 2.5 Flash

Caution: The gemini-2.0-flash-preview-image-generation and gemini-2.5-flash-image-preview models will be retired on October 31, 2025. Migrate any workflows to gemini-2.5-flash-image before that date to avoid service disruption.

Gemini 2.5 Flash is our best model in terms of price and performance, and offers well-rounded capabilities. Gemini 2.5 Flash is our first Flash model that features thinking capabilities, which lets you see the thinking process that the model goes through when generating its response.

For even more detailed technical information on Gemini 2.5 Flash (such as performance benchmarks, information on our training datasets, efforts on sustainability, intended usage and limitations, and our approach to ethics and safety), see our technical report on our Gemini 2.5 models.

2.5 Flash

Try in Vertex AI View in Model Garden (Preview) Deploy example app

Note: To use the "Deploy example app" feature, you need a Google Cloud project with billing and Vertex AI API enabled.

Technical specifications
Model ID	`gemini-2.5-flash`
Supported inputs & outputs	Inputs: Text, Code, Images, Audio, Video Outputs: Text
Token limits	Maximum input tokens: 1,048,576 Maximum output tokens: 65,535 (default)
Capabilities	Supported Grounding with Google Search Code execution Tuning System instructions Structured output Function calling Count Tokens Live API Preview feature Thinking Vertex AI RAG Engine Chat completions Not supported
Usage types	Supported Provisioned Throughput Dynamic shared quota Batch prediction Not supported Fixed quota
Input size limit	500 MB
	Images	Maximum images per prompt: 3,000 Maximum image size: 7 MB Supported MIME types: `image/png`, `image/jpeg`, `image/webp`
	Documents	Maximum number of files per prompt: 3,000 Maximum number of pages per file: 1,000 Maximum file size per file for the API or Cloud Storage imports: 50 MB Maximum file size per file for direct uploads through the console: 7 MB Supported MIME types: `application/pdf`, `text/plain`
	Video	Maximum video length (with audio): Approximately 45 minutes Maximum video length (without audio): Approximately 1 hour Maximum number of videos per prompt: 10 Supported MIME types: `video/x-flv`, `video/quicktime`, `video/mpeg`, `video/mpegs`, `video/mpg`, `video/mp4`, `video/webm`, `video/wmv`, `video/3gpp`
	Audio	Maximum audio length per prompt: Appropximately 8.4 hours, or up to 1 million tokens Maximum number of audio files per prompt: 1 Speech understanding for: Audio summarization, transcription, and translation Supported MIME types: `audio/x-aac`, `audio/flac`, `audio/mp3`, `audio/m4a`, `audio/mpeg`, `audio/mpga`, `audio/mp4`, `audio/ogg`, `audio/pcm`, `audio/wav`, `audio/webm`
	Parameter defaults	Temperature: 0.0-2.0 (default 1.0) topP: 0.0-1.0 (default 0.95) topK: 64 (fixed) candidateCount: 1–8 (default 1)
Supported regions
	Model availability (Includes dynamic shared quota & Provisioned Throughput)	Global global United States us-central1 us-east1 us-east4 us-east5 us-south1 us-west1 us-west4 Europe europe-central2 europe-north1 europe-southwest1 europe-west1 europe-west4 europe-west8
	ML processing	United States Multi-region Canada northamerica-northeast1⁺ Europe Multi-region europe-west2^{* +} europe-west3^{* +} europe-west9^{* +} Asia Pacific asia-northeast1^{* +} asia-northeast3^{* +} asia-south1^{* +} asia-southeast1⁺ australia-southeast1^{* +}
	See Data residency for more information.
Knowledge cutoff date	January 2025
Versions	`gemini-2.5-flash` Launch stage: GA Release date: June 17, 2025 Discontinuation date: June 17, 2026 `gemini-live-2.5-flash` Launch stage: Private GA Release date: June 17, 2025
Security controls
Security controls	See Security controls for more information.
Supported languages	See Supported languages.
Pricing	See Pricing.

+ Supervised fine tuning not supported
* Available for 128K context window only, supervised fine tuning not supported

2.5 Flash

Try in Vertex AI (Preview) Deploy example app

Note: To use the "Deploy example app" feature, you need a Google Cloud project with billing and Vertex AI API enabled.

Technical specifications
Model ID	`gemini-2.5-flash-preview-09-2025`
Supported inputs & outputs	Inputs: Text, Code, Images, Audio, Video Outputs: Text
Token limits	Maximum input tokens: 1,048,576 Maximum output tokens: 65,535 (default)
Capabilities	Supported Grounding with Google Search Code execution System instructions Structured output Function calling Count Tokens Live API Preview feature Thinking Vertex AI RAG Engine Chat completions Not supported Tuning
Usage types	Supported Provisioned Throughput Dynamic shared quota Not supported Fixed quota Batch prediction
	Images	Maximum images per prompt: 3,000 Maximum image size: 7 MB Supported MIME types: `image/png`, `image/jpeg`, `image/webp`
	Documents	Maximum number of files per prompt: 3,000 Maximum number of pages per file: 1,000 Maximum file size per file for the API or Cloud Storage imports: 50 MB Maximum file size per file for direct uploads through the console: 7 MB Supported MIME types: `application/pdf`, `text/plain`
	Video	Maximum video length (with audio): Approximately 45 minutes Maximum video length (without audio): Approximately 1 hour Maximum number of videos per prompt: 10 Supported MIME types: `video/x-flv`, `video/quicktime`, `video/mpeg`, `video/mpegs`, `video/mpg`, `video/mp4`, `video/webm`, `video/wmv`, `video/3gpp`
	Audio	Maximum audio length per prompt: Appropximately 8.4 hours, or up to 1 million tokens Maximum number of audio files per prompt: 1 Speech understanding for: Audio summarization, transcription, and translation Supported MIME types: `audio/x-aac`, `audio/flac`, `audio/mp3`, `audio/m4a`, `audio/mpeg`, `audio/mpga`, `audio/mp4`, `audio/ogg`, `audio/pcm`, `audio/wav`, `audio/webm`
	Parameter defaults	Temperature: 0.0-2.0 (default 1.0) topP: 0.0-1.0 (default 0.95) topK: 64 (fixed) candidateCount: 1–8 (default 1)
Supported regions
	Model availability (Includes dynamic shared quota & Provisioned Throughput)	Global global
	See Data residency for more information.
Knowledge cutoff date	January 2025
Versions	`gemini-2.5-flash-preview-09-2025` Launch stage: Public preview Release date: September 25, 2025
Security controls
Security controls	See Security controls for more information.
Supported languages	See Supported languages.
Pricing	See Pricing.