Each Vertex AI Generative AI image model is available in distinct
versions. The image models include generation and text models, such as
imagegeneration
and imagetext
. The versions differ by the features they
offer.
Image generation and editing
The following image generation model versions are available for image generation and image editing tasks.
Imagen 3 | Imagen 2 (v.006, default) | Imagen 2 (v.005) | Imagen (v.002) | |
---|---|---|---|---|
Model name and version |
Imagen 3: imagen-3.0-generate-001 Imagen 3 Fast: imagen-3.0-fast-generate-001 This is a low-latency model variant you can use for prototyping or low-latency use cases. Imagen 3 Customization and Editing: imagen-3.0-capability-001
|
imagegeneration@006 |
imagegeneration@005 |
imagegeneration@002 |
Image generation | General Availability | General Availability (approved users) | General Availability (approved users) | General Availability (approved users) |
Image Customization (few-shot learning) | General Availability (approved users)
|
Not supported | Not supported | Not supported |
Image editing |
|
|
Not supported. Feature support for this model is subject to future changes and updates. |
|
Digital watermarking and verification (image generation) | Supported (enabled by default) | Supported (enabled by default) | Supported (not enabled by default) | Supported (not enabled by default) |
User-configurable safety settings |
|
|
Not supported | Not supported |
Image upscaling | Not supported | Not supported | Not supported | Supported (2x or 4x) |
Aspect ratio and resolution |
|
|
|
|
Languages | General availability:
Preview:
|
General availability:
Preview:
|
General availability:
Preview:
|
General availability:
|
Model tuning | Not supported. Feature support for this model is subject to future changes and updates. |
Not supported | Not supported | Model tuning availability:
|
Visual captioning and Visual Question Answering (VQA)
The following imagetext
model versions are available for image
captioning and Visual Question Answering (VQA) tasks.
Model version:imagetext@001 (default) |
|
---|---|
Visual captioning (short-form) | Tokens: 32 (~32 words) Languages: English, Spanish, French, German, Italian |
Visual Question Answering, VQA (short-form) | Tokens (request): 80 Tokens (response): 64 Languages: English |
How to specify the version of an image model
To use a version of a vision model, add the three digit version number to the
model. For example, to specify an imagegeneration
model version, append the
model version (for example, @006
) to the model's name:
POST https://us-central1-aiplatform.googleapis.com/v1/projects/PROJECT_ID/locations/us-central1/publishers/google/models/imagegeneration@006:predict
POST https://us-central1-aiplatform.googleapis.com/v1/projects/PROJECT_ID/locations/us-central1/publishers/google/models/imagetext@001:predict
However, this format doesn't apply to Imagen 3 models.
You should always specify a stable model version for production use, as using the default option without a model version specified can lead to changes in model behavior.
If you omit the model version, Imagen uses the default model version:
POST https://us-central1-aiplatform.googleapis.com/v1/projects/PROJECT_ID/locations/us-central1/publishers/google/models/imagegeneration:predict
POST https://us-central1-aiplatform.googleapis.com/v1/projects/PROJECT_ID/locations/us-central1/publishers/google/models/imagetext:predict
As a best practice, always specify a model version. If you choose to use the model name without a model version, make sure it is in a testing environment.
What's next
- Learn how to send requests to specific models:
- Learn more about model parameters using the API reference:
- Image generation (
imagegeneration
model) - Image captions (
imagetext
model) - Visual question and answering (VQA) (
imagetext
model)
- Image generation (