Configure aspect ratio

This page describes how to configure the aspect ratio that Imagen on Vertex AI generates images for.

Depending on how you plan to use your generated images, some aspect ratios may work better than others. Choose the aspect ratio that best suits your use case.

There are multiple image generation models that you can use, and certain aspect ratios are available to specific Imagen models. For more information, see Imagen models.

Aspect ratio	Intended use	Sample image
`1:1`	default, square, general use	^{Prompt: overhead shot of a pasta dinner, studio photo in the style of food magazine cover.}
`3:4`	TV, media, film	^{Prompt: commercial photoshoot, fragrance ad, lavender vanilla scented bottle on a light colored background.}
`4:3`	TV, media, film	^{Prompt: commercial photoshoot, green and gray high top sneakers, 4k, dramatic angles.}
`9:16`	portrait, tall objects, mobile devices	^{Prompt: nature photography, a beach in hawaii with the ocean in the background, lens flare, sunset.}
`16:9`	landscape	^{Prompt: skyscrapers in new york city, futuristic rendering, concept, digital art.}

Console

In the Google Cloud console, go to the Vertex AI > Media Studio page.

Go to Media Studio
Click Imagen. The Imagen Media Studio image generation page is displayed.
In the Settings panel, adjust the following options:
- Model: choose a model from the available options.
  
  For more information about available models, see Imagen models
- Aspect ratio: The aspect ratio to use when generating images
In the Write your prompt box, enter your text prompt that describes the images to generate. For example, small boat on water in the morning watercolor illustration.
Click Generate.

REST

Aspect ratio is an optional field in the parameters object of a JSON request body.

Before using any of the request data, make the following replacements:

REGION: The region that your project is located in. For more information about supported regions, see Generative AI on Vertex AI locations.
PROJECT_ID: Your Google Cloud project ID.
MODEL_VERSION: The Imagen model version to use. For more information about available models, see Imagen models.
TEXT_PROMPT: The text prompt that guides what images the model generates. This field is required for both generation and editing.
IMAGE_COUNT: The number of images to generate. The accepted range of values is 1 to 4.

Additional optional parameters

Use the following optional variables depending on your use case. Add some or all of the following parameters in the "parameters": {} object. This list shows common optional parameters and isn't meant to be exhaustive. For more information about optional parameters, see Imagen API reference: Generate images.

"parameters": {
  "sampleCount": IMAGE_COUNT,
  "addWatermark": ADD_WATERMARK,
  "aspectRatio": "ASPECT_RATIO",
  "enhancePrompt": ENABLE_PROMPT_REWRITING,
  "includeRaiReason": INCLUDE_RAI_REASON,
  "includeSafetyAttributes": INCLUDE_SAFETY_ATTRIBUTES,
  "outputOptions": {
    "mimeType": "MIME_TYPE",
    "compressionQuality": COMPRESSION_QUALITY
  },
  "personGeneration": "PERSON_SETTING",
  "safetySetting": "SAFETY_SETTING",
  "seed": SEED_NUMBER,
  "storageUri": "OUTPUT_STORAGE_URI"
}

ADD_WATERMARK: boolean. Optional. Whether to enable a watermark for generated images. Any image generated when the field is set to true contains a digital SynthID that you can use to verify a watermarked image. If you omit this field, the default value of true is used; you must set the value to false to disable this feature. You can use the seed field to get deterministic output only when this field is set to false.
ASPECT_RATIO: string. Optional. A generation mode parameter that controls aspect ratio. Supported ratio values and their intended use:
- 1:1 (default, square)
- 3:4 (Ads, social media)
- 4:3 (TV, photography)
- 16:9 (landscape)
- 9:16 (portrait)
ENABLE_PROMPT_REWRITING: boolean. Optional. A parameter to use an LLM-based prompt rewriting feature to deliver higher quality images that better reflect the original prompt's intent. Disabling this feature may impact image quality and prompt adherence. Default value: true.
INCLUDE_RAI_REASON: boolean. Optional. Whether to enable the Responsible AI filtered reason code in responses with blocked input or output. Default value: true.
INCLUDE_SAFETY_ATTRIBUTES: boolean. Optional. Whether to enable rounded Responsible AI scores for a list of safety attributes in responses for unfiltered input and output. Safety attribute categories: "Death, Harm & Tragedy", "Firearms & Weapons", "Hate", "Health", "Illicit Drugs", "Politics", "Porn", "Religion & Belief", "Toxic", "Violence", "Vulgarity", "War & Conflict". Default value: false.
MIME_TYPE: string. Optional. The MIME type of the content of the image. Available values:
- image/jpeg
- image/gif
- image/png
- image/webp
- image/bmp
- image/tiff
- image/vnd.microsoft.icon
COMPRESSION_QUALITY: integer. Optional. Only applies to JPEG output files. The level of detail the model preserves for images generated in JPEG file format. Values: 0 to 100, where a higher number means more compression. Default: 75.
PERSON_SETTING: string. Optional. The safety setting that controls the type of people or face generation the model allows. Available values:
- allow_adult (default): Allow generation of adults only, except for celebrity generation. Celebrity generation is not allowed for any setting.
- dont_allow: Disable the inclusion of people or faces in generated images.
SAFETY_SETTING: string. Optional. A setting that controls safety filter thresholds for generated images. Available values:
- block_low_and_above: The highest safety threshold, resulting in the largest amount of generated images that are filtered. Previous value: block_most.
- block_medium_and_above (default): A medium safety threshold that balances filtering for potentially harmful and safe content. Previous value: block_some.
- block_only_high: A safety threshold that reduces the number of requests blocked due to safety filters. This setting might increase objectionable content generated by Imagen. Previous value: block_few.
SEED_NUMBER: integer. Optional. Any non-negative integer you provide to make output images deterministic. Providing the same seed number always results in the same output images. If the model you're using supports digital watermarking, you must set "addWatermark": false to use this field. Accepted integer values: 1 - 2147483647.
OUTPUT_STORAGE_URI: string. Optional. The Cloud Storage bucket to store the output images. If not provided, base64-encoded image bytes are returned in the response. Sample value: gs://image-bucket/output/.

HTTP method and URL:

POST https://REGION-aiplatform.googleapis.com/v1/projects/PROJECT_ID/locations/REGION/publishers/google/models/MODEL_VERSION:predict

Request JSON body:

{
  "instances": [
    {
      "prompt": "TEXT_PROMPT"
    }
  ],
  "parameters": {
    "sampleCount": IMAGE_COUNT
  }
}

To send your request, choose one of these options:

curl

Note: The following command assumes that you have logged in to the gcloud CLI with your user account by running gcloud init or gcloud auth login , or by using Cloud Shell, which automatically logs you into the gcloud CLI . You can check the currently active account by running gcloud auth list.

Save the request body in a file named request.json, and execute the following command:

curl -X POST \
     -H "Authorization: Bearer $(gcloud auth print-access-token)" \
     -H "Content-Type: application/json; charset=utf-8" \
     -d @request.json \
     "https://REGION-aiplatform.googleapis.com/v1/projects/PROJECT_ID/locations/REGION/publishers/google/models/MODEL_VERSION:predict"

PowerShell

Note: The following command assumes that you have logged in to the gcloud CLI with your user account by running gcloud init or gcloud auth login . You can check the currently active account by running gcloud auth list.

Save the request body in a file named request.json, and execute the following command:

$cred = gcloud auth print-access-token
$headers = @{ "Authorization" = "Bearer $cred" }

Invoke-WebRequest `
    -Method POST `
    -Headers $headers `
    -ContentType: "application/json; charset=utf-8" `
    -InFile request.json `
    -Uri "https://REGION-aiplatform.googleapis.com/v1/projects/PROJECT_ID/locations/REGION/publishers/google/models/MODEL_VERSION:predict" | Select-Object -Expand Content

The following sample response is for a request with

"sampleCount":
  2

. The response returns two prediction objects, with the generated image bytes base64-encoded.

{
  "predictions": [
    {
      "bytesBase64Encoded": "BASE64_IMG_BYTES",
      "mimeType": "image/png"
    },
    {
      "mimeType": "image/png",
      "bytesBase64Encoded": "BASE64_IMG_BYTES"
    }
  ]
}

If you use a model that supports prompt enhancement, the response includes an additional prompt field with the enhanced prompt used for generation:

{
  "predictions": [
    {
      "mimeType": "MIME_TYPE",
      "prompt": "ENHANCED_PROMPT_1",
      "bytesBase64Encoded": "BASE64_IMG_BYTES_1"
    },
    {
      "mimeType": "MIME_TYPE",
      "prompt": "ENHANCED_PROMPT_2",
      "bytesBase64Encoded": "BASE64_IMG_BYTES_2"
    }
  ]
}

Configure aspect ratio

Console

REST

curl

PowerShell

What's next