此页面由 Cloud Translation API 翻译。

使用文本提示修改图片

注意：自 2025 年 6 月 24 日起，Imagen 版本 1 和 2 将被弃用。Imagen 模型 imagegeneration@002、imagegeneration@005 和 imagegeneration@006 将于 2025 年 9 月 24 日移除。如需详细了解如何迁移到 Imagen 3，请参阅迁移到 Imagen 3。

本页介绍了如何仅使用文本提示进行无蒙版修改。借助无蒙版修改，您可以在不使用蒙版的情况下修改图片。这种修改方法非常适合与整个图片相关的修改，或者修改位置对于您的使用场景并不重要。

无蒙版（整个图片）修改示例

您只能使用文本提示修改基础图片（生成或上传）。您无需指定要修改的区域，更新会应用于整个图片（也称为无蒙版修改）。

如需进行无蒙版修改，请使用描述您要查看的提示，而不是更改内容的说明。例如，假设您要将现有的一张猫的图片更改为狗的图片。无蒙版修改提示“一只狗”可能比“把猫变成狗”更有效。同样，设想一下，您使用“海滩上的猫”提示生成的图片。如需更改此图片，请使用修改提示“海滩上的狗”。

原始的猫图片，旁边是编辑狗图片 — ^{原始图片（左侧）：Cédric VT on
Unsplash。

经过修改的图片（右侧）：使用 Imagen on Vertex AI 生成的图片，其中包含原始基础映像和提示：狗。}

查看 Imagen for Editing and Customization 模型卡片

准备工作

Sign in to your Google Cloud account. If you're new to Google Cloud, create an account to evaluate how our products perform in real-world scenarios. New customers also get $300 in free credits to run, test, and deploy workloads.

In the Google Cloud console, on the project selector page, select or create a Google Cloud project.

Go to project selector

Make sure that billing is enabled for your Google Cloud project.

Enable the Vertex AI API.

Enable the API

In the Google Cloud console, on the project selector page, select or create a Google Cloud project.

Go to project selector

Make sure that billing is enabled for your Google Cloud project.

Enable the Vertex AI API.

Enable the API

为您的环境设置身份验证。

Select the tab for how you plan to use the samples on this page:
Console

When you use the Google Cloud console to access Google Cloud services and APIs, you don't need to set up authentication.
Python

如需在本地开发环境中使用本页面上的 Python 示例，请安装并初始化 gcloud CLI，然后使用您的用户凭据设置应用默认凭据。
1. Install the Google Cloud CLI.
2. If you're using an external identity provider (IdP), you must first sign in to the gcloud CLI with your federated identity.
3. To initialize the gcloud CLI, run the following command:
  gcloud init
4. If you're using a local shell, then create local authentication credentials for your user account:
  gcloud auth application-default login
  You don't need to do this if you're using Cloud Shell.
  
  If an authentication error is returned, and you are using an external identity provider (IdP), confirm that you have signed in to the gcloud CLI with your federated identity.
Google Cloud
REST

如需在本地开发环境中使用本页面上的 REST API 示例，请使用您提供给 gcloud CLI 的凭证。
如需了解详情，请参阅 Google Cloud 身份验证文档中的使用 REST 时进行身份验证。
使用无蒙版修改

使用以下示例修改没有蒙版区域的整张图片。
控制台
1. 在 Google Cloud 控制台中，前往 Vertex AI > Media Studio 页面。
  前往媒体工作室
2. 在下层任务面板中，点击修改图片。
3. 转到 修改图片屏幕。
  
  修改生成的图片
  
  使用文本提示生成图片
  
  点击生成的图片。
  
  点击修改图片。
  
  修改上传的图片
  
  点击上传图片。
  
  选择要修改的本地文件。
4. 输入修改图片的新提示。
5. 可选。修改所有参数。
6. 点击生成。
  
  ^{通过提示符使用 Imagen on Vertex AI 修改的图片的修改图片视图：抹茶蛋糕。原始图片显示在右上角。原始图片来源：
  David Holifield
  on Unsplash
  （显示在 Google Cloud 控制台中）。}
Python

如需了解如何安装或更新 Vertex AI SDK for Python，请参阅安装 Vertex AI SDK for Python。如需了解详情，请参阅 Python API 参考文档。

在此示例中，您将使用 load_from_file 方法引用本地文件作为要修改的基础 Image。指定基础图片后，您可以对 ImageGenerationModel 使用 edit_image 方法，并在本地保存修改后的图片。然后，您可以选择使用笔记本中的 show() 方法显示修改后的图片。
import vertexai from vertexai.preview.vision_models import Image, ImageGenerationModel # TODO(developer): Update and un-comment below lines # PROJECT_ID = "your-project-id" # input_file = "input-image.png" # output_file = "output-image.png" # prompt = "" # The text prompt describing what you want to see. vertexai.init(project=PROJECT_ID, location="us-central1") model = ImageGenerationModel.from_pretrained("imagegeneration@002") base_img = Image.load_from_file(location=input_file) images = model.edit_image( base_image=base_img, prompt=prompt, # Optional parameters seed=1, # Controls the strength of the prompt. # -- 0-9 (low strength), 10-20 (medium strength), 21+ (high strength) guidance_scale=21, number_of_images=1, ) images[0].save(location=output_file, include_generation_parameters=False) # Optional. View the edited image in a notebook. # images[0].show() print(f"Created output image using {len(images[0]._image_bytes)} bytes") # Example response: # Created output image using 1234567 bytes
REST

在使用任何请求数据之前，请先进行以下替换：
- PROJECT_ID：您的 Google Cloud 项目 ID。
- LOCATION：您的项目的区域。例如 us-central1、europe-west2 或 asia-northeast3。如需查看可用区域的列表，请参阅 Vertex AI 上的生成式 AI 位置。
- TEXT_PROMPT：用于指导模型生成什么图片的文本提示。生成和修改都需要此字段。
- B64_BASE_IMAGE：要修改或放大的基础图片。图片必须指定为 base64 编码的字节字符串。大小上限：10 MB。
- EDIT_IMAGE_COUNT：已修改图片的数量。默认值：4。
HTTP 方法和网址：
POST https://LOCATION-aiplatform.googleapis.com/v1/projects/PROJECT_ID/locations/LOCATION/publishers/google/models/imagegeneration@002:predict
请求 JSON 正文：
{ "instances": [ { "prompt": "TEXT_PROMPT", "image": { "bytesBase64Encoded": "B64_BASE_IMAGE" } } ], "parameters": { "sampleCount": EDIT_IMAGE_COUNT } }
如需发送请求，请选择以下方式之一：
curl

注意：以下命令假定您已使用您的用户账号通过运行 gcloud init 或 gcloud auth login 登录 gcloud CLI，或者使用了 Cloud Shell，这会使您自动登录 gcloud CLI。您可以运行 gcloud auth list 来检查当前活跃的账号。

将请求正文保存在名为 request.json 的文件中，然后执行以下命令：

curl -X POST \
-H "Authorization: Bearer $(gcloud auth print-access-token)" \
-H "Content-Type: application/json; charset=utf-8" \
-d @request.json \
"https://LOCATION-aiplatform.googleapis.com/v1/projects/PROJECT_ID/locations/LOCATION/publishers/google/models/imagegeneration@002:predict"

PowerShell

注意：以下命令假定您已使用您的用户账号通过运行 gcloud init 或 gcloud auth login 登录 gcloud CLI。您可以运行 gcloud auth list 来检查当前活跃的账号。

将请求正文保存在名为 request.json 的文件中，然后执行以下命令：

$cred = gcloud auth print-access-token
$headers = @{ "Authorization" = "Bearer $cred" }

Invoke-WebRequest `
-Method POST `
-Headers $headers `
-ContentType: "application/json; charset=utf-8" `
-InFile request.json `
-Uri "https://LOCATION-aiplatform.googleapis.com/v1/projects/PROJECT_ID/locations/LOCATION/publishers/google/models/imagegeneration@002:predict" | Select-Object -Expand Content
以下示例响应适用于包含 "sampleCount": 2 的请求。响应返回两个预测对象，其中生成的图片字节采用 base64 编码。
{ "predictions": [ { "bytesBase64Encoded": "BASE64_IMG_BYTES", "mimeType": "image/png" }, { "mimeType": "image/png", "bytesBase64Encoded": "BASE64_IMG_BYTES" } ] }
后续步骤

阅读有关 Imagen 和其他 Vertex AI 上的生成式 AI 产品的文章：

使用文本提示修改图片 使用集合让一切井井有条 根据您的偏好保存内容并对其进行分类。

无蒙版（整个图片）修改示例

准备工作

Console

Python

REST

使用无蒙版修改

控制台

修改生成的图片

修改上传的图片

Python

REST

curl

PowerShell

后续步骤

使用文本提示修改图片