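Manage queued resources

Queued resources let you request Cloud TPU resources through a queue. When you request queued resources, the request is added to a queue maintained by the Cloud TPU service. When the requested resources become available, they are assigned to your Google Cloud project for your immediate, exclusive use. The resources remain assigned to your project until you delete them or they are preempted. Only TPU Spot VMs and preemptible TPUs are eligible for preemption.

You can specify an optional start time and end time in a queued resource request. The start time is the earliest time at which the request can be fulfilled. If the request is not fulfilled by the specified end time, it expires. An expired request remains in the queue.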

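A queued resource request can be in one of the following states:

WAITING_FOR_RESOURCES: The request has passed initial validation and has been added to the queue. It remains in this state until enough free resources are available to begin provisioning it or until the allocation interval elapses. When demand is high, not every request can be provisioned immediately. If you need more reliable access to TPUs, consider purchasing a reservation.
Important: WAITING_FOR_RESOURCES replaces the ACCEPTED state. If your code waits for a queued resource to reach the ACCEPTED state, you might need to update it to wait for WAITING_FOR_RESOURCES instead.
PROVISIONING: The request has been selected from the queue and its resources are being allocated.
ACTIVE: The request has been allocated. When a queued resource request is in the ACTIVE state, you can manage your TPU VMs as described in Manage TPUs.
FAILED: The request could not be fulfilled, either because of a problem with the request or because the requested resources were not available within the allocation interval. The request remains in the queue until it is explicitly deleted.
SUSPENDING: The resources associated with the request are being deleted.
SUSPENDED: The resources specified in the request have been deleted. A request in the SUSPENDED state is no longer eligible for further allocation.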
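
The state list above can be turned into a simple polling loop. The following is a minimal sketch, not an official client: get_state is a hypothetical callable that you would supply (for example, a wrapper around whatever call you use to read the request's state), and the grouping of states into "pending" and "terminal" is an assumption of this sketch based on the descriptions above.

```python
import time
from typing import Callable

# State names come from the list above; the grouping is an assumption of this sketch.
PENDING_STATES = {"WAITING_FOR_RESOURCES", "PROVISIONING"}
TERMINAL_FAILURE_STATES = {"FAILED", "SUSPENDING", "SUSPENDED"}


def wait_until_active(
    get_state: Callable[[], str],
    poll_seconds: float = 30.0,
    max_polls: int = 120,
) -> str:
    """Poll get_state() until the request is ACTIVE or can no longer become ACTIVE.

    get_state is a hypothetical callable that returns the current state name.
    Returns the last state seen; raises TimeoutError if max_polls is exhausted.
    """
    for _ in range(max_polls):
        state = get_state()
        if state == "ACTIVE" or state in TERMINAL_FAILURE_STATES:
            return state
        if state not in PENDING_STATES:
            # Covers retired names such as ACCEPTED as well as typos.
            raise ValueError(f"unexpected state: {state}")
        time.sleep(poll_seconds)
    raise TimeoutError("queued resource did not leave the pending states")
```

Passing the state reader in as a callable keeps the loop independent of any particular client library, and makes it easy to exercise with a fake sequence of states.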

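Prerequisites

Before running the commands in this guide, you must install the Google Cloud CLI, create a Google Cloud project, and enable the Cloud TPU API. For instructions, see Set up the Cloud TPU environment.

If you're using one of the Cloud Client Libraries, follow the setup instructions for your language.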

Python
Java

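Request an on-demand queued resource

On-demand resources are not preempted, but on-demand quota does not guarantee that enough Cloud TPU resources will be available to fulfill your request. For more information about on-demand resources, see Quota types.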

gcloud

gcloud compute tpus queued-resources create your-queued-resource-id \
    --node-id your-node-id \
    --project your-project-id \
    --zone us-central1-a \
    --accelerator-type v5litepod-8 \
    --runtime-version v2-alpha-tpuv5-lite

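Command parameter descriptions

queued-resource-id: The user-assigned ID of the queued resource request.
node-id: The user-assigned ID of the TPU that is created when the queued resource request is allocated.
project: Your Google Cloud project.
zone: The zone in which to create the Cloud TPU.
accelerator-type: The accelerator type specifies the version and size of the Cloud TPU you want to create. For more information about the accelerator types supported for each TPU version, see TPU versions.
runtime-version: The Cloud TPU software version.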

curl

curl -X POST -H "Authorization: Bearer $(gcloud auth print-access-token)" \
-H "Content-Type: application/json" \
-d "{
    'tpu': {
    'node_spec': {
        'parent': 'projects/your-project-number/locations/us-central1-a',
        'node_id': 'your-node-id',
        'node': {
        'accelerator_type': 'v5litepod-8',
        'runtime_version': 'v2-alpha-tpuv5-lite',
        }
    }
    }
}" \
https://tpu.googleapis.com/v2alpha1/projects/your-project-id/locations/us-central1-a/queuedResources?queued_resource_id=your-queued-resource-id

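Command parameter descriptions

queued-resource-id: The user-assigned ID of the queued resource request.
node-id: The user-assigned ID of the TPU that is created when the queued resource request is allocated.
project: Your Google Cloud project.
zone: The zone in which to create the Cloud TPU.
accelerator-type: The accelerator type specifies the version and size of the Cloud TPU you want to create. For more information about the accelerator types supported for each TPU version, see TPU versions.
runtime-version: The Cloud TPU software version.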
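
The request body in the curl example is written with single-quoted keys inside a shell string. If you prefer to generate a strictly valid JSON body, a small helper can build it for you. The following is a sketch under stated assumptions: the field names and nesting simply mirror the curl example above, and build_request_body is a hypothetical helper name, not part of any Google library.

```python
import json


def build_request_body(
    project: str,
    zone: str,
    node_id: str,
    accelerator_type: str,
    runtime_version: str,
) -> str:
    """Return a JSON body shaped like the curl example, as strictly valid JSON."""
    body = {
        "tpu": {
            # Nesting mirrors the curl example in this guide.
            "node_spec": {
                "parent": f"projects/{project}/locations/{zone}",
                "node_id": node_id,
                "node": {
                    "accelerator_type": accelerator_type,
                    "runtime_version": runtime_version,
                },
            }
        }
    }
    return json.dumps(body, indent=2)


print(
    build_request_body(
        "your-project-number",
        "us-central1-a",
        "your-node-id",
        "v5litepod-8",
        "v2-alpha-tpuv5-lite",
    )
)
```

You could write the output to a file and pass it to curl with -d @body.json, which avoids shell quoting problems entirely.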

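Console

In the Google Cloud console, go to the TPUs page:

Go to TPUs
Click Create TPU.
In the Name field, enter a name for your TPU.
In the Zone box, select the zone in which to create the TPU.
In the TPU type box, select an accelerator type. The accelerator type specifies the version and size of the Cloud TPU you want to create. For more information about the accelerator types supported for each TPU version, see TPU versions.
In the TPU software version box, select a software version. When you create a Cloud TPU VM, the TPU software version specifies the version of the TPU runtime to install. For more information, see TPU software versions.
Click the Enable queueing toggle.
In the Queued resource name field, enter a name for the queued resource request.
Click Create to create the queued resource request.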

Java

To authenticate to Cloud TPU, set up Application Default Credentials. For more information, see Set up authentication for a local development environment.

import com.google.cloud.tpu.v2alpha1.CreateQueuedResourceRequest;
import com.google.cloud.tpu.v2alpha1.Node;
import com.google.cloud.tpu.v2alpha1.QueuedResource;
import com.google.cloud.tpu.v2alpha1.TpuClient;
import java.io.IOException;
import java.util.concurrent.ExecutionException;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.TimeoutException;

public class CreateQueuedResource {
  public static void main(String[] args)
          throws IOException, ExecutionException, InterruptedException, TimeoutException {
    // TODO(developer): Replace these variables before running the sample.
    // Project ID or project number of the Google Cloud project you want to create a node.
    String projectId = "YOUR_PROJECT_ID";
    // The zone in which to create the TPU.
    // For more information about supported TPU types for specific zones,
    // see https://cloud.google.com/tpu/docs/regions-zones
    String zone = "us-central1-a";
    // The name for your TPU.
    String nodeName = "YOUR_NODE_ID";
    // The accelerator type that specifies the version and size of the Cloud TPU you want to create.
    // For more information about supported accelerator types for each TPU version,
    // see https://cloud.google.com/tpu/docs/system-architecture-tpu-vm#versions.
    String tpuType = "v5litepod-4";
    // Software version that specifies the version of the TPU runtime to install.
    // For more information see https://cloud.google.com/tpu/docs/runtimes
    String tpuSoftwareVersion = "v2-tpuv5-litepod";
    // The name for your Queued Resource.
    String queuedResourceId = "QUEUED_RESOURCE_ID";

    createQueuedResource(
        projectId, zone, queuedResourceId, nodeName, tpuType, tpuSoftwareVersion);
  }

  // Creates a Queued Resource
  public static QueuedResource createQueuedResource(String projectId, String zone,
      String queuedResourceId, String nodeName, String tpuType, String tpuSoftwareVersion)
          throws IOException, ExecutionException, InterruptedException, TimeoutException {
    String resource = String.format("projects/%s/locations/%s/queuedResources/%s",
            projectId, zone, queuedResourceId);
    // Initialize client that will be used to send requests. This client only needs to be created
    // once, and can be reused for multiple requests.
    try (TpuClient tpuClient = TpuClient.create()) {
      String parent = String.format("projects/%s/locations/%s", projectId, zone);
      Node node =
          Node.newBuilder()
              .setName(nodeName)
              .setAcceleratorType(tpuType)
              .setRuntimeVersion(tpuSoftwareVersion)
              .setQueuedResource(resource)
              .build();

      QueuedResource queuedResource =
          QueuedResource.newBuilder()
              .setName(queuedResourceId)
              .setTpu(
                  QueuedResource.Tpu.newBuilder()
                      .addNodeSpec(
                          QueuedResource.Tpu.NodeSpec.newBuilder()
                              .setParent(parent)
                              .setNode(node)
                              .setNodeId(nodeName)
                              .build())
                      .build())
              .build();

      CreateQueuedResourceRequest request =
          CreateQueuedResourceRequest.newBuilder()
              .setParent(parent)
              .setQueuedResourceId(queuedResourceId)
              .setQueuedResource(queuedResource)
              .build();

      return tpuClient.createQueuedResourceAsync(request).get(1, TimeUnit.MINUTES);
    }
  }
}

Python

To authenticate to Cloud TPU, set up Application Default Credentials. For more information, see Set up authentication for a local development environment.

from google.cloud import tpu_v2alpha1

# TODO(developer): Update and un-comment below lines
# project_id = "your-project-id"
# zone = "us-central1-a"
# tpu_name = "tpu-name"
# tpu_type = "v5litepod-4"
# runtime_version = "v2-tpuv5-litepod"
# queued_resource_name = "resource-name"

node = tpu_v2alpha1.Node()
node.accelerator_type = tpu_type
# To see available runtime version use command:
# gcloud compute tpus versions list --zone={ZONE}
node.runtime_version = runtime_version

node_spec = tpu_v2alpha1.QueuedResource.Tpu.NodeSpec()
node_spec.parent = f"projects/{project_id}/locations/{zone}"
node_spec.node_id = tpu_name
node_spec.node = node

resource = tpu_v2alpha1.QueuedResource()
resource.tpu = tpu_v2alpha1.QueuedResource.Tpu(node_spec=[node_spec])

request = tpu_v2alpha1.CreateQueuedResourceRequest(
    parent=f"projects/{project_id}/locations/{zone}",
    queued_resource_id=queued_resource_name,
    queued_resource=resource,
)

client = tpu_v2alpha1.TpuClient()
operation = client.create_queued_resource(request=request)

response = operation.result()
print(response.name)
print(response.state)
# Example response:
# projects/[project_id]/locations/[zone]/queuedResources/resource-name
# State.WAITING_FOR_RESOURCES

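Request a queued resource using a reservation

You can request queued resources using a reservation. To purchase a reservation, contact your Google Cloud account team.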

gcloud

gcloud compute tpus queued-resources create your-queued-resource-id \
    --node-id your-node-id \
    --project your-project-id \
    --zone us-central1-a \
    --accelerator-type v5litepod-8 \
    --runtime-version v2-alpha-tpuv5-lite \
    --reserved

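Command parameter descriptions

queued-resource-id: The user-assigned ID of the queued resource request.
node-id: The user-assigned ID of the TPU that is created when the queued resource request is allocated.
project: Your Google Cloud project.
zone: The zone in which to create the Cloud TPU.
accelerator-type: The accelerator type specifies the version and size of the Cloud TPU you want to create. For more information about the accelerator types supported for each TPU version, see TPU versions.
runtime-version: The Cloud TPU software version.
reserved: Use this flag when requesting a queued resource as part of a Cloud TPU reservation.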

curl

curl -X POST -H "Authorization: Bearer $(gcloud auth print-access-token)" \
-H "Content-Type: application/json" \
-d "{
    'tpu': {
    'node_spec': {
        'parent': 'projects/your-project-number/locations/us-central1-a',
        'node_id': 'your-node-id',
        'node': {
        'accelerator_type': 'v5litepod-8',
        'runtime_version': 'v2-alpha-tpuv5-lite',
        }
    }
    },
    'guaranteed': {
    'reserved': true,
    }
}" \
https://tpu.googleapis.com/v2alpha1/projects/your-project-id/locations/us-central1-a/queuedResources?queued_resource_id=your-queued-resource-id

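Command parameter descriptions

queued-resource-id: The user-assigned ID of the queued resource request.
node-id: The user-assigned ID of the TPU that is created when the queued resource request is allocated.
project: Your Google Cloud project.
zone: The zone in which to create the Cloud TPU.
accelerator-type: The accelerator type specifies the version and size of the Cloud TPU you want to create. For more information about the accelerator types supported for each TPU version, see TPU versions.
runtime-version: The Cloud TPU software version.
reserved: Use this flag when requesting a queued resource as part of a Cloud TPU reservation.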

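Console

In the Google Cloud console, go to the TPUs page:

Go to TPUs
Click Create TPU.
In the Name field, enter a name for your TPU.
In the Zone box, select the zone in which to create the TPU.
In the TPU type box, select an accelerator type. The accelerator type specifies the version and size of the Cloud TPU you want to create. For more information about the accelerator types supported for each TPU version, see TPU versions.
In the TPU software version box, select a software version. When you create a Cloud TPU VM, the TPU software version specifies the version of the TPU runtime to install. For more information, see TPU software versions.
Click the Enable queueing toggle.
In the Queued resource name field, enter a name for the queued resource request.
Expand the Management section.
Select the Use existing reservation checkbox.
Click Create to create the queued resource request.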

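Request a TPU Spot VM queued resource

A Spot VM is a resource that can be preempted and assigned to another workload at any time. Spot VM resources cost less than non-Spot VMs, and you might get access to resources sooner. For more information about TPU Spot VMs, see Manage TPU Spot VMs.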

gcloud

gcloud compute tpus queued-resources create your-queued-resource-id \
    --node-id your-node-id \
    --project your-project-id \
    --zone us-central1-a \
    --accelerator-type v5litepod-8 \
    --runtime-version v2-alpha-tpuv5-lite \
    --spot

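Command parameter descriptions

queued-resource-request-id: The user-assigned ID of the queued resource request.
node-id: The user-defined ID of the TPU that is created in response to the request.
project: The ID of the project in which the queued resource is allocated.
zone: The zone in which to create the Cloud TPU.
accelerator-type: The accelerator type specifies the version and size of the Cloud TPU you want to create. For more information about the accelerator types supported for each TPU version, see TPU versions.
runtime-version: The Cloud TPU software version.
spot: A boolean flag that specifies that the queued resource is a Spot VM.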

curl

curl -X POST -H "Authorization: Bearer $(gcloud auth print-access-token)" \
-H "Content-Type: application/json" \
-d "{
    'tpu': {
    'node_spec': {
        'parent': 'projects/your-project-number/locations/us-central1-a',
        'node_id': 'your-node-id',
        'node': {
        'accelerator_type': 'v5litepod-8',
        'runtime_version': 'v2-alpha-tpuv5-lite'
        }
    }
    },
    'spot': {}
}" \
https://tpu.googleapis.com/v2alpha1/projects/your-project-id/locations/us-central1-a/queuedResources?queued_resource_id=your-queued-resource-id

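Command parameter descriptions

queued-resource-request-id: The user-assigned ID of the queued resource request.
node-id: The user-defined ID of the TPU that is created in response to the request.
project: The ID of the project in which the queued resource is allocated.
zone: The zone in which to create the Cloud TPU.
accelerator-type: The accelerator type specifies the version and size of the Cloud TPU you want to create. For more information about the accelerator types supported for each TPU version, see TPU versions.
runtime-version: The Cloud TPU software version.
spot: A boolean flag that specifies that the queued resource is a Spot VM.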
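
In the curl examples in this guide, the on-demand, reservation, and Spot requests differ only by one top-level key in the JSON body: nothing extra for on-demand, 'guaranteed': {'reserved': true} for a reservation, and 'spot': {} for a Spot VM. The sketch below makes that difference explicit. The function name with_tier and the tier labels "on-demand", "reserved", and "spot" are illustrative choices of this sketch, not API values.

```python
import copy
import json


def with_tier(body: dict, tier: str) -> dict:
    """Return a copy of a request body with the key that selects the capacity tier.

    The keys added here mirror the curl examples in this guide; the tier labels
    are hypothetical names used only by this helper.
    """
    result = copy.deepcopy(body)
    if tier == "reserved":
        result["guaranteed"] = {"reserved": True}
    elif tier == "spot":
        result["spot"] = {}
    elif tier != "on-demand":
        raise ValueError(f"unknown tier: {tier}")
    return result


base = {
    "tpu": {
        "node_spec": {
            "parent": "projects/your-project-number/locations/us-central1-a",
            "node_id": "your-node-id",
            "node": {
                "accelerator_type": "v5litepod-8",
                "runtime_version": "v2-alpha-tpuv5-lite",
            },
        }
    }
}

print(json.dumps(with_tier(base, "spot"), indent=2))
```

Working on a deep copy means the same base body can be reused to build several variants without one request leaking keys into another.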

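Console

In the Google Cloud console, go to the TPUs page:

Go to TPUs
Click Create TPU.
In the Name field, enter a name for your TPU.
In the Zone box, select the zone in which to create the TPU.
In the TPU type box, select an accelerator type. The accelerator type specifies the version and size of the Cloud TPU you want to create. For more information about the accelerator types supported for each TPU version, see TPU versions.
In the TPU software version box, select a software version. When you create a Cloud TPU VM, the TPU software version specifies the version of the TPU runtime to install. For more information, see TPU software versions.
Click the Enable queueing toggle.
In the Queued resource name field, enter a name for the queued resource request.
Expand the Management section.
Select the Make this a TPU Spot VM checkbox.
Click Create.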

Java

To authenticate to Cloud TPU, set up Application Default Credentials. For more information, see Set up authentication for a local development environment.

import com.google.cloud.tpu.v2alpha1.CreateQueuedResourceRequest;
import com.google.cloud.tpu.v2alpha1.Node;
import com.google.cloud.tpu.v2alpha1.QueuedResource;
import com.google.cloud.tpu.v2alpha1.SchedulingConfig;
import com.google.cloud.tpu.v2alpha1.TpuClient;
import java.io.IOException;
import java.util.concurrent.ExecutionException;

public class CreateSpotQueuedResource {
  public static void main(String[] args)
      throws IOException, ExecutionException, InterruptedException {
    // TODO(developer): Replace these variables before running the sample.
    // Project ID or project number of the Google Cloud project you want to create a node.
    String projectId = "YOUR_PROJECT_ID";
    // The zone in which to create the TPU.
    // For more information about supported TPU types for specific zones,
    // see https://cloud.google.com/tpu/docs/regions-zones
    String zone = "us-central1-a";
    // The name for your TPU.
    String nodeName = "YOUR_TPU_NAME";
    // The accelerator type that specifies the version and size of the Cloud TPU you want to create.
    // For more information about supported accelerator types for each TPU version,
    // see https://cloud.google.com/tpu/docs/system-architecture-tpu-vm#versions.
    String tpuType = "v5litepod-4";
    // Software version that specifies the version of the TPU runtime to install.
    // For more information see https://cloud.google.com/tpu/docs/runtimes
    String tpuSoftwareVersion = "v2-tpuv5-litepod";
    // The name for your Queued Resource.
    String queuedResourceId = "QUEUED_RESOURCE_ID";

    createQueuedResource(
        projectId, zone, queuedResourceId, nodeName, tpuType, tpuSoftwareVersion);
  }

  // Creates a Queued Resource with --preemptible flag.
  public static QueuedResource createQueuedResource(
      String projectId, String zone, String queuedResourceId,
      String nodeName, String tpuType, String tpuSoftwareVersion)
      throws IOException, ExecutionException, InterruptedException {
    // Initialize client that will be used to send requests. This client only needs to be created
    // once, and can be reused for multiple requests.
    try (TpuClient tpuClient = TpuClient.create()) {
      String parent = String.format("projects/%s/locations/%s", projectId, zone);
      String resourceName = String.format("projects/%s/locations/%s/queuedResources/%s",
              projectId, zone, queuedResourceId);
      SchedulingConfig schedulingConfig = SchedulingConfig.newBuilder()
          .setPreemptible(true)
          .build();

      Node node =
          Node.newBuilder()
              .setName(nodeName)
              .setAcceleratorType(tpuType)
              .setRuntimeVersion(tpuSoftwareVersion)
              .setSchedulingConfig(schedulingConfig)
              .setQueuedResource(resourceName)
              .build();

      QueuedResource queuedResource =
          QueuedResource.newBuilder()
              .setName(queuedResourceId)
              .setTpu(
                  QueuedResource.Tpu.newBuilder()
                      .addNodeSpec(
                          QueuedResource.Tpu.NodeSpec.newBuilder()
                              .setParent(parent)
                              .setNode(node)
                              .setNodeId(nodeName)
                              .build())
                      .build())
              .build();

      CreateQueuedResourceRequest request =
          CreateQueuedResourceRequest.newBuilder()
              .setParent(parent)
              .setQueuedResourceId(queuedResourceId)
              .setQueuedResource(queuedResource)
              .build();

      return tpuClient.createQueuedResourceAsync(request).get();
    }
  }
}

Python

To authenticate to Cloud TPU, set up Application Default Credentials. For more information, see Set up authentication for a local development environment.

from google.cloud import tpu_v2alpha1

# TODO(developer): Update and un-comment below lines
# project_id = "your-project-id"
# zone = "us-central1-a"
# tpu_name = "tpu-name"
# tpu_type = "v5litepod-4"
# runtime_version = "v2-tpuv5-litepod"
# queued_resource_name = "resource-name"

node = tpu_v2alpha1.Node()
node.accelerator_type = tpu_type
# To see available runtime version use command:
# gcloud compute tpus versions list --zone={ZONE}
node.runtime_version = runtime_version

node_spec = tpu_v2alpha1.QueuedResource.Tpu.NodeSpec()
node_spec.parent = f"projects/{project_id}/locations/{zone}"
node_spec.node_id = tpu_name
node_spec.node = node

resource = tpu_v2alpha1.QueuedResource()
resource.tpu = tpu_v2alpha1.QueuedResource.Tpu(node_spec=[node_spec])
# Create a spot resource
resource.spot = tpu_v2alpha1.QueuedResource.Spot()

request = tpu_v2alpha1.CreateQueuedResourceRequest(
    parent=f"projects/{project_id}/locations/{zone}",
    queued_resource_id=queued_resource_name,
    queued_resource=resource,
)

client = tpu_v2alpha1.TpuClient()
operation = client.create_queued_resource(request=request)
response = operation.result()

print(response.name)
print(response.state)
# Example response:
# projects/[project_id]/locations/[zone]/queuedResources/resource-name
# State.WAITING_FOR_RESOURCES

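Request a queued resource to be allocated before or after a specified time

You can specify an optional start time or end time in a queued resource request. The start time or start duration specifies the earliest time at which the request can be fulfilled. The end time or end duration specifies how long the request remains valid. If the request is not fulfilled by the specified end time or within the specified duration, it expires. An expired request remains in the queue but is no longer eligible for allocation.

You can also specify an allocation interval by specifying both a start time or duration and an end time or duration.

For a list of supported timestamp and duration formats, see datetimes.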
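
The examples in this guide express the same constraints in two ways: the gcloud flags take values such as 6h or 2022-12-14T09:00:00Z, while the curl bodies use a number of seconds. The following sketch converts between the two. It handles only the simple forms that appear in this guide (digit-plus-unit durations and UTC timestamps ending in Z); the formats accepted by the Google Cloud CLI are broader, and both function names are hypothetical.

```python
import re
from datetime import datetime, timezone

_UNIT_SECONDS = {"d": 86400, "h": 3600, "m": 60, "s": 1}


def duration_to_seconds(text: str) -> int:
    """Convert a simple duration such as '6h' or '5h30m' to seconds."""
    parts = re.findall(r"(\d+)([dhms])", text)
    # Reject input that is not made up entirely of <number><unit> pairs.
    if not parts or "".join(number + unit for number, unit in parts) != text:
        raise ValueError(f"unsupported duration: {text!r}")
    return sum(int(number) * _UNIT_SECONDS[unit] for number, unit in parts)


def timestamp_to_epoch(text: str) -> int:
    """Convert a UTC timestamp such as '2022-12-14T09:00:00Z' to Unix seconds."""
    parsed = datetime.strptime(text, "%Y-%m-%dT%H:%M:%SZ")
    return int(parsed.replace(tzinfo=timezone.utc).timestamp())


print(duration_to_seconds("6h"))                   # 21600
print(duration_to_seconds("5h30m"))                # 19800
print(timestamp_to_epoch("2022-12-14T09:00:00Z"))  # 1671008400
```

Attaching the UTC time zone explicitly before calling timestamp() matters: a naive datetime would be interpreted in the local time zone of the machine running the code.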

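Request a queued resource after a specified time

In a queued resource request, you can specify a time or duration after which the resource should be allocated.

gcloud

The following command requests a v5p-4096 TPU to be allocated after 9:00 AM on December 14, 2022.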

gcloud compute tpus queued-resources create your-queued-resource-id \
    --node-id your-node-id \
    --project your-project-id \
    --zone us-east5-a \
    --accelerator-type v5p-4096 \
    --runtime-version v2-alpha-tpuv5 \
    --valid-after-time 2022-12-14T09:00:00Z

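Command parameter descriptions

queued-resource-request-id: The user-assigned ID of the queued resource request.
node-id: The user-defined ID of the TPU that is created in response to the request.
project: The Google Cloud project in which the queued resource is allocated.
zone: The zone in which to create the Cloud TPU.
accelerator-type: The accelerator type specifies the version and size of the Cloud TPU you want to create. For more information about the accelerator types supported for each TPU version, see TPU versions.
runtime-version: The Cloud TPU software version.
valid-after-time: The time after which the resource should be allocated. For more information about timestamp formats, see the Google Cloud CLI topic datetimes.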

curl

The following command requests a v5p-4096 TPU to be allocated after 9:00 AM on December 14, 2022.

curl -X POST -H "Authorization: Bearer $(gcloud auth print-access-token)" \
-H "Content-Type: application/json" \
-d "{
    'tpu': {
    'node_spec': {
        'parent': 'projects/your-project-number/locations/us-east5-a',
        'node_id': 'your-node-id',
        'node': {
        'accelerator_type': 'v5p-4096',
        'runtime_version': 'v2-alpha-tpuv5',
        }
    }
    },
    'queueing_policy': {
    'valid_after_time': {
        'seconds': 1671008400
    }
    }
}" \
https://tpu.googleapis.com/v2alpha1/projects/your-project-id/locations/us-east5-a/queuedResources?queued_resource_id=your-queued-resource-id

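Command parameter descriptions

queued-resource-request-id: The user-assigned ID of the queued resource request.
node-id: The user-defined ID of the TPU that is created in response to the request.
project: The Google Cloud project in which the queued resource is allocated.
zone: The zone in which to create the Cloud TPU.
accelerator-type: The accelerator type specifies the version and size of the Cloud TPU you want to create. For more information about the accelerator types supported for each TPU version, see TPU versions.
runtime-version: The Cloud TPU software version.
valid-after-time: The time after which the resource should be allocated. For more information about timestamp formats, see the Google Cloud CLI topic datetimes.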

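Console

In the Google Cloud console, go to the TPUs page:

Go to TPUs
Click Create TPU.
In the Name field, enter a name for your TPU.
In the Zone box, select the zone in which to create the TPU.
In the TPU type box, select an accelerator type. The accelerator type specifies the version and size of the Cloud TPU you want to create. For more information about the accelerator types supported for each TPU version, see TPU versions.
In the TPU software version box, select a software version. When you create a Cloud TPU VM, the TPU software version specifies the version of the TPU runtime to install. For more information, see TPU software versions.
Click the Enable queueing toggle.
In the Queued resource name field, enter a name for the queued resource request.
In the Start request on field, enter the time after which the resource should be allocated.
Click Create to create the queued resource request.

The following example requests a v5p-32 to be allocated after six hours.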

gcloud

gcloud compute tpus queued-resources create your-queued-resource-id \
    --node-id your-node-id \
    --project your-project-id \
    --zone us-east5-a \
    --accelerator-type v5p-32 \
    --runtime-version v2-alpha-tpuv5 \
    --valid-after-duration 6h

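Command parameter descriptions

queued-resource-request-id: The user-assigned ID of the queued resource request.
node-id: The user-defined ID of the TPU that is created in response to the request.
project: The Google Cloud project in which the queued resource is allocated.
zone: The zone in which to create the Cloud TPU.
accelerator-type: The accelerator type specifies the version and size of the Cloud TPU you want to create. For more information about the accelerator types supported for each TPU version, see TPU versions.
runtime-version: The Cloud TPU software version.
valid-after-duration: The duration that must elapse before the TPU can be provisioned. For more information about duration formats, see the Google Cloud CLI topic datetimes.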

curl

curl -X POST -H "Authorization: Bearer $(gcloud auth print-access-token)" \
-H "Content-Type: application/json" \
-d "{
    'tpu': {
    'node_spec': {
        'parent': 'projects/your-project-number/locations/us-east5-a',
        'node_id': 'your-node-id',
        'node': {
        'accelerator_type': 'v5p-32',
        'runtime_version': 'v2-alpha-tpuv5',
        }
    }
    },
    'queueing_policy': {
    'valid_after_duration': {
        'seconds': 21600
    }
    }
}" \
https://tpu.googleapis.com/v2alpha1/projects/your-project-id/locations/us-east5-a/queuedResources?queued_resource_id=your-queued-resource-id

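Command parameter descriptions

queued-resource-request-id: The user-assigned ID of the queued resource request.
node-id: The user-defined ID of the TPU that is created in response to the request.
project: The Google Cloud project in which the queued resource is allocated.
zone: The zone in which to create the Cloud TPU.
accelerator-type: The accelerator type specifies the version and size of the Cloud TPU you want to create. For more information about the accelerator types supported for each TPU version, see TPU versions.
runtime-version: The Cloud TPU software version.
valid-after-duration: The duration that must elapse before the TPU can be provisioned. For more information about duration formats, see the Google Cloud CLI topic datetimes.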

Java

To authenticate to Cloud TPU, set up Application Default Credentials. For more information, see Set up authentication for a local development environment.

import com.google.cloud.tpu.v2alpha1.CreateQueuedResourceRequest;
import com.google.cloud.tpu.v2alpha1.Node;
import com.google.cloud.tpu.v2alpha1.QueuedResource;
import com.google.cloud.tpu.v2alpha1.TpuClient;
import com.google.protobuf.Duration;
import java.io.IOException;
import java.util.concurrent.ExecutionException;

public class CreateTimeBoundQueuedResource {

  public static void main(String[] args)
          throws IOException, ExecutionException, InterruptedException {
    // TODO(developer): Replace these variables before running the sample.
    // Project ID or project number of the Google Cloud project you want to create a node.
    String projectId = "YOUR_PROJECT_ID";
    // The zone in which to create the TPU.
    // For more information about supported TPU types for specific zones,
    // see https://cloud.google.com/tpu/docs/regions-zones
    String zone = "us-central2-b";
    // The name of your node.
    String nodeId = "YOUR_NODE_ID";
    // The accelerator type that specifies the version and size of the Cloud TPU you want to create.
    // For more information about supported accelerator types for each TPU version,
    // see https://cloud.google.com/tpu/docs/system-architecture-tpu-vm#versions.
    String acceleratorType = "v2-8";
    // Software version that specifies the version of the TPU runtime to install.
    // For more information see https://cloud.google.com/tpu/docs/runtimes
    String runtimeVersion = "v2-tpuv5-litepod";
    // The name of your Queued Resource.
    String queuedResourceId = "YOUR_QUEUED_RESOURCE_ID";

    createTimeBoundQueuedResource(projectId, nodeId,
        queuedResourceId, zone, acceleratorType, runtimeVersion);
  }

  // Creates a Queued Resource with time bound configuration.
  public static QueuedResource createTimeBoundQueuedResource(
      String projectId, String nodeId, String queuedResourceId,
      String zone, String acceleratorType, String runtimeVersion)
          throws IOException, ExecutionException, InterruptedException {
    // Initialize client that will be used to send requests. This client only needs to be created
    // once, and can be reused for multiple requests.
    try (TpuClient tpuClient = TpuClient.create()) {
      String parent = String.format("projects/%s/locations/%s", projectId, zone);
      // Create a Duration object representing 6 hours.
      Duration validAfterDuration = Duration.newBuilder().setSeconds(6 * 3600).build();
      // You could also use timestamps like this:
      // Timestamp validAfterTime = Timestamps.parse("2024-10-14T09:00:00Z");

      Node node =
          Node.newBuilder()
              .setName(nodeId)
              .setAcceleratorType(acceleratorType)
              .setRuntimeVersion(runtimeVersion)
              .setQueuedResource(
                  String.format(
                      "projects/%s/locations/%s/queuedResources/%s",
                      projectId, zone, queuedResourceId))
              .build();

      QueuedResource queuedResource =
          QueuedResource.newBuilder()
              .setName(queuedResourceId)
              .setTpu(
                  QueuedResource.Tpu.newBuilder()
                      .addNodeSpec(
                          QueuedResource.Tpu.NodeSpec.newBuilder()
                              .setParent(parent)
                              .setNode(node)
                              .setNodeId(nodeId)
                              .build())
                      .build())
              .setQueueingPolicy(
                  QueuedResource.QueueingPolicy.newBuilder()
                      .setValidAfterDuration(validAfterDuration)
                      // .setValidAfterTime(validAfterTime)
                      .build())
              .build();

      CreateQueuedResourceRequest request =
          CreateQueuedResourceRequest.newBuilder()
              .setParent(parent)
              .setQueuedResource(queuedResource)
              .setQueuedResourceId(queuedResourceId)
              .build();

      return tpuClient.createQueuedResourceAsync(request).get();
    }
  }
}

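Request a queued resource that expires after a specified time

In a queued resource request, you can specify how long the request remains valid. If the request is not fulfilled by the time or within the duration that you specify, it expires.

gcloud

The following command requests a v5p-4096 TPU. If the request is not fulfilled by 9:00 AM on December 14, 2022, it expires.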

gcloud compute tpus queued-resources create your-queued-resource-id \
    --node-id your-node-id \
    --project your-project-id \
    --zone us-east5-a \
    --accelerator-type v5p-4096 \
    --runtime-version v2-alpha-tpuv5 \
    --valid-until-time 2022-12-14T09:00:00Z

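Command parameter descriptions

queued-resource-request-id: The user-assigned ID of the queued resource request.
node-id: The user-defined ID of the TPU that is created in response to the request.
project: The ID of the project in which the queued resource is allocated.
zone: The zone in which to create the Cloud TPU.
accelerator-type: The accelerator type specifies the version and size of the Cloud TPU you want to create. For more information about the accelerator types supported for each TPU version, see TPU versions.
runtime-version: The Cloud TPU software version.
valid-until-time: The time after which the request is canceled. For more information about timestamp formats, see the Google Cloud CLI topic datetimes.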

curl

The following command requests a v5p-4096 TPU. If the request is not fulfilled by 9:00 AM on December 14, 2022, it expires.

curl -X POST -H "Authorization: Bearer $(gcloud auth print-access-token)" \
-H "Content-Type: application/json" \
-d "{
    'tpu': {
    'node_spec': {
        'parent': 'projects/your-project-number/locations/us-east5-a',
        'node_id': 'your-node-id',
        'node': {
        'accelerator_type': 'v5p-4096',
        'runtime_version': 'v2-alpha-tpuv5',
        }
    }
    },
    'queueing_policy': {
    'valid_until_time': {
        'seconds': 1671008400
    }
    }
}" \
https://tpu.googleapis.com/v2alpha1/projects/your-project-id/locations/us-east5-a/queuedResources?queued_resource_id=your-queued-resource-id

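Command parameter descriptions

queued-resource-request-id: The user-assigned ID of the queued resource request.
node-id: The user-defined ID of the TPU that is created in response to the request.
project: The ID of the project in which the queued resource is allocated.
zone: The zone in which to create the Cloud TPU.
accelerator-type: The accelerator type specifies the version and size of the Cloud TPU you want to create. For more information about the accelerator types supported for each TPU version, see TPU versions.
runtime-version: The Cloud TPU software version.
valid-until-time: The time after which the request is canceled. For more information about timestamp formats, see the Google Cloud CLI topic datetimes.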

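Console

In the Google Cloud console, go to the TPUs page:

Go to TPUs
Click Create TPU.
In the Name field, enter a name for your TPU.
In the Zone box, select the zone in which to create the TPU.
In the TPU type box, select an accelerator type. The accelerator type specifies the version and size of the Cloud TPU you want to create. For more information about the accelerator types supported for each TPU version, see TPU versions.
In the TPU software version box, select a software version. When you create a Cloud TPU VM, the TPU software version specifies the version of the TPU runtime to install. For more information, see TPU software versions.
Click the Enable queueing toggle.
In the Queued resource name field, enter a name for the queued resource request.
In the Cancel request on field, enter the time at which the queued resource request should expire if it has not been fulfilled.
Click Create to create the queued resource request.

The following example requests a v5p-32. If the request is not fulfilled within six hours, it expires.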

gcloud

gcloud compute tpus queued-resources create your-queued-resource-id \
    --node-id your-node-id \
    --project your-project-id \
    --zone us-east5-a \
    --accelerator-type v5p-32 \
    --runtime-version v2-alpha-tpuv5 \
    --valid-until-duration 6h

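Command parameter descriptions

queued-resource-request-id: The user-assigned ID of the queued resource request.
node-id: The user-defined ID of the TPU that is created in response to the request.
project: The Google Cloud project in which the queued resource is allocated.
zone: The zone in which to create the Cloud TPU.
accelerator-type: The accelerator type specifies the version and size of the Cloud TPU you want to create. For more information about the accelerator types supported for each TPU version, see TPU versions.
runtime-version: The Cloud TPU software version.
valid-until-duration: The duration for which the request is valid. For more information about duration formats, see the Google Cloud CLI topic datetimes.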

curl

curl -X POST -H "Authorization: Bearer $(gcloud auth print-access-token)" \
-H "Content-Type: application/json" \
-d "{
    'tpu': {
    'node_spec': {
        'parent': 'projects/your-project-number/locations/us-east5-a',
        'node_id': 'your-node-id',
        'node': {
        'accelerator_type': 'v5p-32',
        'runtime_version': 'v2-alpha-tpuv5',
        }
    }
    },
    'queueing_policy': {
    'valid_until_duration': {
        'seconds': 21600
    }
    }
}" \
https://tpu.googleapis.com/v2alpha1/projects/your-project-id/locations/us-east5-a/queuedResources?queued_resource_id=your-queued-resource-id

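Command parameter descriptions

queued-resource-request-id: The user-assigned ID of the queued resource request.
node-id: The user-defined ID of the TPU that is created in response to the request.
project: The Google Cloud project in which the queued resource is allocated.
zone: The zone in which to create the Cloud TPU.
accelerator-type: The accelerator type specifies the version and size of the Cloud TPU you want to create. For more information about the accelerator types supported for each TPU version, see TPU versions.
runtime-version: The Cloud TPU software version.
valid-until-duration: The duration for which the request is valid. For more information about duration formats, see the Google Cloud CLI topic datetimes.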

Python

To authenticate to Cloud TPU, set up Application Default Credentials. For more information, see Set up authentication for a local development environment.

from google.cloud import tpu_v2alpha1

# TODO(developer): Update and un-comment below lines
# project_id = "your-project-id"
# zone = "us-central1-a"
# tpu_name = "tpu-name"
# tpu_type = "v5litepod-4"
# runtime_version = "v2-tpuv5-litepod"
# queued_resource_name = "resource-name"

node = tpu_v2alpha1.Node()
node.accelerator_type = tpu_type
# To see available runtime version use command:
# gcloud compute tpus versions list --zone={ZONE}
node.runtime_version = runtime_version

node_spec = tpu_v2alpha1.QueuedResource.Tpu.NodeSpec()
node_spec.parent = f"projects/{project_id}/locations/{zone}"
node_spec.node_id = tpu_name
node_spec.node = node

resource = tpu_v2alpha1.QueuedResource()
resource.tpu = tpu_v2alpha1.QueuedResource.Tpu(node_spec=[node_spec])

# Use one of the following queueing policies
resource.queueing_policy = tpu_v2alpha1.QueuedResource.QueueingPolicy(
    # valid_after_duration = "6000s", # Duration after which a resource should be allocated
    valid_until_duration="90s",  # Specify how long a queued resource request remains valid
    # valid_after_time="2024-10-31T09:00:00Z", # Specify a time after which a resource should be allocated
    # valid_until_time="2024-10-29T16:00:00Z",  # Specify a time before which the resource should be allocated
)

request = tpu_v2alpha1.CreateQueuedResourceRequest(
    parent=f"projects/{project_id}/locations/{zone}",
    queued_resource_id=queued_resource_name,
    queued_resource=resource,
)

client = tpu_v2alpha1.TpuClient()
operation = client.create_queued_resource(request=request)

response = operation.result()
print(resource.queueing_policy)
print(response.queueing_policy.valid_until_time)
# Example response:
# valid_until_duration {
#   seconds: 90
# }
# 2024-10-29 14:22:53.562090+00:00

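Request a queued resource to be allocated within a specified interval

You can specify an allocation interval by specifying both a start time or duration and an end time or duration.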
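
To see how a start duration and an end time combine into one interval, the sketch below computes the interval's start and end and rejects an empty interval. It assumes that a start duration is measured from the moment the request is created; the function name allocation_window and the example creation time are hypothetical. With a creation time of 09:00 UTC on December 10, 2022, a 5h30m start duration yields the same 2022-12-10T14:30:00Z start that the curl example in this section uses.

```python
from datetime import datetime, timedelta, timezone
from typing import Tuple


def allocation_window(
    created_at: datetime, valid_after: timedelta, valid_until: datetime
) -> Tuple[datetime, datetime]:
    """Return (start, end) of the interval in which the request may be fulfilled.

    Assumes valid_after is measured from created_at (an assumption of this sketch).
    """
    start = created_at + valid_after
    if start >= valid_until:
        raise ValueError("allocation interval is empty: start is not before end")
    return start, valid_until


created = datetime(2022, 12, 10, 9, 0, tzinfo=timezone.utc)  # hypothetical request time
start, end = allocation_window(
    created,
    timedelta(hours=5, minutes=30),
    datetime(2022, 12, 14, 9, 0, tzinfo=timezone.utc),
)
print(start.isoformat(), end.isoformat())
```

Checking for an empty interval before you submit the request can save a round trip, since a request whose start falls after its end could never be fulfilled.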

gcloud

The following command requests a v5p-32 to be created no sooner than 5 hours and 30 minutes from the current time and no later than 9:00 AM on December 14, 2022.

gcloud compute tpus queued-resources create your-queued-resource-id \
    --node-id your-node-id \
    --project your-project-id \
    --zone us-east5-a \
    --accelerator-type v5p-32 \
    --runtime-version v2-alpha-tpuv5 \
    --valid-after-duration 5h30m \
    --valid-until-time 2022-12-14T09:00:00Z

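Command flag descriptions

queued-resource-request-id: The user-assigned ID of the queued resource request.
node-id: The user-defined ID of the TPU that is created in response to the request.
project: The ID of the project in which the queued resource is allocated.
zone: The zone in which to create the Cloud TPU.
accelerator-type: The accelerator type specifies the version and size of the Cloud TPU you want to create. For more information about the accelerator types supported for each TPU version, see TPU versions.
runtime-version: The Cloud TPU software version.
valid-until-time: The time after which the request is canceled. For more information about timestamp formats, see the Google Cloud CLI topic datetimes.
valid-after-duration: The duration that must elapse before the TPU can be provisioned. For more information about duration formats, see the Google Cloud CLI topic datetimes.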

curl

The following command requests a v5p-32 within an explicit interval that starts at 2:30 PM on December 10, 2022 and ends at 9:00 AM on December 14, 2022.

curl -X POST -H "Authorization: Bearer $(gcloud auth print-access-token)" \
-H "Content-Type: application/json" \
-d "{
    'tpu': {
    'node_spec': {
        'parent': 'projects/your-project-number/locations/us-east5-a',
        'node_id': 'your-node-id',
        'node': {
        'accelerator_type': 'v5p-32',
        'runtime_version': 'v2-alpha-tpuv5',
        }
    }
    },
    'queueing_policy': {
    'validInterval': {
        'startTime': '2022-12-10T14:30:00Z',
        'endTime': '2022-12-14T09:00:00Z'
    }
    }
}" \
https://tpu.googleapis.com/v2alpha1/projects/your-project-id/locations/us-east5-a/queuedResources?queued_resource_id=your-queued-resource-id

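Command flag descriptions

queued-resource-request-id: The user-assigned ID of the queued resource request.
node-id: The user-defined ID of the TPU that is created in response to the request.
project: The ID of the project where the queued resource is allocated.
zone: The zone in which to create the Cloud TPU.
accelerator-type: The accelerator type specifies the version and size of the Cloud TPU that you want to create. For more information about the accelerator types supported for each TPU version, see TPU versions.
runtime-version: The Cloud TPU software version.
valid-until-time: The time after which the request is canceled. For more information about date and time formats, see the Google Cloud CLI topic datetimes.
valid-until-duration: The duration for which the request is valid. For more information about duration formats, see the Google Cloud CLI topic datetimes.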

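Console

In the Google Cloud console, go to the TPUs page:

Go to TPUs
Click Create TPU.
In the Name field, enter a name for your TPU.
In the Zone box, select the zone where you want to create the TPU.
In the TPU type box, select an accelerator type. The accelerator type specifies the version and size of the Cloud TPU that you want to create. For more information about the accelerator types supported for each TPU version, see TPU versions.
In the TPU software version box, select a software version. When you create a Cloud TPU VM, the TPU software version specifies the version of the TPU runtime to install. For more information, see TPU software versions.
Click the Enable queueing toggle.
In the Queued resource name field, enter a name for your queued resource request.
In the Start request on field, enter the time after which the resource should be allocated.
In the Cancel request on field, enter the time at which the queued resource request should expire if it has not been fulfilled.
Click Create to create your queued resource request.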
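The relationship between a relative duration such as 5h30m and the absolute startTime and endTime values used in the curl request body can be sketched in Python. This is a minimal illustration that uses only the standard library. The helper names parse_duration and valid_interval are hypothetical, and the parser handles only the simple d/h/m/s form used in this guide, not every format that the Google Cloud CLI accepts.

```python
import re
from datetime import datetime, timedelta, timezone

# Maps a duration suffix to the matching timedelta keyword argument.
_UNITS = {"d": "days", "h": "hours", "m": "minutes", "s": "seconds"}


def parse_duration(text):
    """Parse a simple duration such as '5h30m' into a timedelta (hypothetical helper)."""
    parts = re.findall(r"(\d+)([dhms])", text)
    # Reject input that contains anything the pattern did not consume.
    if not parts or "".join(n + u for n, u in parts) != text:
        raise ValueError(f"unsupported duration: {text!r}")
    total = timedelta()
    for number, unit in parts:
        total += timedelta(**{_UNITS[unit]: int(number)})
    return total


def valid_interval(now, valid_after, valid_until):
    """Build a validInterval dict with UTC timestamps (hypothetical helper)."""
    start = now + parse_duration(valid_after)
    if start >= valid_until:
        raise ValueError("start time must be earlier than end time")
    fmt = "%Y-%m-%dT%H:%M:%SZ"
    return {
        "startTime": start.astimezone(timezone.utc).strftime(fmt),
        "endTime": valid_until.astimezone(timezone.utc).strftime(fmt),
    }


# Example values chosen to reproduce the interval in the curl request above.
now = datetime(2022, 12, 10, 9, 0, tzinfo=timezone.utc)
until = datetime(2022, 12, 14, 9, 0, tzinfo=timezone.utc)
print(valid_interval(now, "5h30m", until))
# {'startTime': '2022-12-10T14:30:00Z', 'endTime': '2022-12-14T09:00:00Z'}
```

Passing the current time in as an argument, rather than reading the clock inside the helper, keeps the calculation reproducible.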

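Request a queued resource with a startup script

You can specify a script that runs on the queued resource after it is provisioned.

gcloud

When you use the gcloud command, you can use the --metadata or --metadata-from-file flag to specify the script commands or a file that contains the script code, respectively. The following example creates a queued resource request that runs the startup-script.sh script.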

gcloud compute tpus queued-resources create your-queued-resource-id \
    --node-id your-node-id \
    --project your-project-id \
    --zone us-central1-a \
    --accelerator-type v5litepod-8 \
    --runtime-version v2-alpha-tpuv5-lite \
    --metadata-from-file='startup-script=startup-script.sh'

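Command flag descriptions

queued-resource-request-id: The user-assigned ID of the queued resource request.
node-id: The user-defined ID of the TPU that is created in response to the request.
project: The ID of the project where the queued resource is allocated.
zone: The zone in which to create the Cloud TPU.
accelerator-type: The accelerator type specifies the version and size of the Cloud TPU that you want to create. For more information about the accelerator types supported for each TPU version, see TPU versions.
runtime-version: The Cloud TPU software version.
validInterval: The period during which the request is valid. After this period, the request is canceled. For more information about duration formats, see the Google Cloud CLI topic datetimes.
metadata-from-file: Specifies a file that contains metadata. If you don't specify a fully qualified path to the metadata file, the command assumes that the file is in the current directory. In this example, the file contains a startup script that runs after the queued resource is provisioned.
metadata: Specifies metadata for the request. In this example, the metadata is a startup script command that runs after the queued resource is provisioned.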

curl

When you use curl, you must include the script code in the JSON content. The following example includes an inline script in the JSON body.

curl -X POST -H "Authorization: Bearer $(gcloud auth print-access-token)" \
-H "Content-Type: application/json" \
-d "{
    'tpu': {
        'node_spec': {
            'parent': 'projects/your-project-number/locations/us-central1-a',
            'node_id': 'your-node-id',
            'node': {
                'accelerator_type': 'v5e-8',
                'runtime_version': 'v2-alpha-tpuv5-lite',
                'metadata': {
                    'startup-script': '#! /bin/bash\npwd > /tmp/out.txt\nwhoami >> /tmp/out.txt'
                }
            }
        }
    },
    'queueing_policy': {
        'validInterval': {
            'startTime': '2022-12-10T14:30:00Z',
            'endTime': '2022-12-14T09:00:00Z'
        }
    }
}" \
https://tpu.googleapis.com/v2alpha1/projects/your-project-id/locations/us-central1-a/queuedResources?queued_resource_id=your-queued-resource-id

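Command flag descriptions

queued-resource-request-id: The user-assigned ID of the queued resource request.
node-id: The user-defined ID of the TPU that is created in response to the request.
project: The ID of the project where the queued resource is allocated.
zone: The zone in which to create the Cloud TPU.
accelerator-type: The accelerator type specifies the version and size of the Cloud TPU that you want to create. For more information about the accelerator types supported for each TPU version, see TPU versions.
runtime-version: The Cloud TPU software version.
validInterval: The period during which the request is valid. After this period, the request is canceled. For more information about duration formats, see the Google Cloud CLI topic datetimes.
metadata-from-file: Specifies a file that contains metadata. If you don't specify a fully qualified path to the metadata file, the command assumes that the file is in the current directory. In this example, the file contains a startup script that runs after the queued resource is provisioned.
metadata: Specifies metadata for the request. In this example, the metadata is a startup script command that runs after the queued resource is provisioned.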

Java

To authenticate to Cloud TPU, set up Application Default Credentials. For more information, see Set up authentication for a local development environment.

import com.google.cloud.tpu.v2alpha1.CreateQueuedResourceRequest;
import com.google.cloud.tpu.v2alpha1.Node;
import com.google.cloud.tpu.v2alpha1.QueuedResource;
import com.google.cloud.tpu.v2alpha1.TpuClient;
import java.io.IOException;
import java.util.HashMap;
import java.util.Map;
import java.util.concurrent.ExecutionException;

public class CreateQueuedResourceWithStartupScript {
  public static void main(String[] args)
      throws IOException, ExecutionException, InterruptedException {
    // TODO(developer): Replace these variables before running the sample.
    // Project ID or project number of the Google Cloud project you want to create a node.
    String projectId = "YOUR_PROJECT_ID";
    // The zone in which to create the TPU.
    // For more information about supported TPU types for specific zones,
    // see https://cloud.google.com/tpu/docs/regions-zones
    String zone = "us-central1-a";
    // The name for your TPU.
    String nodeName = "YOUR_TPU_NAME";
    // The accelerator type that specifies the version and size of the Cloud TPU you want to create.
    // For more information about supported accelerator types for each TPU version,
    // see https://cloud.google.com/tpu/docs/system-architecture-tpu-vm#versions.
    String tpuType = "v5litepod-4";
    // Software version that specifies the version of the TPU runtime to install.
    // For more information see https://cloud.google.com/tpu/docs/runtimes
    String tpuSoftwareVersion = "v2-tpuv5-litepod";
    // The name for your Queued Resource.
    String queuedResourceId = "QUEUED_RESOURCE_ID";

    createQueuedResource(projectId, zone, queuedResourceId, nodeName,
        tpuType, tpuSoftwareVersion);
  }

  // Creates a Queued Resource with startup script.
  public static QueuedResource createQueuedResource(
      String projectId, String zone, String queuedResourceId,
      String nodeName, String tpuType, String tpuSoftwareVersion)
      throws IOException, ExecutionException, InterruptedException {
    String parent = String.format("projects/%s/locations/%s", projectId, zone);
    String startupScriptContent = "#!/bin/bash\necho \"Hello from the startup script!\"";
    // Add startup script to metadata
    Map<String, String> metadata = new HashMap<>();
    metadata.put("startup-script", startupScriptContent);
    String queuedResourceForTpu =  String.format("projects/%s/locations/%s/queuedResources/%s",
            projectId, zone, queuedResourceId);
    // Initialize client that will be used to send requests. This client only needs to be created
    // once, and can be reused for multiple requests.
    try (TpuClient tpuClient = TpuClient.create()) {
      Node node =
          Node.newBuilder()
              .setName(nodeName)
              .setAcceleratorType(tpuType)
              .setRuntimeVersion(tpuSoftwareVersion)
              .setQueuedResource(queuedResourceForTpu)
              .putAllMetadata(metadata)
              .build();

      QueuedResource queuedResource =
          QueuedResource.newBuilder()
              .setName(queuedResourceId)
              .setTpu(
                  QueuedResource.Tpu.newBuilder()
                      .addNodeSpec(
                          QueuedResource.Tpu.NodeSpec.newBuilder()
                              .setParent(parent)
                              .setNode(node)
                              .setNodeId(nodeName)
                              .build())
                      .build())
              .build();

      CreateQueuedResourceRequest request =
          CreateQueuedResourceRequest.newBuilder()
              .setParent(parent)
              .setQueuedResourceId(queuedResourceId)
              .setQueuedResource(queuedResource)
              .build();
      // You can wait until TPU Node is READY,
      // and check its status using getTpuVm() from "tpu_vm_get" sample.

      return tpuClient.createQueuedResourceAsync(request).get();
    }
  }
}

Python

To authenticate to Cloud TPU, set up Application Default Credentials. For more information, see Set up authentication for a local development environment.

from google.cloud import tpu_v2alpha1

# TODO(developer): Update and un-comment below lines
# project_id = "your-project-id"
# zone = "us-central1-a"
# tpu_name = "tpu-name"
# tpu_type = "v5litepod-4"
# runtime_version = "v2-tpuv5-litepod"
# queued_resource_name = "resource-name"

node = tpu_v2alpha1.Node()
node.accelerator_type = tpu_type
# To see available runtime version use command:
# gcloud compute tpus versions list --zone={ZONE}
node.runtime_version = runtime_version
# This startup script updates numpy to the latest version and logs the output to a file.
script = {
    "startup-script": """#!/bin/bash
echo "Hello World" > /var/log/hello.log
sudo pip3 install --upgrade numpy >> /var/log/hello.log 2>&1
"""
}
node.metadata = script
# Enabling external IPs for internet access from the TPU node for updating numpy
node.network_config = tpu_v2alpha1.NetworkConfig(
    enable_external_ips=True,
)

node_spec = tpu_v2alpha1.QueuedResource.Tpu.NodeSpec()
node_spec.parent = f"projects/{project_id}/locations/{zone}"
node_spec.node_id = tpu_name
node_spec.node = node

resource = tpu_v2alpha1.QueuedResource()
resource.tpu = tpu_v2alpha1.QueuedResource.Tpu(node_spec=[node_spec])

request = tpu_v2alpha1.CreateQueuedResourceRequest(
    parent=f"projects/{project_id}/locations/{zone}",
    queued_resource_id=queued_resource_name,
    queued_resource=resource,
)

client = tpu_v2alpha1.TpuClient()
operation = client.create_queued_resource(request=request)

response = operation.result()
print(response.name)
print(response.tpu.node_spec[0].node.metadata)
# Example response:
# projects/[project_id]/locations/[zone]/queuedResources/resource-name
# {'startup-script': '#!/bin/bash\n    echo "Hello World" > /var/log/hello.log\n
# sudo pip3 install --upgrade numpy >> /var/log/hello.log 2>&1\n    '}
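Because the curl variant requires the script to be embedded in the JSON body, quoting and newline escaping are easy to get wrong by hand. The following sketch builds the body with Python's json module instead. The function name build_request_body and the output file name body.json are hypothetical, and the field layout only mirrors the curl example in this section.

```python
import json


def build_request_body(parent, node_id, accelerator_type, runtime_version, script):
    """Return a JSON request body whose node metadata carries the startup script.

    Hypothetical helper: the structure mirrors the curl example in this guide.
    """
    body = {
        "tpu": {
            "node_spec": {
                "parent": parent,
                "node_id": node_id,
                "node": {
                    "accelerator_type": accelerator_type,
                    "runtime_version": runtime_version,
                    # json.dumps escapes quotes and newlines inside the script.
                    "metadata": {"startup-script": script},
                },
            }
        }
    }
    return json.dumps(body, indent=2)


script = "#! /bin/bash\npwd > /tmp/out.txt\nwhoami >> /tmp/out.txt\n"
body = build_request_body(
    parent="projects/your-project-number/locations/us-central1-a",
    node_id="your-node-id",
    accelerator_type="v5litepod-8",
    runtime_version="v2-alpha-tpuv5-lite",
    script=script,
)
print(body)
```

One way to use the result is to write it to a file and pass that file to curl with -d @body.json, which avoids nesting quotes inside a shell string.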

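Request a queued resource with a specified network and subnetwork

In your queued resource request, you can specify the network and subnetwork that you want to connect your TPU to.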

gcloud

gcloud compute tpus queued-resources create your-queued-resource-id \
    --node-id your-node-id \
    --project your-project-id \
    --zone us-central1-a \
    --accelerator-type v5e-8 \
    --runtime-version v2-alpha-tpuv5-lite \
    --network network-name \
    --subnetwork subnetwork-name

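Command parameter descriptions

queued-resource-id: The user-assigned ID of the queued resource request.
node-id: The user-assigned ID of the TPU that is created when the queued resource request is allocated.
project: Your Google Cloud project.
zone: The zone in which to create the Cloud TPU.
accelerator-type: The accelerator type specifies the version and size of the Cloud TPU that you want to create. For more information about the accelerator types supported for each TPU version, see TPU versions.
runtime-version: The Cloud TPU software version.
reserved: Use this flag when you request a queued resource as part of a Cloud TPU reservation.
network: The network that the queued resource is part of.
subnetwork: The subnetwork that the queued resource is part of.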

curl

curl -X POST -H "Authorization: Bearer $(gcloud auth print-access-token)" \
-H "Content-Type: application/json" \
-d "{
    'tpu': {
        'node_spec': {
            'parent': 'projects/your-project-number/locations/us-central1-a',
            'node_id': 'your-node-id',
            'node': {
                'accelerator_type': 'v5e-8',
                'runtime_version': 'v2-alpha-tpuv5-lite',
                'network_config': {
                    'network': 'network-name',
                    'subnetwork': 'subnetwork-name',
                    'enable_external_ips': true
                }
            }
        }
    },
    'guaranteed': {
        'reserved': true
    }
}" \
https://tpu.googleapis.com/v2alpha1/projects/your-project-id/locations/us-central1-a/queuedResources?queued_resource_id=your-queued-resource-id

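Command parameter descriptions

queued-resource-id: The user-assigned ID of the queued resource request.
node-id: The user-assigned ID of the TPU that is created when the queued resource request is allocated.
project: Your Google Cloud project.
zone: The zone in which to create the Cloud TPU.
accelerator-type: The accelerator type specifies the version and size of the Cloud TPU that you want to create. For more information about the accelerator types supported for each TPU version, see TPU versions.
runtime-version: The Cloud TPU software version.
reserved: Use this flag when you request a queued resource as part of a Cloud TPU reservation.
network: The network that the queued resource is part of.
subnetwork: The subnetwork that the queued resource is part of.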

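Console

In the Google Cloud console, go to the TPUs page:

Go to TPUs
Click Create TPU.
In the Name field, enter a name for your TPU.
In the Zone box, select the zone where you want to create the TPU.
In the TPU type box, select an accelerator type. The accelerator type specifies the version and size of the Cloud TPU that you want to create. For more information about the accelerator types supported for each TPU version, see TPU versions.
In the TPU software version box, select a software version. When you create a Cloud TPU VM, the TPU software version specifies the version of the TPU runtime to install. For more information, see TPU software versions.
Click the Enable queueing toggle.
In the Queued resource name field, enter a name for your queued resource request.
Expand the Networking section.
In the Network and Subnetwork fields, select the network and subnetwork that you want to use.
Click Create to create your queued resource request.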

Java

To authenticate to Cloud TPU, set up Application Default Credentials. For more information, see Set up authentication for a local development environment.

import com.google.api.gax.retrying.RetrySettings;
import com.google.cloud.tpu.v2alpha1.CreateQueuedResourceRequest;
import com.google.cloud.tpu.v2alpha1.NetworkConfig;
import com.google.cloud.tpu.v2alpha1.Node;
import com.google.cloud.tpu.v2alpha1.QueuedResource;
import com.google.cloud.tpu.v2alpha1.TpuClient;
import com.google.cloud.tpu.v2alpha1.TpuSettings;
import java.io.IOException;
import java.util.concurrent.ExecutionException;
import org.threeten.bp.Duration;

public class CreateQueuedResourceWithNetwork {
  public static void main(String[] args)
      throws IOException, ExecutionException, InterruptedException {
    // TODO(developer): Replace these variables before running the sample.
    // Project ID or project number of the Google Cloud project you want to create a node.
    String projectId = "YOUR_PROJECT_ID";
    // The zone in which to create the TPU.
    // For more information about supported TPU types for specific zones,
    // see https://cloud.google.com/tpu/docs/regions-zones
    String zone = "europe-west4-a";
    // The name for your TPU.
    String nodeName = "YOUR_TPU_NAME";
    // The accelerator type that specifies the version and size of the Cloud TPU you want to create.
    // For more information about supported accelerator types for each TPU version,
    // see https://cloud.google.com/tpu/docs/system-architecture-tpu-vm#versions.
    String tpuType = "v5litepod-4";
    // Software version that specifies the version of the TPU runtime to install.
    // For more information see https://cloud.google.com/tpu/docs/runtimes
    String tpuSoftwareVersion = "v2-tpuv5-litepod";
    // The name for your Queued Resource.
    String queuedResourceId = "QUEUED_RESOURCE_ID";
    // The name of the network you want the node to connect to.
    // The network should be assigned to your project.
    String networkName = "YOUR_COMPUTE_TPU_NETWORK";

    createQueuedResourceWithNetwork(projectId, zone, queuedResourceId, nodeName,
        tpuType, tpuSoftwareVersion, networkName);
  }

  // Creates a Queued Resource with network configuration.
  public static QueuedResource createQueuedResourceWithNetwork(
      String projectId, String zone, String queuedResourceId, String nodeName,
      String tpuType, String tpuSoftwareVersion, String networkName)
      throws IOException, ExecutionException, InterruptedException {
    // With these settings the client library handles the Operation's polling mechanism
    // and prevent CancellationException error
    TpuSettings.Builder clientSettings =
        TpuSettings.newBuilder();
    clientSettings
        .createQueuedResourceSettings()
        .setRetrySettings(
            RetrySettings.newBuilder()
                .setInitialRetryDelay(Duration.ofMillis(5000L))
                .setRetryDelayMultiplier(2.0)
                .setInitialRpcTimeout(Duration.ZERO)
                .setRpcTimeoutMultiplier(1.0)
                .setMaxRetryDelay(Duration.ofMillis(45000L))
                .setTotalTimeout(Duration.ofHours(24L))
                .build());
    // Initialize client that will be used to send requests. This client only needs to be created
    // once, and can be reused for multiple requests.
    try (TpuClient tpuClient = TpuClient.create(clientSettings.build())) {
      String parent = String.format("projects/%s/locations/%s", projectId, zone);
      String region = zone.substring(0, zone.length() - 2);

      // Specify the network and subnetwork that you want to connect your TPU to.
      NetworkConfig networkConfig =
          NetworkConfig.newBuilder()
              .setEnableExternalIps(true)
              .setNetwork(String.format("projects/%s/global/networks/%s", projectId, networkName))
              .setSubnetwork(
                  String.format(
                      "projects/%s/regions/%s/subnetworks/%s", projectId, region, networkName))
              .build();

      // Create a node
      Node node =
          Node.newBuilder()
              .setName(nodeName)
              .setAcceleratorType(tpuType)
              .setRuntimeVersion(tpuSoftwareVersion)
              .setNetworkConfig(networkConfig)
              .setQueuedResource(
                  String.format(
                      "projects/%s/locations/%s/queuedResources/%s",
                      projectId, zone, queuedResourceId))
              .build();

      // Create queued resource
      QueuedResource queuedResource =
          QueuedResource.newBuilder()
              .setName(queuedResourceId)
              .setTpu(
                  QueuedResource.Tpu.newBuilder()
                      .addNodeSpec(
                          QueuedResource.Tpu.NodeSpec.newBuilder()
                              .setParent(parent)
                              .setNode(node)
                              .setNodeId(nodeName)
                              .build())
                      .build())
              .build();

      CreateQueuedResourceRequest request =
          CreateQueuedResourceRequest.newBuilder()
              .setParent(parent)
              .setQueuedResource(queuedResource)
              .setQueuedResourceId(queuedResourceId)
              .build();

      // You can wait until TPU Node is READY,
      // and check its status using getTpuVm() from "tpu_vm_get" sample.

      return tpuClient.createQueuedResourceAsync(request).get();
    }
  }
}

Python

To authenticate to Cloud TPU, set up Application Default Credentials. For more information, see Set up authentication for a local development environment.

from google.cloud import tpu_v2alpha1

# TODO(developer): Update and un-comment below lines
# project_id = "your-project-id"
# zone = "us-central1-a"
# tpu_name = "tpu-name"
# tpu_type = "v5litepod-4"
# runtime_version = "v2-tpuv5-litepod"
# queued_resource_name = "resource-name"
# network = "default"

node = tpu_v2alpha1.Node()
node.accelerator_type = tpu_type
node.runtime_version = runtime_version
# Setting network configuration
node.network_config = tpu_v2alpha1.NetworkConfig(
    network=network,  # Update if you want to use a specific network
    subnetwork="default",  # Update if you want to use a specific subnetwork
    enable_external_ips=True,
    can_ip_forward=True,
)

node_spec = tpu_v2alpha1.QueuedResource.Tpu.NodeSpec()
node_spec.parent = f"projects/{project_id}/locations/{zone}"
node_spec.node_id = tpu_name
node_spec.node = node

resource = tpu_v2alpha1.QueuedResource()
resource.tpu = tpu_v2alpha1.QueuedResource.Tpu(node_spec=[node_spec])

request = tpu_v2alpha1.CreateQueuedResourceRequest(
    parent=f"projects/{project_id}/locations/{zone}",
    queued_resource_id=queued_resource_name,
    queued_resource=resource,
)

client = tpu_v2alpha1.TpuClient()
operation = client.create_queued_resource(request=request)

response = operation.result()
print(response.name)
print(response.tpu.node_spec[0].node.network_config)
print(resource.tpu.node_spec[0].node.network_config.network == "default")
# Example response:
# network: "default"
# subnetwork: "default"
# enable_external_ips: true
# can_ip_forward: true
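The Java sample in this section passes the network and subnetwork as full resource paths and derives the region from the zone name. As a rough sketch of that same path construction in Python, the following helpers use only string formatting. The names region_from_zone and network_paths are hypothetical, and the separate subnetwork argument is an assumption (the Java sample reuses the network name for the subnetwork).

```python
def region_from_zone(zone):
    """Derive the region from a zone name, for example 'europe-west4-a' -> 'europe-west4'."""
    region, separator, suffix = zone.rpartition("-")
    if not separator or not region or not suffix:
        raise ValueError(f"unexpected zone name: {zone!r}")
    return region


def network_paths(project_id, zone, network, subnetwork):
    """Build the full network and subnetwork paths in the form used by the Java sample."""
    region = region_from_zone(zone)
    return {
        "network": f"projects/{project_id}/global/networks/{network}",
        "subnetwork": f"projects/{project_id}/regions/{region}/subnetworks/{subnetwork}",
    }


paths = network_paths("your-project-id", "europe-west4-a", "network-name", "subnetwork-name")
print(paths["network"])
print(paths["subnetwork"])
```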

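Delete a queued resource request

You can delete a queued resource request and the TPU associated with the request by deleting the queued resource request:

gcloud

Pass the --force flag to the queued-resources delete command: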

gcloud compute tpus queued-resources delete your-queued-resource-id \
    --project your-project-id \
    --zone us-central1-a \
    --force \
    --async

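Command flag descriptions

your-queued-resource-id: The user-assigned ID of the queued resource request.
project: The Google Cloud project where the queued resource is allocated.
zone: The zone of the Cloud TPU to delete.
force: Deletes both the TPU VM and the queued resource request.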

curl

Use the query parameter force=true in your curl request:

curl -X DELETE -H "Authorization: Bearer $(gcloud auth print-access-token)" \
-H "Content-Type: application/json" \
https://tpu.googleapis.com/v2/projects/your-project-id/locations/us-central1-a/queuedResources/your-queued-resource-id?force=true

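Command flag descriptions

your-queued-resource-id: The user-assigned ID of the queued resource request.
project: The Google Cloud project where the queued resource is allocated.
zone: The zone of the Cloud TPU to delete.
force: Deletes both the TPU VM and the queued resource request.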
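The request URL used by the curl command can also be assembled programmatically, which keeps the force query parameter and the resource name correctly encoded. The following is a minimal sketch using only the Python standard library. The function name delete_url is hypothetical, and the base URL is copied from the curl example above.

```python
from urllib.parse import quote, urlencode

# Base URL taken from the curl example in this section.
BASE_URL = "https://tpu.googleapis.com/v2"


def delete_url(project_id, zone, queued_resource_id, force=False):
    """Build the DELETE URL for a queued resource request (hypothetical helper)."""
    name = "projects/{}/locations/{}/queuedResources/{}".format(
        quote(project_id, safe=""),
        quote(zone, safe=""),
        quote(queued_resource_id, safe=""),
    )
    url = f"{BASE_URL}/{name}"
    if force:
        # force=true asks the service to delete the TPU VM together with the request.
        url += "?" + urlencode({"force": "true"})
    return url


print(delete_url("your-project-id", "us-central1-a", "your-queued-resource-id", force=True))
```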

Console

In the Google Cloud console, go to the TPUs page:

Go to TPUs
Click the Queued resources tab.
Select the checkbox next to your queued resource request.
Click Delete.

Java

To authenticate to Cloud TPU, set up Application Default Credentials. For more information, see Set up authentication for a local development environment.

import com.google.api.gax.retrying.RetrySettings;
import com.google.cloud.tpu.v2alpha1.DeleteQueuedResourceRequest;
import com.google.cloud.tpu.v2alpha1.TpuClient;
import com.google.cloud.tpu.v2alpha1.TpuSettings;
import java.io.IOException;
import java.util.concurrent.ExecutionException;
import org.threeten.bp.Duration;

public class DeleteForceQueuedResource {
  public static void main(String[] args)
      throws IOException, ExecutionException, InterruptedException {
    // TODO(developer): Replace these variables before running the sample.
    // Project ID or project number of the Google Cloud project.
    String projectId = "YOUR_PROJECT_ID";
    // The zone in which the TPU was created.
    String zone = "us-central1-f";
    // The name for your Queued Resource.
    String queuedResourceId = "QUEUED_RESOURCE_ID";

    deleteForceQueuedResource(projectId, zone, queuedResourceId);
  }

  // Deletes a Queued Resource asynchronously with --force flag.
  public static void deleteForceQueuedResource(
      String projectId, String zone, String queuedResourceId)
          throws ExecutionException, InterruptedException, IOException {
    String name = String.format("projects/%s/locations/%s/queuedResources/%s",
        projectId, zone, queuedResourceId);
    // With these settings the client library handles the Operation's polling mechanism
    // and prevent CancellationException error
    TpuSettings.Builder clientSettings =
        TpuSettings.newBuilder();
    clientSettings
        .deleteQueuedResourceSettings()
        .setRetrySettings(
            RetrySettings.newBuilder()
                .setInitialRetryDelay(Duration.ofMillis(5000L))
                .setRetryDelayMultiplier(2.0)
                .setInitialRpcTimeout(Duration.ZERO)
                .setRpcTimeoutMultiplier(1.0)
                .setMaxRetryDelay(Duration.ofMillis(45000L))
                .setTotalTimeout(Duration.ofHours(24L))
                .build());

    // Initialize client that will be used to send requests. This client only needs to be created
    // once, and can be reused for multiple requests.
    try (TpuClient tpuClient = TpuClient.create(clientSettings.build())) {
      DeleteQueuedResourceRequest request =
          DeleteQueuedResourceRequest.newBuilder().setName(name).setForce(true).build();
      // Waiting for updates in the library. Until then, the operation will complete successfully,
      // but the user will receive an error message with UnknownException and IllegalStateException.
      tpuClient.deleteQueuedResourceAsync(request).get();

      System.out.printf("Deleted Queued Resource: %s\n", name);
    }
  }
}

Python

To authenticate to Cloud TPU, set up Application Default Credentials. For more information, see Set up authentication for a local development environment.

from google.cloud import tpu_v2alpha1

# TODO(developer): Update and un-comment below lines
# project_id = "your-project-id"
# zone = "us-central1-b"
# queued_resource_name = "resource-name"

client = tpu_v2alpha1.TpuClient()
request = tpu_v2alpha1.DeleteQueuedResourceRequest(
    name=f"projects/{project_id}/locations/{zone}/queuedResources/{queued_resource_name}",
    force=True,  # Set force=True to delete the resource with tpu nodes.
)

try:
    op = client.delete_queued_resource(request=request)
    op.result()
    print(f"Queued resource '{queued_resource_name}' successfully deleted.")
except TypeError as e:
    print(f"Error deleting resource: {e}")
    print(f"Queued resource '{queued_resource_name}' successfully deleted.")

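If you delete the TPU directly, you also need to delete the queued resource, as shown in the following examples. After you delete the TPU, the queued resource request transitions to the SUSPENDED state, and you can then delete the queued resource request.

gcloud

Delete your TPU:

gcloud compute tpus tpu-vm delete your-node-id \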
    --project=your-project-id \
    --zone=us-central1-a \
    --quiet

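Command flag descriptions

project: The Google Cloud project where the queued resource is allocated.
zone: The zone of the Cloud TPU to delete.
your-node-id: The name of the TPU to delete.

After you delete the TPU, the associated queued resource transitions to the SUSPENDING state and then to the SUSPENDED state. When the queued resource is in the SUSPENDED state, you can delete it: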

gcloud compute tpus queued-resources delete your-queued-resource-id \
    --project your-project-id \
    --zone us-central1-a

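Command flag descriptions

queued-resource-request-id: The user-assigned ID of the queued resource request.
project: The Google Cloud project where the queued resource is allocated.
zone: The zone of the Cloud TPU to delete.

curl

Delete your TPU: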

curl -X DELETE -H "Authorization: Bearer $(gcloud auth print-access-token)" \
-H "Content-Type: application/json" \
https://tpu.googleapis.com/v2/projects/your-project-id/locations/us-central1-a/nodes/your-node-id

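Command flag descriptions

project: The Google Cloud project where the queued resource is allocated.
zone: The zone of the Cloud TPU to delete.
your-node-id: The name of the TPU to delete.

After you delete the TPU, the associated queued resource transitions to the SUSPENDING state and then to the SUSPENDED state. When the queued resource is in the SUSPENDED state, you can delete it: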

curl -X DELETE -H "Authorization: Bearer $(gcloud auth print-access-token)" \
-H "Content-Type: application/json" \
https://tpu.googleapis.com/v2/projects/your-project-id/locations/us-central1-a/queuedResources/your-queued-resource-id

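Command flag descriptions

queued-resource-request-id: The user-assigned ID of the queued resource request.
project: The Google Cloud project where the queued resource is allocated.
zone: The zone of the Cloud TPU to delete.

Console

Delete your TPU:

In the Google Cloud console, go to the TPUs page:

Go to TPUs
Select the checkbox next to your TPU.
Click Delete.

After you delete the TPU, the associated queued resource transitions to the Suspending state and then to the Suspended state. When the queued resource is in the Suspended state, you can delete it:

Click the Queued resources tab.
Select the checkbox next to your queued resource request.
Click Delete.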

Java

To authenticate to Cloud TPU, set up Application Default Credentials. For more information, see Set up authentication for a local development environment.

import com.google.api.gax.longrunning.OperationTimedPollAlgorithm;
import com.google.api.gax.retrying.RetrySettings;
import com.google.cloud.tpu.v2.DeleteNodeRequest;
import com.google.cloud.tpu.v2.NodeName;
import com.google.cloud.tpu.v2.TpuClient;
import com.google.cloud.tpu.v2.TpuSettings;
import java.io.IOException;
import java.util.concurrent.ExecutionException;
import org.threeten.bp.Duration;

public class DeleteTpuVm {

  public static void main(String[] args)
      throws IOException, ExecutionException, InterruptedException {
    // TODO(developer): Replace these variables before running the sample.
    // Project ID or project number of the Google Cloud project you want to create a node.
    String projectId = "YOUR_PROJECT_ID";
    // The zone in which to create the TPU.
    // For more information about supported TPU types for specific zones,
    // see https://cloud.google.com/tpu/docs/regions-zones
    String zone = "europe-west4-a";
    // The name for your TPU.
    String nodeName = "YOUR_TPU_NAME";

    deleteTpuVm(projectId, zone, nodeName);
  }

  // Deletes a TPU VM with the specified name in the given project and zone.
  public static void deleteTpuVm(String projectId, String zone, String nodeName)
      throws IOException, ExecutionException, InterruptedException {
    // With these settings the client library handles the Operation's polling mechanism
    // and prevent CancellationException error
    TpuSettings.Builder clientSettings =
        TpuSettings.newBuilder();
    clientSettings
        .deleteNodeOperationSettings()
        .setPollingAlgorithm(
            OperationTimedPollAlgorithm.create(
                RetrySettings.newBuilder()
                    .setInitialRetryDelay(Duration.ofMillis(5000L))
                    .setRetryDelayMultiplier(1.5)
                    .setMaxRetryDelay(Duration.ofMillis(45000L))
                    .setInitialRpcTimeout(Duration.ZERO)
                    .setRpcTimeoutMultiplier(1.0)
                    .setMaxRpcTimeout(Duration.ZERO)
                    .setTotalTimeout(Duration.ofHours(24L))
                    .build()));

    // Initialize client that will be used to send requests. This client only needs to be created
    // once, and can be reused for multiple requests.
    try (TpuClient tpuClient = TpuClient.create(clientSettings.build())) {
      String name = NodeName.of(projectId, zone, nodeName).toString();

      DeleteNodeRequest request = DeleteNodeRequest.newBuilder().setName(name).build();

      tpuClient.deleteNodeAsync(request).get();
      System.out.println("TPU VM deleted");
    }
  }
}

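After you delete the TPU, the associated queued resource transitions to the SUSPENDING state and then to the SUSPENDED state. When the queued resource is in the SUSPENDED state, you can delete it: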

import com.google.cloud.tpu.v2alpha1.DeleteQueuedResourceRequest;
import com.google.cloud.tpu.v2alpha1.TpuClient;
import java.io.IOException;
import java.util.concurrent.ExecutionException;

public class DeleteQueuedResource {
  public static void main(String[] args)
      throws IOException, ExecutionException, InterruptedException {
    // TODO(developer): Replace these variables before running the sample.
    // Project ID or project number of the Google Cloud project.
    String projectId = "YOUR_PROJECT_ID";
    // The zone in which the TPU was created.
    String zone = "us-central1-f";
    // The name for your Queued Resource.
    String queuedResourceId = "QUEUED_RESOURCE_ID";

    deleteQueuedResource(projectId, zone, queuedResourceId);
  }

  // Deletes a Queued Resource asynchronously.
  public static void deleteQueuedResource(String projectId, String zone, String queuedResourceId)
      throws ExecutionException, InterruptedException, IOException {
    String name = String.format("projects/%s/locations/%s/queuedResources/%s",
        projectId, zone, queuedResourceId);
    // Initialize client that will be used to send requests. This client only needs to be created
    // once, and can be reused for multiple requests.
    try (TpuClient tpuClient = TpuClient.create()) {
      // Before deleting the queued resource it is required to delete the TPU VM.
      // For more information about deleting TPU
      // see https://cloud.google.com/tpu/docs/managing-tpus-tpu-vm

      DeleteQueuedResourceRequest request =
              DeleteQueuedResourceRequest.newBuilder().setName(name).build();

      tpuClient.deleteQueuedResourceAsync(request).get();
    }
  }
}

Python

To authenticate to Cloud TPU, set up Application Default Credentials. For more information, see Set up authentication for a local development environment.

from google.cloud import tpu_v2

# TODO(developer): Update and un-comment below lines
# project_id = "your-project-id"
# zone = "us-central1-b"
# tpu_name = "tpu-name"

client = tpu_v2.TpuClient()
try:
    client.delete_node(
        name=f"projects/{project_id}/locations/{zone}/nodes/{tpu_name}"
    )
    print("The TPU node was deleted.")
except Exception as e:
    print(e)

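After you delete the TPU, the associated queued resource transitions to the SUSPENDING state and then to the SUSPENDED state. When the queued resource is in the SUSPENDED state, you can delete it: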

from google.cloud import tpu_v2alpha1

# TODO(developer): Update and un-comment below lines
# project_id = "your-project-id"
# zone = "us-central1-b"
# queued_resource_name = "resource-name"

client = tpu_v2alpha1.TpuClient()
name = (
    f"projects/{project_id}/locations/{zone}/queuedResources/{queued_resource_name}"
)

try:
    op = client.delete_queued_resource(name=name)
    op.result()
    print(f"Queued resource '{queued_resource_name}' successfully deleted.")
except TypeError as e:
    print(f"Error deleting resource: {e}")
    print(f"Queued resource '{queued_resource_name}' successfully deleted.")
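Because the queued resource passes through SUSPENDING before it reaches SUSPENDED, a script that deletes the TPU and then the queued resource typically needs to wait between the two steps. The following sketch shows one way to structure that wait. The function wait_for_state and its parameters are hypothetical, and get_state is a caller-supplied function. How it reads the state from a client library response is left as an assumption, so the example runs against a simulated sequence of states.

```python
import time


def wait_for_state(get_state, target="SUSPENDED", timeout_s=600.0, poll_s=10.0,
                   sleep=time.sleep, clock=time.monotonic):
    """Poll get_state() until it returns target, or raise TimeoutError.

    get_state is any zero-argument callable that returns the state name as a string.
    """
    deadline = clock() + timeout_s
    while True:
        state = get_state()
        if state == target:
            return state
        if clock() >= deadline:
            raise TimeoutError(f"still in state {state!r} after {timeout_s} seconds")
        sleep(poll_s)


# Simulated state sequence standing in for repeated status lookups.
simulated = iter(["ACTIVE", "SUSPENDING", "SUSPENDING", "SUSPENDED"])
final_state = wait_for_state(lambda: next(simulated), sleep=lambda seconds: None)
print(final_state)
# SUSPENDED
```

Injecting sleep and clock keeps the loop testable without real delays. In a real script, the defaults apply and get_state would wrap a status lookup for the queued resource.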

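Retrieve state and diagnostic information about a queued resource request

Retrieve the state and diagnostic information about a queued resource request: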

gcloud

gcloud compute tpus queued-resources describe queued-resource-request-id \
    --project your-project-id \
    --zone us-central1-a

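Command flag descriptions

queued-resource-request-id: The user-assigned ID of the queued resource request.
project: The ID of the project where the queued resource is allocated.
zone: The zone in which to create the Cloud TPU.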

curl

curl -X GET -H "Authorization: Bearer $(gcloud auth print-access-token)" \
-H "Content-Type: application/json" \
https://tpu.googleapis.com/v2/projects/your-project-id/locations/us-central1-a/queuedResources/your-queued-resource-id

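Command flag descriptions

queued-resource-request-id: The user-assigned ID of the queued resource request.
project: The ID of the project where the queued resource is allocated.
zone: The zone in which to create the Cloud TPU.

Console

In the Google Cloud console, go to the TPUs page:

Go to TPUs
Click the Queued resources tab.
Click the name of your queued resource request.

After your TPU is provisioned, you can also view details about the queued resource request by going to the TPUs page, finding your TPU, and clicking the name of the corresponding queued resource request.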

Java

To authenticate to Cloud TPU, set up Application Default Credentials. For more information, see Set up authentication for a local development environment.

import com.google.cloud.tpu.v2alpha1.GetQueuedResourceRequest;
import com.google.cloud.tpu.v2alpha1.QueuedResource;
import com.google.cloud.tpu.v2alpha1.TpuClient;
import java.io.IOException;

public class GetQueuedResource {
  public static void main(String[] args) throws IOException {
    // TODO(developer): Replace these variables before running the sample.
    // Project ID or project number of the Google Cloud project.
    String projectId = "YOUR_PROJECT_ID";
    // The zone in which the TPU was created.
    String zone = "us-central1-f";
    // The name for your Queued Resource.
    String queuedResourceId = "QUEUED_RESOURCE_ID";

    getQueuedResource(projectId, zone, queuedResourceId);
  }

  // Get a Queued Resource.
  public static QueuedResource getQueuedResource(
      String projectId, String zone, String queuedResourceId) throws IOException {
    String name = String.format("projects/%s/locations/%s/queuedResources/%s",
        projectId, zone, queuedResourceId);
    // Initialize client that will be used to send requests. This client only needs to be created
    // once, and can be reused for multiple requests.
    try (TpuClient tpuClient = TpuClient.create()) {
      GetQueuedResourceRequest request =
          GetQueuedResourceRequest.newBuilder().setName(name).build();

      return tpuClient.getQueuedResource(request);
    }
  }
}

Python

To authenticate to Cloud TPU, set up Application Default Credentials. For more information, see Set up authentication for a local development environment.

from google.cloud import tpu_v2alpha1

# TODO(developer): Update and un-comment below lines
# project_id = "your-project-id"
# zone = "us-central1-b"
# queued_resource_name = "resource-name"

client = tpu_v2alpha1.TpuClient()
name = (
    f"projects/{project_id}/locations/{zone}/queuedResources/{queued_resource_name}"
)
resource = client.get_queued_resource(name=name)
print("Resource name:", resource.name)
print(resource.state)
# Example response:
# Resource name: projects/{project_id}/locations/{zone}/queuedResources/resource-name
# State.ACTIVE
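A common follow-up to a single status check is to poll until the request leaves the queue. The following is a minimal sketch of such a loop, not an official sample. The function name `wait_for_settled_state`, the polling interval, and the timeout are all assumptions; the state names come from the state list at the top of this guide. The status lookup is passed in as a callable so that the loop can be tried without a live project.

```python
import time

# States at which this sketch stops waiting (names from the state list
# at the top of this guide).
STOP_STATES = {"ACTIVE", "FAILED", "SUSPENDED"}


def wait_for_settled_state(fetch_state, poll_seconds=30.0, timeout_seconds=3600.0,
                           sleep=time.sleep, clock=time.monotonic):
    """Call fetch_state() until it returns a state in STOP_STATES.

    fetch_state is a caller-supplied callable that returns the current
    state name as a string. Raises TimeoutError if no stop state is seen
    before timeout_seconds elapses.
    """
    deadline = clock() + timeout_seconds
    while True:
        state = fetch_state()
        if state in STOP_STATES:
            return state
        if clock() >= deadline:
            raise TimeoutError(
                f"still in state {state} after {timeout_seconds} seconds"
            )
        sleep(poll_seconds)


if __name__ == "__main__":
    # Simulated state sequence so the sketch runs without a real project.
    simulated = iter(["WAITING_FOR_RESOURCES", "PROVISIONING", "ACTIVE"])
    final = wait_for_settled_state(lambda: next(simulated), sleep=lambda s: None)
    print("Final state:", final)
```

With the client from the sample above, `fetch_state` could be something like `lambda: client.get_queued_resource(name=name).state.state.name`, assuming the nested state enum is exposed that way in your library version.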

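If the request failed, the output contains error information. For a request that is waiting for resources, the output looks similar to the following: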

gcloud

    name: projects/your-project-id/locations/us-central1-a/queuedResources/your-queued-resource-id
    state:
      state: WAITING_FOR_RESOURCES
    tpu:
      nodeSpec:
      - node:
          acceleratorType: v4-8
          bootDisk: {}
          networkConfig:
            enableExternalIps: true
          queuedResource: projects/your-project-number/locations/us-central1-a/queuedResources/your-queued-resource-id
          runtimeVersion: v2-alpha-tpuv5-lite
          schedulingConfig: {}
          serviceAccount: {}
          shieldedInstanceConfig: {}
          useTpuVm: true
        nodeId: your-node-id
        parent: projects/your-project-number/locations/us-central1-a
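When the describe output is consumed by a script, the nested `state.state` field is usually the value of interest. The following is a minimal sketch that assumes the command was run with gcloud's global `--format=json` flag and that the JSON carries the same fields as the output above. The function name `summarize_queued_resource` is hypothetical.

```python
import json


def summarize_queued_resource(describe_json):
    """Extract the name, state, and node IDs from describe output in JSON form.

    Assumes the JSON mirrors the fields shown above: a nested
    state.state value and a tpu.nodeSpec list.
    """
    data = json.loads(describe_json)
    state = data.get("state", {}).get("state", "UNKNOWN")
    node_specs = data.get("tpu", {}).get("nodeSpec", [])
    node_ids = [spec.get("nodeId") for spec in node_specs]
    return {"name": data.get("name"), "state": state, "node_ids": node_ids}


if __name__ == "__main__":
    # Trimmed-down sample that mirrors the output shown above.
    sample = json.dumps({
        "name": "projects/your-project-id/locations/us-central1-a"
                "/queuedResources/your-queued-resource-id",
        "state": {"state": "WAITING_FOR_RESOURCES"},
        "tpu": {"nodeSpec": [{"nodeId": "your-node-id"}]},
    })
    print(summarize_queued_resource(sample))
```

A missing `state` block is reported as `UNKNOWN` rather than raising, which keeps the helper usable on partial output.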

Console

The Status field of the queued resource shows Waiting for resources.

List queued resource requests in your project

To list the queued resource requests in your project:

gcloud

gcloud compute tpus queued-resources list --project your-project-id \
    --zone us-central1-a

Command flag descriptions

project: The Google Cloud project where the queued resource is allocated.
zone: The zone in which you plan to create your Cloud TPU.

curl

curl -X GET -H "Authorization: Bearer $(gcloud auth print-access-token)" \
-H "Content-Type: application/json" \
https://tpu.googleapis.com/v2/projects/your-project-id/locations/your-zone/queuedResources

Command parameter descriptions

project: The Google Cloud project where the queued resource is allocated.
zone: The zone in which you plan to create your Cloud TPU.
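A list call may return its results across several pages. The following sketch shows one way to walk those pages. It assumes that the response uses the `queuedResources` and `nextPageToken` field names and that the next page is requested with a `pageToken` query parameter, following the usual Google API list convention; none of these names appear in the curl example above, so verify them against the API reference. The generator `iter_queued_resources` and the `fetch_page` callable are hypothetical.

```python
def iter_queued_resources(fetch_page):
    """Yield every queued resource across all pages of a list response.

    fetch_page is a caller-supplied callable that takes a page token
    (None for the first page) and returns the decoded JSON response as
    a dict. The field names used here are assumptions.
    """
    page_token = None
    while True:
        page = fetch_page(page_token)
        yield from page.get("queuedResources", [])
        page_token = page.get("nextPageToken")
        if not page_token:
            return


if __name__ == "__main__":
    # Two fake pages keyed by page token, standing in for real HTTP responses.
    fake_pages = {
        None: {"queuedResources": [{"name": "qr-1"}], "nextPageToken": "t1"},
        "t1": {"queuedResources": [{"name": "qr-2"}]},
    }
    for resource in iter_queued_resources(lambda token: fake_pages[token]):
        print(resource["name"])
```

Keeping the HTTP call behind `fetch_page` means the paging logic can be exercised with canned responses before it is pointed at the real endpoint.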

Console

In the Google Cloud console, go to the TPUs page:

Go to TPUs
Click the Queued resources tab.

Java

To authenticate to Cloud TPU, set up Application Default Credentials. For more information, see Set up authentication for a local development environment.

import com.google.cloud.tpu.v2alpha1.ListQueuedResourcesRequest;
import com.google.cloud.tpu.v2alpha1.QueuedResource;
import com.google.cloud.tpu.v2alpha1.TpuClient;
import com.google.cloud.tpu.v2alpha1.TpuClient.ListQueuedResourcesPage;
import java.io.IOException;

public class ListQueuedResources {
  public static void main(String[] args) throws IOException {
    // TODO(developer): Replace these variables before running the sample.
    // Project ID or project number of the Google Cloud project.
    String projectId = "YOUR_PROJECT_ID";
    // The zone in which the TPU was created.
    String zone = "us-central1-a";

    listQueuedResources(projectId, zone);
  }

  // List Queued Resources.
  public static ListQueuedResourcesPage listQueuedResources(
      String projectId, String zone) throws IOException {
    String parent = String.format("projects/%s/locations/%s", projectId, zone);
    // Initialize client that will be used to send requests. This client only needs to be created
    // once, and can be reused for multiple requests.
    try (TpuClient tpuClient = TpuClient.create()) {
      ListQueuedResourcesRequest request =
          ListQueuedResourcesRequest.newBuilder().setParent(parent).build();
      ListQueuedResourcesPage response = tpuClient.listQueuedResources(request).getPage();

      for (QueuedResource queuedResource : response.iterateAll()) {
        System.out.println(queuedResource.getName());
      }
      return response;
    }
  }
}

Python

To authenticate to Cloud TPU, set up Application Default Credentials. For more information, see Set up authentication for a local development environment.

from google.cloud import tpu_v2alpha1

# TODO(developer): Update and un-comment below lines
# project_id = "your-project-id"
# zone = "us-central1-b"

client = tpu_v2alpha1.TpuClient()
parent = f"projects/{project_id}/locations/{zone}"
resources = client.list_queued_resources(parent=parent)
for resource in resources:
    print("Resource name:", resource.name)
    print("TPU id:", resource.tpu.node_spec[0].node_id)
# Example response:
# Resource name: projects/{project_id}/locations/{zone}/queuedResources/resource-name
# TPU id: tpu-name
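When a project holds many requests, grouping the listed names by state makes it easier to see which requests are still waiting. The following is a minimal sketch that works on plain `(name, state)` pairs; the function name `group_by_state` is hypothetical, and how the pairs are built from the client response is left to the caller.

```python
from collections import defaultdict


def group_by_state(resources):
    """Group queued resource names by state.

    resources is an iterable of (name, state) pairs, where state is a
    state name such as "ACTIVE" or "WAITING_FOR_RESOURCES".
    """
    groups = defaultdict(list)
    for name, state in resources:
        groups[state].append(name)
    return dict(groups)


if __name__ == "__main__":
    # Illustrative pairs, not real output.
    listed = [
        ("qr-1", "ACTIVE"),
        ("qr-2", "WAITING_FOR_RESOURCES"),
        ("qr-3", "ACTIVE"),
    ]
    for state, names in group_by_state(listed).items():
        print(state, names)
```

With the list sample above, the pairs could be built with something like `(r.name, r.state.state.name) for r in resources`, assuming the nested state enum is exposed that way in your library version.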