Google Cloud 使用配额来帮助确保公平性并减少资源使用和可用性的激增。配额用于限制您的 Google Cloud 项目可使用的Google Cloud 资源的数量。配额适用于一系列资源类型,包括硬件、软件和网络组件。例如,配额可以限制对某项服务的 API 调用次数、您的项目并发使用的负载均衡器数量或者您可以创建的项目数量。配额可以防止服务过载,从而保护Google Cloud 用户社区。配额还可以帮助您管理自己的 Google Cloud 资源。
您会收到一封邮件,告知您的配额申请是得到批准还是被拒绝。Google Cloud 服务配额增加会逐步进行。这可能会导致在不同区域或资源中持续进行发布。在发布期间, Google Cloud 控制台或 Cloud Quotas API 中显示的配额值不会体现增加后的新配额值,直到发布完成为止。如需了解详情,请参阅查看正在进行的发布。
[[["易于理解","easyToUnderstand","thumb-up"],["解决了我的问题","solvedMyProblem","thumb-up"],["其他","otherUp","thumb-up"]],[["很难理解","hardToUnderstand","thumb-down"],["信息或示例代码不正确","incorrectInformationOrSampleCode","thumb-down"],["没有我需要的信息/示例","missingTheInformationSamplesINeed","thumb-down"],["翻译问题","translationIssue","thumb-down"],["其他","otherDown","thumb-down"]],["最后更新时间 (UTC):2025-08-28。"],[],[],null,["# Cloud TPU quotas\n================\n\nThis document lists the quotas that apply to Cloud TPU. For information\nabout Cloud TPU pricing, see [Cloud TPU pricing](/tpu/pricing).\n\nGoogle Cloud uses quotas to help ensure fairness and reduce\nspikes in resource use and availability. A quota restricts how much of a\nGoogle Cloud resource your Google Cloud project can use. Quotas\napply to a range of resource types, including hardware, software, and network\ncomponents. For example, quotas can restrict the number of API calls to a\nservice, the number of load balancers used concurrently by your project, or the\nnumber of projects that you can create. Quotas protect the community of\nGoogle Cloud users by preventing the overloading of services. Quotas also\nhelp you to manage your own Google Cloud resources.\n\nThe Cloud Quotas system does the following:\n\n- Monitors your consumption of Google Cloud products and services\n- Restricts your consumption of those resources\n- Provides a way to [request changes to the quota value](/docs/quotas/help/request_increase) and [automate quota adjustments](/docs/quotas/quota-adjuster)\n\nIn most cases, when you attempt to consume more of a resource than its quota\nallows, the system blocks access to the resource, and the task that\nyou're trying to perform fails.\n\nQuotas generally apply at the Google Cloud project\nlevel. Your use of a resource in one project doesn't affect\nyour available quota in another project. Within a Google Cloud project, quotas\nare shared across all applications and IP addresses.\n| **Important:** This document describes Cloud TPU API quota. If you are [creating TPU slices in GKE](/kubernetes-engine/docs/how-to/tpus#create-node-pool), you use Compute Engine API quota, which is a different quota. For more information about quotas you use for TPUs in GKE, see [Ensure sufficient quota](/kubernetes-engine/docs/how-to/tpus#ensure-quota)\n\nTPU quota\n---------\n\nTPU quotas are limits on the number of Cloud TPU cores you can use with a\nGoogle Cloud project. Each version of TPU is associated with its own quota. In\naddition, each Cloud TPU version quota is divided into on-demand quota and\n[preemptible](/tpu/docs/preemptible) (or spot) quota.\n| **Note:** For more information about TPU chips and TensorCores, see [TPU System\n| architecture](/tpu/docs/system-architecture-tpu-vm).\n\nWhen you create Cloud TPU resources, by default you are creating on-demand\nresources. You can create preemptible resources using the `--spot` parameter\nwhen creating resources with the `gcloud` command. For more information, see\n[Manage TPU resources](/tpu/docs/managing-tpus-tpu-vm).\n\nDefault Cloud TPU quotas\n------------------------\n\nThe following tables show default values for on-demand and preemptible quota for\neach [available zone](/tpu/docs/regions-zones). These values specify the maximum\nnumber of TPU cores you can use within your project. \n\n### Preemptible quotas\n\n### On-demand quotas\n\n| **Note:** All requests for quota in us-central2-b (currently, only TPU v4) need to be manually approved by Google. No default quota is granted. If you require quota in us-central2-b, request quota for v4 using the same process as described in the following section.\n\nView and request additional quota\n---------------------------------\n\nYou can view the quota allocated for your Google Cloud project on the\n[Quotas page in the Google Cloud console](https://console.cloud.google.com/apis/api/tpu.googleapis.com/quotas).\n\nYou can request additional Cloud TPU quota, from the [Quotas page](https://console.cloud.google.com/iam-admin/quotas).\nFind the quota you wish to increase, click the three vertical dots and choose\n**Edit quota** . For more information, see [Request a quota adjustment](/docs/quotas/view-manage#requesting_higher_quota).\nIf you request quota below the auto-approve threshold, your request will be\nautomatically approved. \n\n### Preemptible quotas\n\n### On-demand quotas\n\nYou will receive an email stating if your quota request is approved or denied.\nGoogle Cloud service quota increases take place gradually. This might result in\nongoing rollouts across different regions or resources. During the rollout, the\nquota value that appears in the Google Cloud console or Cloud Quotas API\nwon't reflect the new, increased quota value until the rollout completes. For\nmore information, see [View ongoing\nrollouts](/docs/quotas/view-ongoing-rollouts)."]]