If the number of your requests exceeds the capacity allocated to process
requests, then error code With a Provisioned Throughput subscription, you can reserve an
amount of throughput for specific generative AI models. If you don't have a
Provisioned Throughput subscription and resources aren't available
to your application, then an error code For projects that have purchased Provisioned Throughput,
Vertex AI measures a project's throughput and reserves the purchased
amount of throughput for the project's actual usage. For standard Provisioned Throughput, when you use less than your
purchased amount, errors that might otherwise be On the pay-as-you-go quota framework, you have the following options to
resolving To correct the 429 error generated by Provisioned Throughput, do the
following:429
is returned. The following table displays the
error message generated by each type of quota framework:
Quota framework
Message
Pay-as-you-go
Resource exhausted, please try again later.
Provisioned Throughput
Too many requests. Exceeded the Provisioned Throughput.
429
is returned. Although you don't
have reserved capacity, you can try your request again. However, the request
isn't counted against your error rate as described in your service level
agreement (SLA).429
are returned as 5XX
and
count toward the SLA error rate. For Single Zone Provisioned Throughput,
when you use less than your purchased amount, capacity-related 429
errors are
treated as 5XX
but don't count toward the SLA error rate. When you exceed your
purchased amount, the additional requests are processed on-demand as pay-as-you-go.Pay-as-you-go
429
errors:
Provisioned Throughput
What's next
Error code 429
Except as otherwise noted, the content of this page is licensed under the Creative Commons Attribution 4.0 License, and code samples are licensed under the Apache 2.0 License. For details, see the Google Developers Site Policies. Java is a registered trademark of Oracle and/or its affiliates.
Last updated 2025-08-21 UTC.