The number of Knative serving resources is limited by the
configuration of the cluster as well as other dependencies. The following limits
are recommended limits for a properly scaled Kubernetes Engine cluster.
Other resource limitations are imposed by the configuration of the Kubernetes
Engine cluster that the services are running in. For example, you cannot request
more memory than is available in the nodes in the cluster.
[[["Easy to understand","easyToUnderstand","thumb-up"],["Solved my problem","solvedMyProblem","thumb-up"],["Other","otherUp","thumb-up"]],[["Hard to understand","hardToUnderstand","thumb-down"],["Incorrect information or sample code","incorrectInformationOrSampleCode","thumb-down"],["Missing the information/samples I need","missingTheInformationSamplesINeed","thumb-down"],["Other","otherDown","thumb-down"]],["Last updated 2025-04-17 UTC."],[[["Knative serving is subject to Google Kubernetes Engine quotas and limits."],["The maximum number of services per cluster is limited to 150."],["The maximum number of revisions per cluster is limited to 300."],["The maximum timeout duration per request is 24 hours for version 0.16.0-gke.1 and later, or 900 seconds for version 0.15.0-gke.3 and earlier."],["Additional resource limitations depend on the configuration of the Kubernetes Engine cluster, such as available memory."]]],[]]