The number of Knative serving resources is limited by the
configuration of the cluster as well as other dependencies. The following limits
are recommended limits for a properly scaled Kubernetes Engine cluster.
Other resource limitations are imposed by the configuration of the Kubernetes
Engine cluster that the services are running in. For example, you cannot request
more memory than is available in the nodes in the cluster.
[[["Easy to understand","easyToUnderstand","thumb-up"],["Solved my problem","solvedMyProblem","thumb-up"],["Other","otherUp","thumb-up"]],[["Hard to understand","hardToUnderstand","thumb-down"],["Incorrect information or sample code","incorrectInformationOrSampleCode","thumb-down"],["Missing the information/samples I need","missingTheInformationSamplesINeed","thumb-down"],["Other","otherDown","thumb-down"]],["Last updated 2025-08-25 UTC."],[[["\u003cp\u003eKnative serving is subject to Google Kubernetes Engine quotas and limits.\u003c/p\u003e\n"],["\u003cp\u003eThe maximum number of services per cluster is limited to 150.\u003c/p\u003e\n"],["\u003cp\u003eThe maximum number of revisions per cluster is limited to 300.\u003c/p\u003e\n"],["\u003cp\u003eThe maximum timeout duration per request is 24 hours for version 0.16.0-gke.1 and later, or 900 seconds for version 0.15.0-gke.3 and earlier.\u003c/p\u003e\n"],["\u003cp\u003eAdditional resource limitations depend on the configuration of the Kubernetes Engine cluster, such as available memory.\u003c/p\u003e\n"]]],[],null,["# Quotas and Limits\n\nThis page contains usage quota and limits that apply when using\nKnative serving.\n\nKnative serving is subject to the\n[Google Kubernetes Engine quotas and limits](/kubernetes-engine/quotas).\n\nThe number of Knative serving resources is limited by the\nconfiguration of the cluster as well as other dependencies. The following limits\nare recommended limits for a properly scaled Kubernetes Engine cluster.\n\nOther resource limitations are imposed by the configuration of the Kubernetes\nEngine cluster that the services are running in. For example, you cannot request\nmore memory than is available in the nodes in the cluster."]]