prediction.aiplatform.gdc.goog/v1
Package v1 contains API Schema definitions for the prediction.aiplatform.gdc.goog v1 API group.
Autoscaling
Defines the autoscaling parameters for a deployment.
Appears in:
| Field | Description | 
|---|---|
| minReplicainteger | Number of minimum replicas. The default value is 1. The next tag is5. | 
| maxReplicainteger | Number of maximum replicas. | 
| cpuTargetinteger | The threshold of CPU usage for scaling up a pod. | 
| gpuDutyCycleTargetinteger | The threshold of GPU duty-cycle utilization for scaling up a pod. | 
DedicatedResources
Defines the resources that are dedicated to a resource pool. The next ID is 8.
Appears in:
| Field | Description | 
|---|---|
| machineSpec | Not required. Specifies the configuration of a single machine using the machineTypevalue. If not provided, a default value is used. For Prediction, the default machine type for a deployment isn1-standard-2(GKE-based models) orn2-standard-2-gdc(CPU-based models). | 
| autoscalingAutoscaling | Specifies the autoscaling parameters for the user workloads, for example, the predictor deployment for prediction. | 
DeployedModel
Defines the Schema for the DeployedModels API.
Appears in:
| Field | Description | 
|---|---|
| apiVersionstring | prediction.aiplatform.gdc.goog/v1 | 
| kindstring | DeployedModel | 
| metadataObjectMeta | Refer to Kubernetes API documentation for fields of metadata. | 
| specDeployedModelSpec | |
| statusDeployedModelStatus | 
DeployedModelList
Contains a list of DeployedModel resources.
| Field | Description | 
|---|---|
| apiVersionstring | prediction.aiplatform.gdc.goog/v1 | 
| kindstring | DeployedModelList | 
| metadataListMeta | Refer to Kubernetes API documentation for fields of metadata. | 
| itemsDeployedModel array | 
DeployedModelSpec
Defines the expected state of DeployedModel resources.
Appears in:
| Field | Description | 
|---|---|
| endpointPathstring | Specifies the resource name of the endpoint. The format is projects/{project}/locations/{location}/endpoints/{endpoint-id}. The next tag is6. | 
| modelSpecinvalid type | Defines the model specification needed when deploying the model. | 
| resourcePoolRefObjectReference | Specifies the reference of the resource pool with the resource specifications required for this DeployedModel. | 
| sharesResourcePoolboolean | Specifies if the DeployedModelshares a resource pool with other models. | 
DeployedModelStatus
Defines the observed state of the DeployedModel resource.
Appears in:
| Field | Description | 
|---|---|
| readyboolean | Indicates whether the resource is in a ready state. | 
| primaryCondition | Represents the primary condition of a resource. If the resource is ready, then the condition indicates the resource is ready. Otherwise, the condition is the primary reason why the resource is not ready. | 
| resourceConditionsarray | Represents a collection of conditions for a resource and its subresources. You can use it to determine the overall health of a resource and its subresources. | 
| conditionsCondition array | Represents raw resource conditions populated from Kubernetes resources for debugging purposes. | 
| routesRoutes | Represents the container or system routes for the deployed model prediction or health check. | 
| portsPorts | Represents the container HTTP or gRPC ports. | 
| rpcStatusRpcStatus | Indicates a canonical RPC representation of the deployed model's primary condition. | 
| observedGenerationinteger | Indicates the revision of the resource that was most recently reconciled. | 
Ports
Appears in:
| Field | Description | 
|---|---|
| predictorPortsinteger array | Lists the HTTP ports to expose from the predictor. Requests aren't forwarded to ports other than the first one listed. This field corresponds to the portsfield of the Kubernetes Containers v1 core API. | 
| predictorGRPCPortsinteger array | Lists the gRPC ports to expose from the predictor. If this field is omitted, then the gRPC requests to the container are disabled. Requests aren't forwarded to ports other than the first one listed. This field corresponds to the portsfield of the Kubernetes Containers v1 core API. | 
ReplicaStatus
Defines the replica information of the ResourcePool resource.
Appears in:
| Field | Description | 
|---|---|
| resourceTypeResourceType | Specifies the type of the resource. | 
| resourceNamestring | Indicates the unique resource name in the namespace of the resource type. For example, a DeploymentResourceTypehas the Kubernetes deployment name as its resource name. | 
| resourceNamespacestring | Indicates the namespace of the resource. This field is not applicable for cluster-scoped resources. | 
| appTypestring | Indicates the application type of the resource, defined by a specific product. For example, Prediction supports predictorandexplainerasappType. | 
| availableReplicasinteger | Indicates the total number of available replicas. For Kubernetes deployment resource type, this field represents the total number of available pods, ready for at least a minimum of ready seconds, targeted by this deployment. | 
| unavailableReplicasinteger | Indicates the total number of unavailable replicas. For Kubernetes deployment resource type, it represents the total number of unavailable pods targeted by the deployment. | 
ResourcePool
Defines the Schema for the ResourcePools API.
Appears in:
| Field | Description | 
|---|---|
| apiVersionstring | prediction.aiplatform.gdc.goog/v1 | 
| kindstring | ResourcePool | 
| metadataObjectMeta | Refer to Kubernetes API documentation for fields of metadata. | 
| specResourcePoolSpec | |
| statusResourcePoolStatus | 
ResourcePoolList
Contains a list of ResourcePool resources.
| Field | Description | 
|---|---|
| apiVersionstring | prediction.aiplatform.gdc.goog/v1 | 
| kindstring | ResourcePoolList | 
| metadataListMeta | Refer to Kubernetes API documentation for fields of metadata. | 
| itemsResourcePool array | 
ResourcePoolSpec
Defines the expected state of ResourcePool resources.
Appears in:
| Field | Description | 
|---|---|
| resourcePoolIDstring | Represents the system-generated ID of the ResourcePoolresource. This field is only applicable for Google Cloud and GDCE. The next tag is8. | 
| userProvidedIDstring | Represents the user-provided ID of the ResourcePoolresource. This field is only applicable for Google Cloud and GDCE. | 
| dedicatedResourcesDedicatedResources | Contains a description of resources that are dedicated to the resource pool. | 
| enableContainerLoggingboolean | Indicates whether or not container logging is enabled for the ResourcePool. | 
| userGsastring | Indicates the user-provided IAM service account in the user project. If not specified, the default serving service account is used. | 
| customKsaNamestring | Specifies the custom name that the operator creates and the user workload uses for bring your own service account (BYOSA) cases only. If empty, the system uses the default naming pattern. | 
| rolloutStrategyinvalid type | Specifies whether models deployed to this pool must be rolled out to the model server replicas gradually or all at once. | 
ResourcePoolStatus
Defines the observed state of ResourcePool resources.
Appears in:
| Field | Description | 
|---|---|
| readyboolean | Indicates whether the resource is in a ready state. | 
| primaryCondition | Represents the primary condition of a resource. If the resource is ready, then the condition indicates the resource is ready. Otherwise, the condition is the primary reason why the resource is not ready. | 
| resourceConditions | Represents a collection of conditions for a resource and its subresources. You can use it to determine the overall health of a resource and its subresources. | 
| conditionsCondition array | Represents raw resource conditions populated from Kubernetes resources for debugging purposes. | 
| replicaStatusesReplicaStatus array | |
| rpcStatusRpcStatus | Indicates a canonical RPC representation of the ResourcePool's primary condition. | 
| observedGenerationinteger | Indicates what revision of the resource was most recently reconciled. | 
Routes
Appears in:
| Field | Description | 
|---|---|
| predictRoutestring | Represents the routing path on the container to send prediction requests. Vertex AI forwards requests using projects.locations.endpoints.predictto this path on the container's IP address and port. Vertex AI then returns the container's response in the API response. | 
| predictSystemRoutestring | Represents the system routing path to send prediction requests to the cluster ingress. This field is populated internally only when it is copied to the deployedModelduring deployment. | 
| healthRoutestring | Represents the routing path on the container to send health checks. Vertex AI intermittently sends GET requests to this path on the container's IP address and port to check that the container is healthy. | 
| healthSystemRoutestring | Represents the system routing path to send health check requests to the cluster ingress. This field is populated internally only when it is copied to the deployedModelduring deployment. | 
RpcStatus
Encapsulates an RPC code and a message.
Appears in:
| Field | Description | 
|---|---|
| codeCode | Represents the RPC code. The next tag is 4. | 
| messagestring | Contains a user-facing description of the condition. | 
| terminalStateboolean | Indicates a value of trueif the resource has reached a terminal state and it cannot become ready. |