A description of resources that are used for performing batch operations, are dedicated to a Model, and need manual configuration.
machineSpec
object
Required. Immutable. The specification of a single machine.
startingReplicaCount
integer
Immutable. The number of machine replicas used at the start of the batch operation. If not set, Vertex AI chooses the starting number, which is not greater than maxReplicaCount.
maxReplicaCount
integer
Immutable. The maximum number of machine replicas the batch operation may be scaled to. The default value is 10.
Optional. Immutable. If set, a Dynamic Workload Scheduler (DWS) resource is used to schedule the deployment workload. Reference: https://cloud.google.com/blog/products/compute/introducing-dynamic-workload-scheduler
spot
boolean
Optional. If true, schedule the deployment workload on spot VMs.
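
The fields described above can be assembled into a request fragment as in the following minimal sketch. It is illustrative only: `build_batch_dedicated_resources` is a hypothetical helper, not part of any Vertex AI client library, and the `machineType` key and the `n1-standard-4` value inside `machineSpec` are assumptions made for the example. The client-side check that `startingReplicaCount` does not exceed `maxReplicaCount` is also an assumption, inferred from the field descriptions rather than stated as a rule by the service.

```python
import json
from typing import Any, Dict, Optional

# Default taken from the maxReplicaCount field description.
DEFAULT_MAX_REPLICA_COUNT = 10


def build_batch_dedicated_resources(
    machine_spec: Dict[str, Any],
    starting_replica_count: Optional[int] = None,
    max_replica_count: Optional[int] = None,
    spot: Optional[bool] = None,
) -> Dict[str, Any]:
    """Hypothetical helper that builds a dict shaped like the fields above."""
    # machineSpec is documented as required.
    if not machine_spec:
        raise ValueError("machineSpec is required")

    # Assumed sanity check: compare against the explicit maximum, or the
    # documented default of 10 when no maximum is given.
    effective_max = (
        DEFAULT_MAX_REPLICA_COUNT if max_replica_count is None else max_replica_count
    )
    if starting_replica_count is not None and starting_replica_count > effective_max:
        raise ValueError(
            f"startingReplicaCount ({starting_replica_count}) exceeds "
            f"maxReplicaCount ({effective_max})"
        )

    # Only emit optional fields that were explicitly set, so the service
    # can apply its own defaults to the rest.
    resources: Dict[str, Any] = {"machineSpec": dict(machine_spec)}
    if starting_replica_count is not None:
        resources["startingReplicaCount"] = starting_replica_count
    if max_replica_count is not None:
        resources["maxReplicaCount"] = max_replica_count
    if spot is not None:
        resources["spot"] = spot
    return resources


if __name__ == "__main__":
    body = build_batch_dedicated_resources(
        {"machineType": "n1-standard-4"},  # assumed example value
        starting_replica_count=2,
        max_replica_count=5,
    )
    print(json.dumps(body, indent=2))
```

Because every field is marked Immutable, validating the values before the request is sent is cheaper than discovering a mistake after the batch operation has been created.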
JSON representation |
---|
{ "machineSpec": { object ( |