The metric specification that defines the target resource
utilization (CPU utilization, accelerator's duty cycle, and so
on) for calculating the desired replica count.
Attributes
Name
Description
metric_name
str
Required. The resource metric name. Supported metrics:
- For Online Prediction:
- aiplatform.googleapis.com/prediction/online/accelerator/duty_cycle
- aiplatform.googleapis.com/prediction/online/cpu/utilization
target
int
The target resource utilization in percentage
(1% - 100%) for the given metric; once the real
usage deviates from the target by a certain
percentage, the machine replicas change. The
default value is 60 (representing 60%) if not
provided.
[[["Easy to understand","easyToUnderstand","thumb-up"],["Solved my problem","solvedMyProblem","thumb-up"],["Other","otherUp","thumb-up"]],[["Hard to understand","hardToUnderstand","thumb-down"],["Incorrect information or sample code","incorrectInformationOrSampleCode","thumb-down"],["Missing the information/samples I need","missingTheInformationSamplesINeed","thumb-down"],["Other","otherDown","thumb-down"]],["Last updated 2024-11-19 UTC."],[],[]]