Vertex AI V1 API - Class Google::Cloud::AIPlatform::V1::Featurestore::OnlineServingConfig::Scaling (v0.46.0)

Reference documentation and code samples for the Vertex AI V1 API class Google::Cloud::AIPlatform::V1::Featurestore::OnlineServingConfig::Scaling.

Online serving scaling configuration. If min_node_count and max_node_count are set to the same value, the cluster will be configured with the fixed number of node (no auto-scaling).

Inherits

  • Object

Extended By

  • Google::Protobuf::MessageExts::ClassMethods

Includes

  • Google::Protobuf::MessageExts

Methods

#cpu_utilization_target

def cpu_utilization_target() -> ::Integer
Returns
  • (::Integer) — Optional. The cpu utilization that the Autoscaler should be trying to achieve. This number is on a scale from 0 (no utilization) to 100 (total utilization), and is limited between 10 and 80. When a cluster's CPU utilization exceeds the target that you have set, Bigtable immediately adds nodes to the cluster. When CPU utilization is substantially lower than the target, Bigtable removes nodes. If not set or set to 0, default to 50.

#cpu_utilization_target=

def cpu_utilization_target=(value) -> ::Integer
Parameter
  • value (::Integer) — Optional. The cpu utilization that the Autoscaler should be trying to achieve. This number is on a scale from 0 (no utilization) to 100 (total utilization), and is limited between 10 and 80. When a cluster's CPU utilization exceeds the target that you have set, Bigtable immediately adds nodes to the cluster. When CPU utilization is substantially lower than the target, Bigtable removes nodes. If not set or set to 0, default to 50.
Returns
  • (::Integer) — Optional. The cpu utilization that the Autoscaler should be trying to achieve. This number is on a scale from 0 (no utilization) to 100 (total utilization), and is limited between 10 and 80. When a cluster's CPU utilization exceeds the target that you have set, Bigtable immediately adds nodes to the cluster. When CPU utilization is substantially lower than the target, Bigtable removes nodes. If not set or set to 0, default to 50.

#max_node_count

def max_node_count() -> ::Integer
Returns
  • (::Integer) — The maximum number of nodes to scale up to. Must be greater than min_node_count, and less than or equal to 10 times of 'min_node_count'.

#max_node_count=

def max_node_count=(value) -> ::Integer
Parameter
  • value (::Integer) — The maximum number of nodes to scale up to. Must be greater than min_node_count, and less than or equal to 10 times of 'min_node_count'.
Returns
  • (::Integer) — The maximum number of nodes to scale up to. Must be greater than min_node_count, and less than or equal to 10 times of 'min_node_count'.

#min_node_count

def min_node_count() -> ::Integer
Returns
  • (::Integer) — Required. The minimum number of nodes to scale down to. Must be greater than or equal to 1.

#min_node_count=

def min_node_count=(value) -> ::Integer
Parameter
  • value (::Integer) — Required. The minimum number of nodes to scale down to. Must be greater than or equal to 1.
Returns
  • (::Integer) — Required. The minimum number of nodes to scale down to. Must be greater than or equal to 1.