Class ClusterConfig (2.2.0)

ClusterConfig(mapping=None, *, ignore_unknown_fields=False, **kwargs)

The cluster config.

Attributes

NameDescription
config_bucket str
Optional. A Cloud Storage bucket used to stage job dependencies, config files, and job driver console output. If you do not specify a staging bucket, Cloud Dataproc will determine a Cloud Storage location (US, ASIA, or EU) for your cluster's staging bucket according to the Compute Engine zone where your cluster is deployed, and then create and manage this project-level, per-location bucket (see `Dataproc staging bucket
gce_cluster_config .gcd_clusters.GceClusterConfig
Optional. The shared Compute Engine config settings for all instances in a cluster.
master_config .gcd_clusters.InstanceGroupConfig
Optional. The Compute Engine config settings for the master instance in a cluster.
worker_config .gcd_clusters.InstanceGroupConfig
Optional. The Compute Engine config settings for worker instances in a cluster.
secondary_worker_config .gcd_clusters.InstanceGroupConfig
Optional. The Compute Engine config settings for additional worker instances in a cluster.
software_config .gcd_clusters.SoftwareConfig
Optional. The config settings for software inside the cluster.
lifecycle_config .gcd_clusters.LifecycleConfig
Optional. The config setting for auto delete cluster schedule.
initialization_actions Sequence[.gcd_clusters.NodeInitializationAction]
Optional. Commands to execute on each node after config is completed. By default, executables are run on master and all worker nodes. You can test a node's role metadata to run an executable on a master or worker node, as shown below using curl (you can also use wget): :: ROLE=$(curl -H Metadata-Flavor:Google http://metadata/computeMetadata/v1beta2/instance/attributes/dataproc-role) if [[ "${ROLE}" == 'Master' ]]; then ... master specific actions ... else ... worker specific actions ... fi
encryption_config .gcd_clusters.EncryptionConfig
Optional. Encryption settings for the cluster.
autoscaling_config .gcd_clusters.AutoscalingConfig
Optional. Autoscaling config for the policy associated with the cluster. Cluster does not autoscale if this field is unset.
endpoint_config .gcd_clusters.EndpointConfig
Optional. Port/endpoint configuration for this cluster
security_config .gcd_clusters.SecurityConfig
Optional. Security related configuration.
gke_cluster_config .gcd_clusters.GkeClusterConfig
Optional. The Kubernetes Engine config for Dataproc clusters deployed to Kubernetes. Setting this is considered mutually exclusive with Compute Engine-based options such as gce_cluster_config, master_config, worker_config, secondary_worker_config, and autoscaling_config.