You can install additional components like Zookeeper when you create a Dataproc cluster using the Optional components feature. This page describes the Zookeeper component.
The Apache Zookeeper component is centralized service for providing distributed synchronization of data.
Install the component
Install the component when you create a Dataproc cluster.
See Supported Dataproc versions for the component version included in each Dataproc image release.
gcloud command
To create a Dataproc cluster that includes the Zookeeper component,
use the
gcloud dataproc clusters create cluster-name
command with the --optional-components
flag (using image version
1.0 or later).
gcloud dataproc clusters create cluster-name \ --optional-components=ZOOKEEPER \ --region=region \ ... other flags
REST API
The Zookeeper component can be specified through the Dataproc API using SoftwareConfig.Component as part of a clusters.create request.Console
- Enable the component.
- In the Google Cloud console, open the Dataproc Create a cluster page. The Set up cluster panel is selected.
- In the Components section, under Optional components, select Zookeeper and other optional components to install on your cluster.