Stay organized with collections
Save and categorize content based on your preferences.
You can install additional components like Zookeeper when you create a Dataproc
cluster using the
Optional components
feature. This page describes the Zookeeper component.
The Apache Zookeeper
component is centralized service for providing distributed synchronization
of data.
Install the component
Install the component when you create a Dataproc cluster.
To create a Dataproc cluster that includes the Zookeeper component,
use the
gcloud dataproc clusters createcluster-name
command with the --optional-components flag (using image version
1.0 or later).
[[["Easy to understand","easyToUnderstand","thumb-up"],["Solved my problem","solvedMyProblem","thumb-up"],["Other","otherUp","thumb-up"]],[["Hard to understand","hardToUnderstand","thumb-down"],["Incorrect information or sample code","incorrectInformationOrSampleCode","thumb-down"],["Missing the information/samples I need","missingTheInformationSamplesINeed","thumb-down"],["Other","otherDown","thumb-down"]],["Last updated 2025-08-25 UTC."],[[["\u003cp\u003eZookeeper can be installed as an optional component when creating a Dataproc cluster, enabling distributed data synchronization.\u003c/p\u003e\n"],["\u003cp\u003eThe Zookeeper component is installed during Dataproc cluster creation through the use of the \u003ccode\u003e--optional-components\u003c/code\u003e flag.\u003c/p\u003e\n"],["\u003cp\u003eYou can install the Zookeeper component by using the gcloud command, through a REST API request, or through the Google Cloud console when creating the cluster.\u003c/p\u003e\n"],["\u003cp\u003eThe specific version of the Zookeeper component that will be installed is dependent on the Dataproc image version being used for the cluster.\u003c/p\u003e\n"]]],[],null,["# Dataproc optional Zookeeper component\n\nYou can install additional components like Zookeeper when you create a Dataproc\ncluster using the\n[Optional components](/dataproc/docs/concepts/components/overview#available_optional_components)\nfeature. This page describes the Zookeeper component.\n\nThe [Apache Zookeeper](https://zookeeper.apache.org/)\ncomponent is centralized service for providing distributed synchronization\nof data.\n\nInstall the component\n---------------------\n\nInstall the component when you create a Dataproc cluster.\n\nSee\n[Supported Dataproc versions](/dataproc/docs/concepts/versioning/dataproc-versions#supported_cloud_dataproc_versions)\nfor the component version included in each Dataproc image release. \n\n### gcloud command\n\nTo create a Dataproc cluster that includes the Zookeeper component,\nuse the\n[gcloud dataproc clusters create](/sdk/gcloud/reference/dataproc/clusters/create) \u003cvar translate=\"no\"\u003ecluster-name\u003c/var\u003e\ncommand with the `--optional-components` flag (using image version\n1.0 or later). \n\n```\ngcloud dataproc clusters create cluster-name \\\n --optional-components=ZOOKEEPER \\\n --region=region \\\n ... other flags\n```\n\n### REST API\n\nThe Zookeeper component can be specified through the Dataproc API using [SoftwareConfig.Component](/dataproc/docs/reference/rest/v1/ClusterConfig#Component) as part of a [clusters.create](/dataproc/docs/reference/rest/v1/projects.regions.clusters/create) request.\n\n### Console\n\n1. Enable the component.\n - In the Google Cloud console, open the Dataproc [Create a cluster](https://console.cloud.google.com/dataproc/clustersAdd) page. The Set up cluster panel is selected.\n - In the Components section, under Optional components, select Zookeeper and other optional components to install on your cluster."]]