Stay organized with collections
Save and categorize content based on your preferences.
You can install additional components like Zeppelin when you create a Dataproc
cluster using the
Optional components
feature. This page describes the Zeppelin component.
The Zeppelin Notebook
component is a Web-based notebook for interactive data analytics. The Zeppelin
Web UI is available on port 8080 on the cluster's first master node.
By default, notebooks are saved in Cloud Storage
in the Dataproc staging bucket, which is specified by the user or
auto-created
when the cluster is created. The location can be changed at cluster creation
time via the
zeppelin:zeppelin.notebook.gcs.dir property.
Install the component
Install the component when you create a Dataproc cluster.
Components can be added to clusters created with
Dataproc version 1.3
and later.
To create a Dataproc cluster that includes the Zeppelin component,
use the
gcloud dataproc clusters createcluster-name
command with the --optional-components flag.
See Viewing and Accessing Component Gateway URLs
to click Component Gateway links on the Google Cloud console to open the
Zeppelin notebook UI running on the cluster's master node in your local
browser.
[[["Easy to understand","easyToUnderstand","thumb-up"],["Solved my problem","solvedMyProblem","thumb-up"],["Other","otherUp","thumb-up"]],[["Hard to understand","hardToUnderstand","thumb-down"],["Incorrect information or sample code","incorrectInformationOrSampleCode","thumb-down"],["Missing the information/samples I need","missingTheInformationSamplesINeed","thumb-down"],["Other","otherDown","thumb-down"]],["Last updated 2025-04-02 UTC."],[[["The Zeppelin component, a web-based notebook for interactive data analytics, can be installed as an optional component when creating a Dataproc cluster."],["Zeppelin notebooks are saved in a Cloud Storage staging bucket by default, but this location can be customized during cluster creation using a specific property."],["The Zeppelin Web UI is accessible on port 8080 of the cluster's first master node, and the Component Gateway can be enabled to provide secured connections to the UI."],["To install Zeppelin, use the `gcloud dataproc clusters create` command with the `--optional-components=ZEPPELIN` flag, and the `--enable-component-gateway` flag to enable secure connections when creating the cluster with an image version of 1.3.29 or later."],["You can install the Zeppelin component through the Google cloud console by selecting it in the optional components section, and also selecting to enable the component gateway when creating a new cluster."]]],[]]