Tetap teratur dengan koleksi
Simpan dan kategorikan konten berdasarkan preferensi Anda.
Anda dapat menginstal komponen tambahan seperti Apache Pig
saat membuat cluster Dataproc menggunakan fitur
Komponen opsional. Halaman ini menjelaskan komponen Pig, platform open source untuk
menganalisis set data besar.
Menginstal komponen
Instal komponen saat Anda membuat cluster Dataproc.
Apache Pig adalah komponen opsional di versi image Dataproc 2.3 dan yang lebih baru.
Untuk membuat cluster Dataproc yang menyertakan komponen Pig, gunakan perintah
gcloud dataproc clusters create CLUSTER_NAME
dengan flag --optional-components (menggunakan versi image 2.3 atau yang lebih baru).
[[["Mudah dipahami","easyToUnderstand","thumb-up"],["Memecahkan masalah saya","solvedMyProblem","thumb-up"],["Lainnya","otherUp","thumb-up"]],[["Sulit dipahami","hardToUnderstand","thumb-down"],["Informasi atau kode contoh salah","incorrectInformationOrSampleCode","thumb-down"],["Informasi/contoh yang saya butuhkan tidak ada","missingTheInformationSamplesINeed","thumb-down"],["Masalah terjemahan","translationIssue","thumb-down"],["Lainnya","otherDown","thumb-down"]],["Terakhir diperbarui pada 2025-08-22 UTC."],[],[],null,["# Dataproc optional Pig component\n\nYou can install additional components like [Apache Pig](https://pig.apache.org/)\nwhen you create a Dataproc cluster using the\n[Optional components](/dataproc/docs/concepts/components/overview#available_optional_components)\nfeature. This page describes the Pig component, an open source platform for\nanalyzing large data sets.\n\nInstall the component\n---------------------\n\nInstall the component when you create a Dataproc cluster.\n\nApache Pig is an optional component in Dataproc `2.3` and later\nimage versions.\n| **Note:** Apache Pig is automatically installed on Dataproc `2.2` and earlier image versions.\n\nSee\n[Supported Dataproc versions](/dataproc/docs/concepts/versioning/dataproc-versions#supported_cloud_dataproc_versions)\nfor component versions included in the latest Dataproc image\nreleases. \n\n### gcloud\n\nTo create a Dataproc cluster that includes the Pig component,\nuse the\n[`gcloud dataproc clusters create `\u003cvar translate=\"no\"\u003eCLUSTER_NAME\u003c/var\u003e](/sdk/gcloud/reference/dataproc/clusters/create)\ncommand with the `--optional-components` flag (using image version\n2.3 or later). \n\n```\ngcloud dataproc clusters create CLUSTER_NAME \\\n --region=REGION \\\n --optional-components=PIG \\\n --image-version=2.3 \\\n ... other flags\n```\n\n### REST API\n\nThe Pig component can be specified through the Dataproc API\nusing\n[SoftwareConfig.Component](/dataproc/docs/reference/rest/v1/ClusterConfig#Component)\nas part of a\n[clusters.create](/dataproc/docs/reference/rest/v1/projects.regions.clusters/create)\nrequest.\n\n### Console\n\nEnable the component:\n\n1. In the Google Cloud console, open the Dataproc [Create a cluster](https://console.cloud.google.com/dataproc/clustersAdd) page. The Set up cluster panel is selected.\n2. In the Components section, under Optional components, select Pig and other optional components to install on your cluster."]]