Mantenha tudo organizado com as coleções
Salve e categorize o conteúdo com base nas suas preferências.
Para complementar o disco de inicialização, você pode anexar
unidades de estado sólido locais (SSDs locais)
aos workers mestre, primário e nó de trabalho secundário do cluster.
Quando os SSDs locais são fornecidos para o cluster, os dados do HDFS e do zero,
como saídas aleatórias, usam os SSDs locais em vez do
disco permanente de inicialização.
Os SSDs locais podem oferecer tempos de leitura e gravação mais rápidos do que disco permanente. Consulte Desempenho do SSD local.
O tamanho de cada SSD local (375 GB) é fixo, mas é possível anexar vários SSDs locais para aumentar o armazenamento SSD (consulte Sobre SSDs locais).
Cada SSD local é ativado em /mnt/<id> nos nós de cluster do Dataproc.
Os SSDs locais usam ext4 como sistema de arquivos padrão.
Usar SSDs locais
Comando gcloud
Use o comando
gcloud dataproc clusters create
com as flags --num-master-local-ssds,
--num-workers-local-ssds e
--num-secondary-worker-local-ssds para anexar SSDs locais aos nós mestre, de worker primário e de worker secundário do cluster.
Os SSDs locais podem ser conectados a VMs do Dataproc usando uma interface SCSI
(Small Computer System Interface) ou NVME (Non-Volatile Memory Express). Consulte
desempenho do SSD local.
A interface padrão de SSD local da VM do cluster do Dataproc é a SCSI. Use o comando
gcloud dataproc clusters create
com as flags --master-local-ssd-interface,
--worker-local-ssd-interface e
--secondary-worker-local-ssd-interface
para especificar a interface de SSD local para nós de trabalho mestre, primário e secundário.
Defina o campo numLocalSsds em masterConfig, workerConfig e secondaryWorkerConfigInstanceGroupConfig em uma solicitação de API cluster.create para anexar SSDs locais aos nós mestre, de worker primário e de worker secundário do cluster.
Os SSDs locais podem ser conectados a VMs do Dataproc usando uma interface SCSI
(Small Computer System Interface) ou NVME (Non-Volatile Memory Express). Consulte
desempenho do SSD local.
A interface padrão de SSD local da VM do cluster do Dataproc é a SCSI. Defina o campo localSsdInterface em masterConfig, workerConfig e secondaryWorkerConfigInstanceGroupConfig em uma solicitação de API cluster.create para especificar a interface "SCSI" ou "NVME" e anexar SSDs locais aos nós mestre, de worker primário e de worker secundário do cluster.
Console
Crie um cluster e anexe SSDs locais aos nós de trabalho mestre,
principais e secundários no painel "Configurar nós" da
página
Criar um cluster do
Dataproc do console Google Cloud .
[[["Fácil de entender","easyToUnderstand","thumb-up"],["Meu problema foi resolvido","solvedMyProblem","thumb-up"],["Outro","otherUp","thumb-up"]],[["Difícil de entender","hardToUnderstand","thumb-down"],["Informações incorretas ou exemplo de código","incorrectInformationOrSampleCode","thumb-down"],["Não contém as informações/amostras de que eu preciso","missingTheInformationSamplesINeed","thumb-down"],["Problema na tradução","translationIssue","thumb-down"],["Outro","otherDown","thumb-down"]],["Última atualização 2025-08-22 UTC."],[[["\u003cp\u003eLocal SSDs can be attached to master, primary worker, and secondary worker nodes in a cluster, offering faster read and write speeds compared to persistent disks.\u003c/p\u003e\n"],["\u003cp\u003eEach local SSD has a fixed size of 375GB, but multiple SSDs can be attached to increase the total SSD storage capacity.\u003c/p\u003e\n"],["\u003cp\u003eThe gcloud command-line tool and REST API provide methods to configure the number and interface type (SCSI or NVME) of local SSDs attached to each node type.\u003c/p\u003e\n"],["\u003cp\u003eLocal SSDs are automatically mounted to the \u003ccode\u003e/mnt/<id>\u003c/code\u003e directory on Dataproc cluster nodes and use the ext4 file system by default.\u003c/p\u003e\n"],["\u003cp\u003eYou can create a cluster with local SSDs from the "Configure Nodes" panel of the Dataproc cluster creation page of the google cloud console.\u003c/p\u003e\n"]]],[],null,["# Dataproc local SSDs\n\nTo supplement the boot disk, you can attach\n[local Solid State Drives (local SSDs)](/compute/docs/disks/local-ssd)\nto master, primary worker, and secondary worker nodes in your cluster.\nWhen local SSDs are provided to the cluster, both HDFS and scratch data,\nsuch as shuffle outputs, use the local SSDs instead of the boot\npersistent disk.\n\n- Local SSDs can provide faster read and write times than persistent disk (see [Local SSD Performance](/compute/docs/disks/local-ssd#performance)).\n- The 375GB size of each local SSD is fixed, but you can attach multiple local SSDs to increase SSD storage (see [About Local SSDs](/compute/docs/disks/local-ssd)).\n- Each local SSD is mounted to `/mnt/\u003cid\u003e` in Dataproc cluster nodes.\n- Local SSDs use [`ext4`](https://en.wikipedia.org/wiki/Ext4) as the default filesystem.\n\nUse local SSDs\n--------------\n\n### gcloud command\n\nUse the\n[`gcloud dataproc clusters create`](/sdk/gcloud/reference/dataproc/clusters/create)\ncommand with the `--num-master-local-ssds`,\n`--num-workers-local-ssds`, and\n`--num-secondary-worker-local-ssds` flags to attach local\nSSDs to the cluster's master, primary, and secondary worker\nnodes.\n\nLocal SSDs can be attached to Dataproc VMs using a SCSI\n(Small Computer System Interface) or NVME (Non-Volatile Memory Express) interface (see\n[local SSD performance](/compute/docs/disks/local-ssd#performance)).\nThe default Dataproc cluster VM local SSD interface is the SCSI interface. Use the\n[gcloud dataproc clusters create](/sdk/gcloud/reference/dataproc/clusters/create)\ncommand with the `--master-local-ssd-interface`,\n`--worker-local-ssd-interface`, and\n`--secondary-worker-local-ssd-interface` flags\nto specify the local SSD interface for master, primary, and secondary\nworker nodes.\n\n**Example:** \n\n```\ngcloud dataproc clusters create cluster-name \\\n --region=region \\\n --num-master-local-ssds=1 \\\n --num-worker-local-ssds=1 \\\n --num-secondary-worker-local-ssds=1 \\\n --master-local-ssd-interface=NVME \\\n --worker-local-ssd-interface=NVME \\\n --secondary-worker-local-ssd-interface=NVME \\\n ... other args ...\n```\n\n### REST API\n\nSet the\n[numLocalSsds](/dataproc/docs/reference/rest/v1/ClusterConfig#diskconfig)\nfield in the `masterConfig`, `workerConfig`, and\n`secondaryWorkerConfig`\n[InstanceGroupConfig](/dataproc/docs/reference/rest/v1/ClusterConfig#InstanceGroupConfig)\nin a\n[cluster.create](/dataproc/docs/reference/rest/v1/projects.regions.clusters/create)\nAPI request to attach local SSDs to the cluster's master, primary worker, and\nsecondary worker nodes.\n\nLocal SSDs can be attached to Dataproc VMs using a SCSI\n(Small Computer System Interface) or NVME (Non-Volatile Memory Express) interface (see\n[local SSD performance](/compute/docs/disks/local-ssd#performance)).\nThe default Dataproc cluster VM local SSD interface is the SCSI interface. Set the\n[localSsdInterface](/dataproc/docs/reference/rest/v1/ClusterConfig#diskconfig)\nfield in the `masterConfig`, `workerConfig`, and\n`secondaryWorkerConfig`\n[InstanceGroupConfig](/dataproc/docs/reference/rest/v1/ClusterConfig#InstanceGroupConfig)\nin a\n[cluster.create](/dataproc/docs/reference/rest/v1/projects.regions.clusters/create)\nAPI request to specify the \"SCSI\" or \"NVME\" interface to attach local SSDs to the cluster's master,\nprimary worker, and secondary worker nodes.\n\n### Console\n\nCreate a cluster and attach local SSDs to the master,\nprimary, and secondary worker nodes from the Configure nodes panel of the\nDataproc\n[Create a cluster](https://console.cloud.google.com/dataproc/clustersAdd) page\nof the Google Cloud console."]]