Stay organized with collections
Save and categorize content based on your preferences.
A provisioner is responsible for creating and tearing down the cloud cluster
where the pipeline is executed. Different provisioners are capable of
creating different types of clusters on various clouds.
Each provisioner exposes a set of configuration settings that control the type
of cluster that's created for a run. For example, the Dataproc
and Amazon EMR provisioners have cluster size settings. Provisioners also have
settings for the credentials required to talk to their respective clouds and
provision the required compute nodes.
Supported provisioners in Cloud Data Fusion
Cloud Data Fusion supports the following provisioners:
[[["Easy to understand","easyToUnderstand","thumb-up"],["Solved my problem","solvedMyProblem","thumb-up"],["Other","otherUp","thumb-up"]],[["Hard to understand","hardToUnderstand","thumb-down"],["Incorrect information or sample code","incorrectInformationOrSampleCode","thumb-down"],["Missing the information/samples I need","missingTheInformationSamplesINeed","thumb-down"],["Other","otherDown","thumb-down"]],["Last updated 2025-03-21 UTC."],[[["Provisioners manage the creation and deletion of cloud clusters for pipeline execution."],["Different provisioners enable the creation of various cluster types on different cloud platforms."],["Provisioners offer configuration settings to define the characteristics of the created cluster and credentials for cloud access."],["Cloud Data Fusion supports Dataproc, Amazon EMR, and Remote Hadoop provisioners."]]],[]]