Cloud Data Fusion pricing
This document explains the pricing for Cloud Data Fusion. To see the
pricing for other products, read the
Pricing documentation.
For pricing purposes, usage is measured as the length of time, in minutes,
from when a Cloud Data Fusion instance is created until it is deleted.
Although the pricing rate is defined per hour, Cloud Data Fusion is billed
by the minute. To apply hourly pricing to minute-by-minute use, usage is
expressed in hours (for example, 30 minutes is 0.5 hours).
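The conversion from billed minutes to an hourly-rate charge can be sketched as follows. This is an illustrative Python sketch, not an official billing formula: the function name `instance_charge` is hypothetical, the $1.80 rate is the Basic edition rate listed in this document, and actual invoices may round differently.

```python
from decimal import Decimal


def instance_charge(minutes_running: int, hourly_rate: str) -> Decimal:
    """Return the charge for an instance billed per minute at an hourly rate.

    Hypothetical helper for illustration only.
    """
    # Express the billed minutes in hours, e.g. 30 minutes -> 0.5 hours.
    hours = Decimal(minutes_running) / Decimal(60)
    return hours * Decimal(hourly_rate)


# 30 minutes at $1.80 per hour (Basic edition rate from this page).
print(instance_charge(30, "1.80"))
```

`Decimal` is used instead of `float` so that currency arithmetic does not pick up binary rounding errors.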
If you pay in a currency other than USD, the prices listed in your currency on
Google Cloud SKUs
apply.
Pricing overview
Cloud Data Fusion pricing is split across two functions:
pipeline development and execution.
Development
For pipeline development, Cloud Data Fusion offers the following three
editions:
| Cloud Data Fusion edition | Price per instance per hour |
| --- | --- |
| Developer | $0.35 (~$250 per month) |
| Basic | $1.80 (~$1100 per month) |
| Enterprise | $4.20 (~$3000 per month) |
The Basic edition offers the first 120 hours per month per account free.
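The effect of the free hours on a monthly Basic edition bill can be sketched as follows. This is a minimal sketch under stated assumptions: the helper `basic_monthly_charge` is hypothetical, and it assumes the 120 free hours are simply subtracted from the account's total Basic instance hours for the month. This page does not specify how free hours are allocated across multiple instances.

```python
from decimal import Decimal

FREE_BASIC_HOURS = Decimal(120)      # free hours per month per account (Basic only)
BASIC_HOURLY_RATE = Decimal("1.80")  # Basic edition rate from the table above


def basic_monthly_charge(hours_used: int) -> Decimal:
    """Return the Basic edition development charge for one month.

    Assumption: free hours are deducted from the account's total hours.
    """
    # Hours beyond the free allowance are billable; never go below zero.
    billable_hours = max(Decimal(0), Decimal(hours_used) - FREE_BASIC_HOURS)
    return billable_hours * BASIC_HOURLY_RATE


# 144 hours in a month leaves 24 billable hours.
print(basic_monthly_charge(144))
```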
Execution
For pipeline execution, you are charged for the Dataproc clusters
that Cloud Data Fusion creates to run your pipelines at the
current Dataproc rates.
Comparison of Developer, Basic, and Enterprise editions
* Concurrent users: in general,
Cloud Data Fusion supports a maximum of 50 users per instance. If
RBAC is enabled, the maximum is 25 users.
** Concurrent pipeline execution is limited and
based on the instance version being used. For access to
scalability details,
reach out to a Google Cloud representative.
Usage of other Google Cloud resources
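In addition to the development cost of a Cloud Data Fusion instance,
you are billed for any other resources that you use to execute your pipelines,
such as:
- Dataproc
- Cloud Storage
- Networking
- BigQuery
Note: For building replication jobs, BigQuery flat-rate pricing is
recommended, not on-demand pricing.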
Supported regions
Currently, pricing for Cloud Data Fusion is the same for all supported
regions.
* Data Lineage in Cloud Data Fusion isn't supported in africa-south1,
me-central1, or europe-west12.
Pricing example
Consider a Cloud Data Fusion instance that has been running for 24
hours, with no free hours remaining for the Basic edition. The instance
charge for Cloud Data Fusion, by edition, is summarized in the
following table:
| Edition | Cost per hour | Number of hours | Development cost |
| --- | --- | --- | --- |
| Developer | $0.35 | 24 | 24 * $0.35 = $8.40 |
| Basic | $1.80 | 24 | 24 * $1.80 = $43.20 |
| Enterprise | $4.20 | 24 | 24 * $4.20 = $100.80 |
Note: Cloud Data Fusion instances, once provisioned, always need to be
available. After you delete instances, they cannot be recovered and any
pipeline data is lost. For estimated monthly costs, refer to the Pricing
overview.
During this 24-hour period, you ran a pipeline that read raw data from
Cloud Storage, performed transformations, and wrote the data to
BigQuery every hour. Each run took approximately 15 minutes to
complete. In other words, the Dataproc clusters that were
created for these runs were alive for 15 minutes (0.25 hours) each. Assume that
the configuration of each Dataproc cluster was the following:
| Item | Machine type | Virtual CPUs | Attached persistent disk | Number in cluster |
| --- | --- | --- | --- | --- |
| Master node | n1-standard-4 | 4 | 500 GB | 1 |
| Worker nodes | n1-standard-4 | 4 | 500 GB | 5 |
The Dataproc clusters each have 24 virtual CPUs: 4 for the
master and 20 spread across the workers. For Dataproc billing
purposes, the pricing for this cluster would be based on those 24 virtual CPUs
and the length of time each cluster ran.
Across all runs of your pipeline, the total charge incurred for
Dataproc can be calculated as:
Dataproc charge = # of vCPUs * number of clusters * hours per cluster * Dataproc price
= 24 * 24 * 0.25 * $0.01
= $1.44
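The same calculation can be reproduced in a short Python sketch. The names `dataproc_charge`, `MASTER`, and `WORKERS` are hypothetical and exist only for illustration. The $0.01 per vCPU per hour rate is the figure used in the example above; check the current Dataproc rates before relying on it.

```python
from decimal import Decimal

# Cluster configuration from the example: (node count, vCPUs per node).
MASTER = (1, 4)
WORKERS = (5, 4)

# Rate used in the example above; an illustrative value, not a quoted price.
DATAPROC_PRICE_PER_VCPU_HOUR = Decimal("0.01")


def dataproc_charge(runs: int, hours_per_run: str) -> Decimal:
    """Return the Dataproc charge across all pipeline runs.

    Each run is assumed to create one cluster with the configuration above.
    """
    # 1 * 4 master vCPUs + 5 * 4 worker vCPUs = 24 vCPUs per cluster.
    vcpus = MASTER[0] * MASTER[1] + WORKERS[0] * WORKERS[1]
    return vcpus * runs * Decimal(hours_per_run) * DATAPROC_PRICE_PER_VCPU_HOUR


# 24 hourly runs, each keeping its cluster alive for 0.25 hours.
print(dataproc_charge(runs=24, hours_per_run="0.25"))
```

Adding the Basic edition's $43.20 development cost to this $1.44 gives $44.64 for the day, before the separately billed Compute Engine, Persistent Disk, Cloud Storage, and BigQuery charges.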
The Dataproc clusters use other Google Cloud products, which
are billed separately. Specifically, these clusters incur charges for
Compute Engine and Standard Persistent Disk provisioned space. You also
incur storage charges for Cloud Storage and BigQuery, depending on the
amount of data your pipeline processes.
To determine these additional costs based on current rates, you can use the
billing calculator.
With Google Cloud's pay-as-you-go pricing, you only pay for the services you
use. Connect with our sales team to get a custom quote for your organization.