Stay organized with collections
Save and categorize content based on your preferences.
All Dataproc clusters come with the
BigQuery connector for Hadoop built in. This means
you can quickly read and write BigQuery data to and from
Dataproc.
Use Spark
See Using the BigQuery Connector with Spark
for an example on using Spark with the BigQuery connector for Hadoop.
This example should work for Dataproc clusters.
[[["Easy to understand","easyToUnderstand","thumb-up"],["Solved my problem","solvedMyProblem","thumb-up"],["Other","otherUp","thumb-up"]],[["Hard to understand","hardToUnderstand","thumb-down"],["Incorrect information or sample code","incorrectInformationOrSampleCode","thumb-down"],["Missing the information/samples I need","missingTheInformationSamplesINeed","thumb-down"],["Other","otherDown","thumb-down"]],["Last updated 2025-07-02 UTC."],[[["Cloud Dataproc clusters include a built-in BigQuery connector for Hadoop, facilitating seamless data transfer between BigQuery and Cloud Dataproc."],["Spark users can leverage the BigQuery connector with Cloud Dataproc clusters, as demonstrated in the provided example."],["Java MapReduce jobs can also utilize the BigQuery connector with Cloud Dataproc clusters, with an example provided for reference."],["Additional details about the BigQuery Hadoop Connector are available in its dedicated documentation."]]],[]]