All Cloud Dataproc clusters come with the BigQuery connector for Hadoop built in. This means jobs running on Cloud Dataproc can read data from BigQuery and write results back to BigQuery without installing the connector separately.
Using Spark
See Using the BigQuery Connector with Spark for an example of using Spark with the BigQuery connector for Hadoop. Because the connector is preinstalled, that example should work on Cloud Dataproc clusters.
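As a rough illustration of what such a Spark job can look like, the following PySpark sketch reads one BigQuery table through the connector's Hadoop input format. It is a minimal sketch rather than the referenced example itself. The helper names `build_bq_input_conf` and `read_table_as_dicts` are hypothetical, as are any project, bucket, and path values passed to them. The configuration keys and the input format class are the ones the connector is understood to use, so check them against the connector version on your cluster.

```python
import json


def build_bq_input_conf(project, bucket, temp_path,
                        input_project, input_dataset, input_table):
    """Build the Hadoop configuration for reading one BigQuery table.

    All argument values are supplied by the caller; nothing here is
    looked up from the cluster.
    """
    return {
        # Project that runs (and is billed for) the export job.
        "mapred.bq.project.id": project,
        # Cloud Storage bucket and path used for temporary export files.
        "mapred.bq.gcs.bucket": bucket,
        "mapred.bq.temp.gcs.path": temp_path,
        # Fully qualified location of the table to read.
        "mapred.bq.input.project.id": input_project,
        "mapred.bq.input.dataset.id": input_dataset,
        "mapred.bq.input.table.id": input_table,
    }


def read_table_as_dicts(sc, conf):
    """Return an RDD of Python dicts, one per BigQuery row.

    `sc` is an existing pyspark.SparkContext. This function only works
    where the BigQuery connector jar is on the classpath, such as a
    Cloud Dataproc cluster.
    """
    rdd = sc.newAPIHadoopRDD(
        "com.google.cloud.hadoop.io.bigquery.JsonTextBigQueryInputFormat",
        "org.apache.hadoop.io.LongWritable",
        "com.google.gson.JsonObject",
        conf=conf,
    )
    # Each record is a (key, json_string) pair; keep only the parsed row.
    return rdd.map(lambda record: json.loads(record[1]))
```

On a cluster, you would create a `SparkContext`, build the configuration with your own project, bucket, and table names, and pass both to `read_table_as_dicts`. Keeping the configuration in a separate function makes it easy to inspect or unit test without a running cluster.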
Last updated 2025-03-21 UTC.