Logging and monitoring for third-party applications is not available
unless the Ops Agent
is installed.
Notes:
The source code for 2.2 image libraries that are licensed under Reciprocal
and Restricted licenses is available at the
/usr/local/share/google/dataproc/third-party-sources path on
Dataproc cluster VMs.
The following Hudi procedures
are known not to work on a Hudi table backed by the Cloud Storage file system:
Last updated 2025-03-21 UTC.