This page describes how to enable Cloud Logging for your Cloud Data Fusion Dataproc clusters and access advanced pipeline logs.
Enable Dataproc Cloud Logging
To view pipeline and cluster issues in Cloud Logging, enable advanced logs in new or existing Cloud Data Fusion instances. To enable advanced logs in an existing instance, do the following:
In the Google Cloud console, go to the Cloud Data Fusion Instances page.
Click the instance name.
In the Advanced monitoring and logging section, for Dataproc Cloud Logging, click
Edit.In the Cloud Logging window, select the Enable Cloud Logging checkbox.
Click Save.
View logs
Every Cloud Data Fusion pipeline run is assigned a unique RunID
.
After you deploy and run your pipeline, find its RunID
. Then, in
Logging, use the RunID
to view your pipeline
logs.
Get the pipeline's RunID
- Go to your instance:
In the Google Cloud console, go to the Cloud Data Fusion page.
To open the instance in the Cloud Data Fusion Studio, click Instances, and then click View instance.
- Click List.
- Click the pipeline for which you want to get the Run ID.
- Click Summary.
- In the Run history section, click Table.
- To copy the Run ID, right-click the RunID, and click Copy.
View the logs in Logs Explorer
In the Google Cloud console, go to the Cloud Logging > Logs Explorer page:
In the All resources drop-down, select Cloud Dataproc Cluster >
cdap-PIPELINE_NAME-YOUR_RUNID
.
Optional: Filter the logs
Use the filter menus to filter your logs. You can filter by various log
severity levels
or by components such as datafusion-pipeline-logs
.
Optional: Download the logs
Click Download logs.
For more information, see downloading log entries.
What's next
- Learn how to view audit logs.
- Learn how to view pipeline logs from the Cloud Data Fusion Pipeline Studio.