This document describes how to access and view pipeline logs and service logs for Cloud Data Fusion.
Starting with Cloud Data Fusion version 6.11, pipeline logs and service logs are available in Cloud Logging.
About log types
Cloud Data Fusion generates several types of logs to help monitor and troubleshoot data integration processes:
Pricing
Cloud Logging and Cloud Monitoring usage incurs charges. For more information, see Google Cloud Observability pricing.
Optional: Import the Cloud Data Fusion Logging dashboard
To view pipeline logs and service logs using the Cloud Data Fusion Logging dashboard, import the dashboard:
In the Google Cloud console, go to the Cloud Monitoring Dashboards page.
Click View dashboard templates.
Search for Cloud Data Fusion Logging and select the dashboard.
Click
Add Cloud Data Fusion Logging dashboard to your list.
View pipeline logs
You can view pipeline logs using the Cloud Data Fusion Logging dashboard or directly in the Logs Explorer.
View pipeline logs using dashboard
If you haven't done already, import the Cloud Data Fusion Logging dashboard.
In the My dashboards section, click Cloud Data Fusion Logging.
In the Pipeline logs section, view the list of pipeline logs. You can filter the logs by severity, field names, and values.
To refine your search using queries, use Logs Explorer.
View pipeline logs in Logs Explorer
In the Google Cloud console, go to the Logs Explorer page.
Enter the following query:
resource.type="datafusion.googleapis.com/PipelineV2"
This displays the list of pipeline logs. You can use filters to refine the results.
Filter pipeline logs
You can filter pipeline logs by run ID, instance ID, pipeline ID, location, namespace, or custom labels.
Every Cloud Data Fusion pipeline run is assigned a unique RunID
.
After you deploy and run your pipeline, you can find the RunID
of your
pipeline and view the corresponding pipeline logs.
To filter pipeline logs by RunID
, follow these steps:
In the Google Cloud console, go to the Logs Explorer page.
Enter the following query:
resource.type="datafusion.googleapis.com/PipelineV2" resource.labels.run_id=RUN_ID
View service logs
You can view service logs using the Cloud Data Fusion Logging dashboard or in the Logs Explorer.
View service logs using dashboard
If you haven't done already, import the Cloud Data Fusion Logging dashboard.
In the My dashboards section, click Cloud Data Fusion Logging.
In the Service logs section, view the list of service logs. You can filter the logs by severity, field names, and values.
To refine your search using queries, use Logs Explorer.
View service logs in Logs Explorer
In the Google Cloud console, go to the Logs Explorer page.
Find the service logs by entering the specific query for that service.
Service name Log query Appfabric resource.type="datafusion.googleapis.com/InstanceV2" labels.".serviceId"="appfabric"
AppFabric processor resource.type="datafusion.googleapis.com/InstanceV2" labels.".serviceId"="appfabric.processor"
Dataset executor resource.type="datafusion.googleapis.com/InstanceV2" labels.".serviceId"="dataset.executor"
Log saver resource.type="datafusion.googleapis.com/InstanceV2" labels.".serviceId"="log.saver"
Metadata service resource.type="datafusion.googleapis.com/InstanceV2" labels.".serviceId"="metadata.service"
Metrics resource.type="datafusion.googleapis.com/InstanceV2" labels.".serviceId"="metrics"
Pipeline Studio resource.type="datafusion.googleapis.com/InstanceV2" resource.labels.namespace="system" labels.".userserviceid"="studio"
Runtime resource.type="datafusion.googleapis.com/InstanceV2" labels.".serviceId"="runtime"
Wrangler service resource.type="datafusion.googleapis.com/InstanceV2" resource.labels.namespace="system" labels.".applicationId"="dataprep" labels.".userserviceid"="service"
Configurable logging in Cloud Data Fusion
Cloud Data Fusion 6.11.0 offers configurable logging, with Cloud Logging enabled by default. While disabling Cloud Logging is possible, it's strongly recommended to keep it enabled to ensure you have access to critical pipeline and instance logs.
To disable Cloud Logging, run the following command:
echo '{ "loggingConfig": {"instance_cloud_logging_disabled": "true"}}' | curl -X POST \
-H "Authorization: Bearer $(gcloud auth print-access-token)" \
-H "Content-Type: application/json" \
--data @- \
"https://datafusion.googleapis.com/v1/projects/PROJECT_ID/locations/LOCATION/instances?instanceId=$INSTANCE_ID?updateMask=logging_config"
Replace the following:
PROJECT_ID
: the Google Cloud project IDLOCATION
: the location of your instanceINSTANCE_ID
: the ID of your Cloud Data Fusion instance
What's next
- Learn more about Cloud Data Fusion audit logging.
- Learn how to view advanced pipeline logs.