Tetap teratur dengan koleksi
Simpan dan kategorikan konten berdasarkan preferensi Anda.
Halaman ini menjelaskan cara melihat silsilah data yang dihasilkan oleh
pipeline Cloud Data Fusion Anda dengan pergerakan data lainnya di Google Cloud,
untuk tujuan penemuan dan tata kelola. Anda dapat
melihat grafik garis keturunan untuk sumber data yang didukung di
halaman Katalog Universal Dataplex di konsol, atau menggunakan Data Lineage API
untuk mengambil data lengkap data lineage.
Plugin yang mendukung silsilah data Katalog Universal Dataplex
Cloud Data Fusion dan Dataplex Universal Catalog mendukung silsilah tingkat
aktiva untuk plugin berikut:
Amazon S3
BigQuery
Penampung BigQuery Multi Table (versi 6.9.1 dan yang lebih baru)
Spanner
Cloud Storage
Cloud SQL untuk MySQL
Cloud SQL untuk PostgreSQL
Katalog Universal Dataplex
FTP
Database Umum
HTTP
MSSQL/SQL Server
Beberapa sumber Tabel Database (versi 6.9.1 dan yang lebih baru)
Berikan peran Data Lineage Events Producer
(roles/datalineage.producer)
ke akun layanan yang dikelola Cloud Data Fusion, Cloud Data
Fusion API Service
Agent.
Prosesnya bervariasi jika instance Anda berjalan di Cloud Data Fusion versi sebelumnya dan RBAC diaktifkan.
6.10+ atau tidak ada RBAC
Jika instance Cloud Data Fusion Anda menggunakan versi 6.10.0 atau yang lebih baru, atau
instance Anda menggunakan versi sebelumnya dan RBAC tidak diaktifkan, ikuti langkah-langkah
berikut:
Centang kotak Include Google-provided role grants.
Pilih akun layanan Cloud Data Fusion API Service Agent, lalu klik
edit
Edit.
Klik Tambahkan peran lain, lalu pilih peran Produser Peristiwa Lineage Data.
Klik Simpan.
<6.10 with RBAC
Jika instance Cloud Data Fusion Anda menggunakan versi sebelum 6.10.0 dan RBAC diaktifkan, akun layanan tidak akan muncul dalam daftar akun utama di halaman IAM. Anda harus memasukkan nama akun layanan secara manual.
Untuk memberikan peran yang diperlukan, ikuti langkah-langkah berikut:
Di kolom New principals, masukkan akun layanan Agen Layanan Cloud Data Fusion API. Gunakan format berikut:
datafusion-system@TENANT_PROJECT_ID..
Ganti TENANT_PROJECT_ID dengan ID tenant untuk instance Anda. Untuk melihat ID project tenant, buka
halaman Instance, lalu klik nama instance untuk detail
instance.
Mengaktifkan silsilah data Katalog Universal Dataplex di Cloud Data Fusion
Untuk instance baru di Cloud Data Fusion, silsilah data Katalog Universal Dataplex dinonaktifkan secara default. Jika Anda membuat instance sebelum 27 Januari
2024 dengan versi 6.8.0 atau yang lebih baru, instance akan diaktifkan secara default setelah menyelesaikan
langkah-langkah di Sebelum memulai.
Mengaktifkan silsilah data Katalog Universal Dataplex saat Anda membuat instance
Konsol
Untuk mengaktifkan silsilah data Katalog Universal Dataplex saat Anda membuat instance,
ikuti langkah-langkah berikut:
Buka halaman Instance Cloud Data Fusion, lalu klik Create instance.
Saat Anda mengonfigurasi instance, luaskan bagian Advanced options dan klik Enable integration with Dataplex data lineage. Untuk mengetahui informasi selengkapnya tentang cara membuat instance, lihat Membuat instance publik.
REST API
Untuk mengaktifkan silsilah data Katalog Universal Dataplex saat Anda membuat instance,
tetapkan properti dataplex_data_lineage_integration_enabled opsional ke
true:
Untuk menonaktifkannya, tetapkan properti ke salah atau hapus properti, karena
lineage dinonaktifkan secara default saat Anda membuat instance baru.
Mengaktifkan atau menonaktifkan silsilah data Katalog Universal Dataplex di instance yang ada
Konsol
Untuk mengaktifkan atau menonaktifkan silsilah data Katalog Universal Dataplex di instance yang ada di
Cloud Data Fusion, ikuti langkah-langkah berikut:
Lihat detail instance:
Di konsol Google Cloud , buka halaman Cloud Data Fusion.
Klik Instance, lalu klik nama instance untuk membuka halaman
Instance details.
Di kolom Integrasi silsilah data Dataplex, klik Edit.
Aktifkan atau nonaktifkan silsilah data Katalog Universal Dataplex, lalu klik
Simpan.
REST API
Untuk mengaktifkan silsilah data Katalog Universal Dataplex di instance yang ada di
Cloud Data Fusion, tetapkan properti dataplex_data_lineage_integration_enabled
ke true dan sertakan nilai parameter updateMask:
Untuk menonaktifkan silsilah data Katalog Universal Dataplex di instance yang ada di
Cloud Data Fusion, tetapkan properti dataplex_data_lineage_integration_enabled
ke false dan sertakan nilai parameter updateMask:
Melihat silsilah di Katalog Universal Dataplex memiliki batasan berikut:
Silsilah di Katalog Universal Dataplex hanya dapat ditemukan jika ada
entitas BigQuery yang terhubung ke plugin yang didukung. Untuk mengetahui informasi selengkapnya tentang kapan grafik silsilah data tersedia, lihat Tentang silsilah data.
[[["Mudah dipahami","easyToUnderstand","thumb-up"],["Memecahkan masalah saya","solvedMyProblem","thumb-up"],["Lainnya","otherUp","thumb-up"]],[["Sulit dipahami","hardToUnderstand","thumb-down"],["Informasi atau kode contoh salah","incorrectInformationOrSampleCode","thumb-down"],["Informasi/contoh yang saya butuhkan tidak ada","missingTheInformationSamplesINeed","thumb-down"],["Masalah terjemahan","translationIssue","thumb-down"],["Lainnya","otherDown","thumb-down"]],["Terakhir diperbarui pada 2025-09-03 UTC."],[[["\u003cp\u003eThis page details how to view data lineage generated by Cloud Data Fusion pipelines, integrated with other data movement activities in Google Cloud, for enhanced discovery and governance.\u003c/p\u003e\n"],["\u003cp\u003eLineage graphs for supported data sources can be accessed through the Dataplex page in the console, or using the Data Lineage API for complete records.\u003c/p\u003e\n"],["\u003cp\u003eEnabling the Data Lineage API and granting the Data Lineage Events Producer role to the Cloud Data Fusion service account are essential prerequisites for viewing Cloud Data Fusion lineage graphs in Dataplex.\u003c/p\u003e\n"],["\u003cp\u003eDataplex data lineage is turned off by default for new Cloud Data Fusion instances, but can be enabled either during instance creation or for existing instances via the console or REST API.\u003c/p\u003e\n"],["\u003cp\u003eThere are certain limitations to viewing lineage in Dataplex, such as the necessity of a connected BigQuery entity, lack of support for customer-managed encryption keys, and region restrictions.\u003c/p\u003e\n"]]],[],null,["# View lineage in Dataplex Universal Catalog\n\nThis page describes how to view the data lineage generated by your\nCloud Data Fusion pipelines with other data movement on Google Cloud,\nfor discovery and governance purposes. You can\nview the lineage graphs for supported data sources on the\nDataplex Universal Catalog page in the console, or use the Data Lineage API\nto retrieve complete data lineage records. \n\nPlugins that support Dataplex Universal Catalog data lineage\n------------------------------------------------------------\n\nCloud Data Fusion and Dataplex Universal Catalog support asset-level\nlineage for the following plugins:\n\n- Amazon S3\n- BigQuery\n- BigQuery Multi Table sink (version 6.9.1 and later)\n- Spanner\n- Cloud Storage\n- Cloud SQL for MySQL\n- Cloud SQL for PostgreSQL\n- Dataplex Universal Catalog\n- FTP\n- Generic Database\n- HTTP\n- MSSQL/SQL Server\n- Multiple Database Tables source (version 6.9.1 and later)\n- MySQL\n- Oracle\n- PostgreSQL\n- SAP OData\n- SAP ODP\n- SAP Table\n\nFor more information, see\n[Cloud Data Fusion plugins](/data-fusion/plugins).\n\nBefore you begin\n----------------\n\nTo enable viewing Cloud Data Fusion lineage graphs on the\nDataplex Universal Catalog page in the console, do the following:\n\n1. [Create a data pipeline](/data-fusion/docs/create-data-pipeline)\n that uses only the [supported plugins](#supported_plugins).\n\n2. [Enable the Data Lineage API](/dataplex/docs/use-lineage) in the project that contains your\n Cloud Data Fusion instance.\n\n3. Grant the Data Lineage Events Producer role\n ([`roles/datalineage.producer`](/iam/docs/understanding-roles#datalineage.producer))\n to the Cloud Data Fusion-managed service account, the [Cloud Data\n Fusion API Service\n Agent](/data-fusion/docs/concepts/service-accounts#cloud-data-fusion-service-account).\n The process varies if your instance runs in an earlier version of\n Cloud Data Fusion and RBAC is enabled.\n\n ### 6.10+ or no RBAC\n\n\n If your Cloud Data Fusion instance uses version 6.10.0 or later, or\n your instance uses an earlier version and RBAC isn't enabled, follow these\n steps:\n 1. In the Google Cloud console, go to the **IAM** page.\n\n [Go to IAM](https://console.cloud.google.com/iam-admin/iam?supportedpurview=project)\n 2. Select the **Include Google-provided role grants** checkbox.\n\n 3. Select the Cloud Data Fusion API Service Agent service account and\n click\n edit\n **Edit**.\n\n 4. Click **Add another role** and select the **Data Lineage Events\n Producer** role.\n\n 5. Click **Save**.\n\n ### \\\u003c6.10 with RBAC\n\n If your Cloud Data Fusion instance uses a version earlier than\n 6.10.0 and RBAC is enabled, the service account doesn't appear in the\n list of principals on the IAM page. You must enter the\n service account name manually.\n\n To grant the required role, follow these steps:\n 1. In the Google Cloud console, go to the **IAM** page.\n\n [Go to IAM](https://console.cloud.google.com/iam-admin/iam?supportedpurview=project)\n 2. Click **Grant access**.\n\n 3. In the **New principals** field, enter the Cloud Data Fusion API\n Service Agent service account. Use the following format:\n `datafusion-system@`\u003cvar translate=\"no\"\u003eTENANT_PROJECT_ID\u003c/var\u003e`.iam.gserviceaccount.com`.\n\n Replace \u003cvar translate=\"no\"\u003eTENANT_PROJECT_ID\u003c/var\u003e with the\n tenant ID for your instance. To view the tenant project ID, go to\n the **Instances** page and click the instance name for instance\n details.\n\n [Go to Instances](https://console.cloud.google.com/data-fusion/locations/-/instances)\n 4. Select the **Data Lineage Events Producer** role.\n\n 5. Click **Save**.\n\nEnable Dataplex Universal Catalog data lineage in Cloud Data Fusion\n-------------------------------------------------------------------\n\nFor new instances in Cloud Data Fusion, Dataplex Universal Catalog data\nlineage is turned off by default. If you created the instance before January 27,\n2024 with version 6.8.0 or later, it's turned on by default after completing the\nsteps in [Before you begin](#before_you_begin).\n\n### Enable Dataplex Universal Catalog data lineage when you create an instance\n\n### Console\n\n\nTo enable Dataplex Universal Catalog data lineage when you create an instance,\nfollow these steps:\n\n1. Go to the Cloud Data Fusion **Instances** page and click **Create an\n instance**.\n\n [Create an instance](https://console.cloud.google.com/data-fusion/instance-create)\n2. When you configure the instance, expand the **Advanced options** section\n and click **Enable integration with Dataplex data lineage** . For more\n information about creating instances, see [Create a public\n instance](/data-fusion/docs/how-to/create-instance).\n\n### REST API\n\nTo enable Dataplex Universal Catalog data lineage when you create an instance,\nset the optional `dataplex_data_lineage_integration_enabled` property to\n`true`: \n\n echo '{ \"description\": \"CDAPinstance\", \"dataplex_data_lineage_integration_enabled\": \"true\"}' | curl -X POST \\\n -H \"Authorization: Bearer $(gcloud auth print-access-token)\" \\\n -H \"Content-Type: application/json\" \\\n --data @- \\\n \"https://datafusion.googleapis.com/v1/projects/\u003cvar translate=\"no\"\u003ePROJECT\u003c/var\u003e/locations/\u003cvar translate=\"no\"\u003eLOCATION\u003c/var\u003e/instances?instanceId=\u003cvar translate=\"no\"\u003eINSTANCE_NAME\u003c/var\u003e\"\n\nTo turn it off, either set the property to false or omit the property, as\nlineage is turned off by default when you create a new instance.\n\n### Enable or disable Dataplex Universal Catalog data lineage in an existing instance\n\n### Console\n\n\nTo enable or disable Dataplex Universal Catalog data lineage in an existing instance in\nCloud Data Fusion, follow these steps:\n\n1. View the instance details:\n 1. In the Google Cloud console, go to the Cloud Data Fusion page.\n\n 2. Click **Instances** , and then click the instance's name to go to the\n **Instance details** page.\n\n [Go to Instances](https://console.cloud.google.com/data-fusion/locations/-/instances)\n2. In the **Dataplex data lineage integration** field, click **Edit**.\n3. Enable or disable Dataplex Universal Catalog data lineage, and then click **Save**.\n\n### REST API\n\nTo enable Dataplex Universal Catalog data lineage in an existing instance in\nCloud Data Fusion, set the `dataplex_data_lineage_integration_enabled`\nproperty to `true` and include the `updateMask` parameter value: \n\n echo '{ \"description\": \"CDAPinstance\", \"dataplex_data_lineage_integration_enabled\": \"true\"}' | curl -X POST \\\n -H \"Authorization: Bearer $(gcloud auth print-access-token)\" \\\n -H \"Content-Type: application/json\" \\\n --data @- \\\n \"https://datafusion.googleapis.com/v1/projects/\u003cvar translate=\"no\"\u003ePROJECT\u003c/var\u003e/locations/\u003cvar translate=\"no\"\u003eLOCATION\u003c/var\u003e/instances?instanceId=\u003cvar translate=\"no\"\u003eINSTANCE_NAME\u003c/var\u003e?updateMask=dataplex_data_lineage_integration_enabled\"\n\nTo disable Dataplex Universal Catalog data lineage in an existing instance in\nCloud Data Fusion, set the `dataplex_data_lineage_integration_enabled`\nproperty to `false` and include the `updateMask` parameter value: \n\n echo '{ \"description\": \"CDAPinstance\", \"dataplex_data_lineage_integration_enabled\": \"false\"}' | curl -X POST \\\n -H \"Authorization: Bearer $(gcloud auth print-access-token)\" \\\n -H \"Content-Type: application/json\" \\\n --data @- \\\n \"https://datafusion.googleapis.com/v1/projects/\u003cvar translate=\"no\"\u003ePROJECT\u003c/var\u003e/locations/\u003cvar translate=\"no\"\u003eLOCATION\u003c/var\u003e/instances?instanceId=\u003cvar translate=\"no\"\u003eINSTANCE_NAME\u003c/var\u003e?updateMask=dataplex_data_lineage_integration_enabled\"\n\nView data lineage graphs\n------------------------\n\nTo view lineage graphs for entities across all Google Cloud services,\ndo the following:\n\n1. Go to your instance in Cloud Data Fusion and run a data pipeline\n that uses supported plugins.\n\n2. [View the lineage graphs on the Dataplex Universal Catalog page in the console](/dataplex/docs/use-lineage#view-lineage-graphs)\n and find the asset for which you want to view lineage information.\n\nLimitations\n-----------\n\nViewing lineage in Dataplex Universal Catalog has the following limitations:\n\n- The lineage in Dataplex Universal Catalog is only discoverable if there is a\n BigQuery entity connected to the supported plugins. For more\n information about when data lineage graphs are available, see\n [About data lineage](/dataplex/docs/about-data-lineage).\n\n- The Data Lineage API [doesn't support customer-managed encryption keys (CMEK)](/dataplex/docs/lineage-considerations#compliance).\n\n- Cloud Data Fusion doesn't support this feature in `me-central1` or\n `europe-west12` locations.\n\n- Review the\n [data lineage considerations](/dataplex/docs/lineage-considerations).\n\n| **Warning:** Dataplex Universal Catalog uses IP addresses to form a fully qualified name that uniquely identifies sources and sinks (such as a Database sink) to display lineage. If you must prevent sharing IP address or hostname information, don't enable Dataplex Universal Catalog data lineage integration.\n\nWhat's next\n-----------\n\n- Learn more about [data lineage](/dataplex/docs/about-data-lineage)."]]