您创建并拥有此项目。默认情况下,Cloud Data Fusion 会在此项目中创建临时 Dataproc 集群,以运行您的流水线。
下图展示了在租户项目中运行的 Cloud Data Fusion 实例和客户项目中的 Dataproc 集群上运行的流水线。
Cloud Data Fusion 中的服务账号
服务账号可为 Cloud Data Fusion 提供身份,该身份可让 Cloud Data Fusion 访问您的资源。
启用 Cloud Data Fusion API 并创建 Cloud Data Fusion 实例后,系统会向您的项目添加一个服务账号,以便访问 Service Networking、Dataproc、Cloud Storage、BigQuery、Spanner 和 Bigtable 等资源。此服务账号称为 Cloud Data Fusion API Service Agent。系统会自动向此服务代理授予角色。
服务代理(称为 Cloud Data Fusion API Service Agent),Cloud Data Fusion 会创建该代理以获得对客户资源的访问权限,以便其代表客户执行操作。它在租户项目中用于访问客户项目资源。例如,预览版在内存(而不是 Dataproc 集群)中运行。
默认情况下,分配给 Cloud Data Fusion 服务账号的 Cloud Data Fusion API Service Agent (roles/datafusion.serviceAgent) Identity and Access Management 角色包含额外的权限,以确保提供最佳用户体验。为增强安全性,您可以创建一个自定义角色,为其分配一组任务所需的最低权限,然后将其分配给 Cloud Data Fusion 服务账号。
[[["易于理解","easyToUnderstand","thumb-up"],["解决了我的问题","solvedMyProblem","thumb-up"],["其他","otherUp","thumb-up"]],[["很难理解","hardToUnderstand","thumb-down"],["信息或示例代码不正确","incorrectInformationOrSampleCode","thumb-down"],["没有我需要的信息/示例","missingTheInformationSamplesINeed","thumb-down"],["翻译问题","translationIssue","thumb-down"],["其他","otherDown","thumb-down"]],["最后更新时间 (UTC):2025-04-02。"],[[["Cloud Data Fusion uses service accounts to access resources in both tenant and customer projects, enabling it to manage pipelines on the user's behalf."],["The Cloud Data Fusion API Service Agent is a service account created automatically when enabling the Cloud Data Fusion API, granting it access to resources like Service Networking, Dataproc, Cloud Storage, and others."],["A default Compute Engine service account is also created to deploy jobs that access other Google Cloud resources, which can attach to a Dataproc cluster VM to enable Cloud Data Fusion to access Dataproc resources during pipeline runs."],["In Cloud Data Fusion Enterprise edition, pipelines can run from a user-managed service account by creating a profile in the Cloud Data Fusion console, enhancing control and customization."],["Customer project is owned by the customer and is the location where the ephemeral Dataproc cluster is located in order to run the user's pipelines."]]],[]]