Stay organized with collections
Save and categorize content based on your preferences.
You can convert invoices into structured data in Cloud Data Fusion
using the Invoice Parser plugin, which is powered by Document AI. The
structured data gets stored in BigQuery.
Before you begin
To parse invoices, you need a Cloud Data Fusion instance running in
version 6.4.1 or later. For more information, see
Upgrading Cloud Data Fusion instances.
Create a processor
In the Google Cloud console, go to the Document AI Processors
page.
Ensure that the desired instance has been upgraded to version 6.4.1 or
later. For earlier versions, upgrade the instance.
Click View instance.
The Cloud Data Fusion UI opens.
Click Hub.
Click GCP, and then deploy GCP Plugins.
Click DocAI, and then deploy the Doc AI Plugins.
Click the Invoice Parser Quickstart>Create.
Customize your pipeline by entering the Invoice Parser processor ID,
Cloud Storage bucket path, and BigQuery table details.
Deploy and run the pipeline.
Parsed invoices are stored in the output table in BigQuery.
Metadata from the invoices is stored in the Metadata table and includes
parsing status, Cloud Storage path, and upload timestamp of the raw
invoice. Records in the output and metadata tables can be joined with the
invoice_uuid key.
[[["Easy to understand","easyToUnderstand","thumb-up"],["Solved my problem","solvedMyProblem","thumb-up"],["Other","otherUp","thumb-up"]],[["Hard to understand","hardToUnderstand","thumb-down"],["Incorrect information or sample code","incorrectInformationOrSampleCode","thumb-down"],["Missing the information/samples I need","missingTheInformationSamplesINeed","thumb-down"],["Other","otherDown","thumb-down"]],["Last updated 2025-03-25 UTC."],[[["The Invoice Parser plugin in Cloud Data Fusion, powered by Document AI, allows you to convert invoices into structured data."],["Parsed data from invoices is stored in BigQuery, with invoice metadata stored in a separate table for additional information."],["To use the Invoice Parser plugin, you must have a Cloud Data Fusion instance running version 6.4.1 or later and you need to have created an Invoice Parser processor in Document AI."],["The pipeline is customizable, allowing you to specify the Invoice Parser processor ID, Cloud Storage bucket path, and BigQuery table details during configuration."]]],[]]