Stay organized with collections
Save and categorize content based on your preferences.
Normalization
For many specific supported fields, Document AI also returns an
entity.normalizedValue
in addition to the raw extracted field obtained through the textAnchor of each
entity. It normalize the literal text. Normalization often breaks the text value
up into sub-fields.
This contain the data in a standardized format to reduce post processing, and
enable conversion to whatever format is selected. The mentionText, representing
what is literally on the document, is never changed by normalization.
Normalized fields belong to one of the following categories.
Normalized values in the console
In the Google Cloud console, the normalized fields are annotated with G. For example:
Sample normalized field shown in the web application.
Supported processors
Here are the processors and fields that support entity enrichment and normalization:
[[["Easy to understand","easyToUnderstand","thumb-up"],["Solved my problem","solvedMyProblem","thumb-up"],["Other","otherUp","thumb-up"]],[["Hard to understand","hardToUnderstand","thumb-down"],["Incorrect information or sample code","incorrectInformationOrSampleCode","thumb-down"],["Missing the information/samples I need","missingTheInformationSamplesINeed","thumb-down"],["Other","otherDown","thumb-down"]],["Last updated 2025-03-25 UTC."],[[["Document AI provides `entity.normalizedValue` for supported fields, standardizing extracted data for easier post-processing and format conversion."],["Normalization breaks down text values into sub-fields while preserving the original `mentionText` found on the document."],["Several pretrained processors, including Bank Statement, US Passport, Utility, Identity Document Proofing, Pay Slip, US Driver License, Expense, and Invoice Parsers, support entity normalization."],["Custom Extractor supports normalization for common data types such as `dateTime`, `currency`, `money`, and `number`."],["Normalized fields are labeled with a **G** in the Google Cloud console for easy identification."]]],[]]