Reference documentation and code samples for the Cloud Natural Language v2 API enum EncodingType.
Represents the text encoding that the caller uses to process the output.
Providing an EncodingType is recommended because the API provides the
beginning offsets for various outputs, such as tokens and mentions, and
languages that natively use different text encodings may access offsets
differently.
If EncodingType is not specified, encoding-dependent information (such as
begin_offset) will be set at -1.
Utf16
Encoding-dependent information (such as begin_offset) is calculated based
on the UTF-16 encoding of the input. Java and JavaScript are examples of
languages that use this encoding natively.
Utf32
Encoding-dependent information (such as begin_offset) is calculated based
on the UTF-32 encoding of the input. Python is an example of a language
that uses this encoding natively.
Utf8
Encoding-dependent information (such as begin_offset) is calculated based
on the UTF-8 encoding of the input. C++ and Go are examples of languages
that use this encoding natively.
[[["Easy to understand","easyToUnderstand","thumb-up"],["Solved my problem","solvedMyProblem","thumb-up"],["Other","otherUp","thumb-up"]],[["Hard to understand","hardToUnderstand","thumb-down"],["Incorrect information or sample code","incorrectInformationOrSampleCode","thumb-down"],["Missing the information/samples I need","missingTheInformationSamplesINeed","thumb-down"],["Other","otherDown","thumb-down"]],["Last updated 2025-03-21 UTC."],[[["This documentation covers the `EncodingType` enum within the Google Cloud Natural Language v2 API, specifically versions 1.0.0-beta04 and 1.0.0-beta05."],["The `EncodingType` enum is used to specify the text encoding the caller uses, which is recommended because it impacts how the API calculates beginning offsets for elements like tokens and mentions."],["The `EncodingType` enum includes options for `None` (no encoding specified), `Utf16`, `Utf32`, and `Utf8`, each corresponding to different text encoding standards."],["Each encoding option (`Utf16`, `Utf32`, `Utf8`) represents different language encodings and determines how the API calculates `begin_offset` for elements."]]],[]]