Cloud Text-to-Speech V1 API - Module Google::Cloud::TextToSpeech::V1::CustomPronunciationParams::PhoneticEncoding (v1.9.0)

Reference documentation and code samples for the Cloud Text-to-Speech V1 API module Google::Cloud::TextToSpeech::V1::CustomPronunciationParams::PhoneticEncoding.

The phonetic encoding of the phrase.

Constants

PHONETIC_ENCODING_UNSPECIFIED

value: 0
Not specified.

PHONETIC_ENCODING_IPA

value: 1
IPA, such as apple -> ˈæpəl. https://en.wikipedia.org/wiki/International_Phonetic_Alphabet

PHONETIC_ENCODING_X_SAMPA

value: 2
X-SAMPA, such as apple -> "{p@l". https://en.wikipedia.org/wiki/X-SAMPA

PHONETIC_ENCODING_JAPANESE_YOMIGANA

value: 3
For reading-to-pron conversion to work well, the pronunciation field should only contain Kanji, Hiragana, and Katakana.

The pronunciation can also contain pitch accents. The start of a pitch phrase is specified with ^ and the down-pitch position is specified with !, for example:

phrase:端  pronunciation:^はし
phrase:箸  pronunciation:^は!し
phrase:橋  pronunciation:^はし!

We currently only support the Tokyo dialect, which allows at most one down-pitch per phrase (i.e. at most one ! between ^).

PHONETIC_ENCODING_PINYIN

value: 4
Used to specify pronunciations for Mandarin words. See https://en.wikipedia.org/wiki/Pinyin.

For example: 朝阳, the pronunciation is "chao2 yang2". The number represents the tone, and there is a space between syllables. Neutral tones are represented by 5, for example 孩子 "hai2 zi5".