Cloud Text-to-Speech V1 API - Module Google::Cloud::TextToSpeech::V1::AudioEncoding (v1.14.0)

Reference documentation and code samples for the Cloud Text-to-Speech V1 API module Google::Cloud::TextToSpeech::V1::AudioEncoding.

Configuration for the audio encoder. The encoding determines the format of the output audio.
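
As a rough illustration of where this enum is used, the sketch below builds a request hash that selects an output encoding by symbol. The helper name `build_synthesis_request` and the `en-US` language code are assumptions made for this example and are not part of the library; the hash layout (`input`, `voice`, `audio_config`) is assumed to match what the client's `synthesize_speech` call accepts.

```ruby
# Hypothetical helper (not part of the client library): builds a request hash
# for a speech synthesis call with the chosen output encoding.
def build_synthesis_request(text, encoding: :MP3)
  {
    input: { text: text },
    voice: { language_code: "en-US" },          # assumed language code
    audio_config: { audio_encoding: encoding }  # enum given as a symbol
  }
end

request = build_synthesis_request("Hello, world!", encoding: :OGG_OPUS)
puts request[:audio_config][:audio_encoding]  # prints OGG_OPUS

# With the google-cloud-text_to_speech gem installed and credentials set up,
# the hash would presumably be passed along these lines:
#   client = Google::Cloud::TextToSpeech.text_to_speech
#   response = client.synthesize_speech request
```

Passing the encoding as a symbol keeps the example independent of the gem; the fully qualified constant (for example `Google::Cloud::TextToSpeech::V1::AudioEncoding::OGG_OPUS`) should work in the same position.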

Constants

AUDIO_ENCODING_UNSPECIFIED

value: 0
Not specified. Only used by GenerateVoiceCloningKey. Any other request that leaves the encoding unspecified returns google.rpc.Code.INVALID_ARGUMENT.

LINEAR16

value: 1
Uncompressed 16-bit signed little-endian samples (Linear PCM). Audio content returned as LINEAR16 also contains a WAV header.
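
Because LINEAR16 content carries a WAV header, a caller may want to confirm that header before writing or parsing the audio. The following is a minimal sketch that only checks the RIFF and WAVE magic bytes; `wav_header?` is a hypothetical helper name, and the sample bytes are constructed by hand rather than taken from an API response.

```ruby
# Hypothetical helper: returns true when the bytes begin with a RIFF/WAVE header.
def wav_header?(bytes)
  data = bytes.b  # binary-encoded copy, so comparisons are byte-wise
  data.bytesize >= 12 &&
    data.byteslice(0, 4) == "RIFF".b &&
    data.byteslice(8, 4) == "WAVE".b
end

# Hand-built stand-in for the first bytes of a WAV file.
sample = "RIFF".b + [36].pack("V") + "WAVE".b + "fmt ".b
puts wav_header?(sample)          # prints true
puts wav_header?("\x00\x01".b)    # prints false
```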

MP3

value: 2
MP3 audio at 32kbps.
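
The stated bitrate allows a rough conversion between file size and playback time. This sketch assumes a constant 32 kbps stream and ignores any metadata or framing overhead, so the result is an estimate only; `estimated_mp3_seconds` is a hypothetical helper name.

```ruby
MP3_BITRATE_BPS = 32_000  # 32 kbps, as stated for the MP3 encoding

# Hypothetical helper: approximate playback time for a given payload size.
def estimated_mp3_seconds(byte_count)
  (byte_count * 8) / MP3_BITRATE_BPS.to_f
end

puts estimated_mp3_seconds(40_000)  # prints 10.0
```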

OGG_OPUS

value: 3
Opus-encoded audio wrapped in an Ogg container. The resulting file can be played natively on Android and in browsers (at least Chrome and Firefox). The encoding quality is considerably higher than MP3 at approximately the same bitrate.

MULAW

value: 5
8-bit samples that compand 14-bit audio samples using G.711 PCMU/mu-law. Audio content returned as MULAW also contains a WAV header.

ALAW

value: 6
8-bit samples that compand 14-bit audio samples using G.711 PCMA/A-law. Audio content returned as ALAW also contains a WAV header.

PCM

value: 7
Uncompressed 16-bit signed little-endian samples (Linear PCM). Unlike LINEAR16, the audio is not wrapped in a WAV (or any other) header.
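
Since PCM output has no header, a player needs the sample rate and channel count supplied separately. As one possible approach, the sketch below prepends a standard 44-byte WAV header to raw 16-bit samples. The helper name `wrap_pcm_in_wav`, the 24,000 Hz sample rate, and the mono default are assumptions for illustration; the actual rate depends on the request's audio configuration.

```ruby
# Hypothetical helper: prepends a canonical 44-byte WAV header to raw
# 16-bit little-endian PCM samples.
def wrap_pcm_in_wav(pcm, sample_rate:, channels: 1)
  pcm = pcm.b
  bits_per_sample = 16
  block_align = channels * bits_per_sample / 8  # bytes per sample frame
  byte_rate = sample_rate * block_align         # bytes per second

  header = "RIFF".b +
           [36 + pcm.bytesize].pack("V") +      # size of everything after this field
           "WAVE".b + "fmt ".b +
           # fmt chunk: size 16, format 1 (PCM), channels, rate, byte rate,
           # block align, bits per sample
           [16, 1, channels, sample_rate, byte_rate, block_align, bits_per_sample].pack("VvvVVvv") +
           "data".b +
           [pcm.bytesize].pack("V")
  header + pcm
end

# Three hand-made samples standing in for audio returned with the PCM encoding.
samples = [0, 1000, -1000].pack("s<*")
wav = wrap_pcm_in_wav(samples, sample_rate: 24_000)  # assumed sample rate
puts wav.bytesize  # prints 50
```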

M4A

value: 8
M4A audio.
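
The numeric values listed above are not contiguous (there is no constant with value 4), which matters when mapping raw integers back to names. The sketch below mirrors the table in a plain hash so it runs without the gem; `AUDIO_ENCODING_VALUES` and `encoding_name` are hypothetical names introduced for this example.

```ruby
# Mirror of the constants listed on this page (name => numeric value).
AUDIO_ENCODING_VALUES = {
  AUDIO_ENCODING_UNSPECIFIED: 0,
  LINEAR16: 1,
  MP3: 2,
  OGG_OPUS: 3,
  MULAW: 5,
  ALAW: 6,
  PCM: 7,
  M4A: 8
}.freeze

# Hypothetical helper: returns the constant name for a numeric value,
# or nil when no constant uses that value.
def encoding_name(value)
  AUDIO_ENCODING_VALUES.key(value)
end

puts encoding_name(2).inspect  # prints :MP3
puts encoding_name(4).inspect  # prints nil
```

With the gem loaded, the generated enum module is expected to offer equivalent lookups (for example `AudioEncoding.lookup(2)` and `AudioEncoding.resolve(:MP3)`), which would be preferable to a hand-maintained table.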
