Stay organized with collections
Save and categorize content based on your preferences.
Dialogflow voice agents use
Speech-to-Text
for speech recognition,
which is included in
Dialogflow pricing.
Dialogflow automatically selects a speech recognition model for you,
but you can optionally specify the model.
If a model is not explicitly specified,
then Dialogflow auto-selects a model based on
the audio configuration in API requests and agent settings.
If
enhanced speech model
is enabled for the agent
and an enhanced version of the specified model for the language does not exist,
then the speech is recognized using the standard version of the specified model.
The following models typically have the best performance:
telephony_short (best for telephony Dialogflow)
telephony (best for Agent Assist)
phone_call (good for Agent Assist and telephony Dialogflow)
latest_short (best for non-telephony Dialogflow)
command_and_search (best for languages where other models are not available)
Specify a model
You can supply the model when calling the
detectIntent or streamingDetectIntent
methods on the
Sessions
type;
or when configuring the
ConversationProfile
for
Agent Assist.
[[["Easy to understand","easyToUnderstand","thumb-up"],["Solved my problem","solvedMyProblem","thumb-up"],["Other","otherUp","thumb-up"]],[["Hard to understand","hardToUnderstand","thumb-down"],["Incorrect information or sample code","incorrectInformationOrSampleCode","thumb-down"],["Missing the information/samples I need","missingTheInformationSamplesINeed","thumb-down"],["Other","otherDown","thumb-down"]],["Last updated 2025-04-02 UTC."],[[["Dialogflow voice agents utilize Speech-to-Text for speech recognition, which is factored into Dialogflow's pricing."],["While Dialogflow automatically chooses a speech recognition model, users have the option to select a specific model."],["The best-performing models typically include `telephony_short`, `telephony`, `phone_call`, `latest_short`, and `command_and_search`, depending on the use case."],["A speech model can be set during API calls or when configuring the ConversationProfile for Agent Assist, dictating which model is used for speech recognition."]]],[]]