Transcribe speech to text to create a written document, such as a
text message, email, or report.
Discussion
A conversation or discussion among multiple people, for example a
meeting with two or more people actively participating. Typically,
all the primary speakers are in the same room (if not, see
PhoneCall).
PhoneCall
A phone call or video conference in which two or more people, who are
not in the same room, are actively participating.
Presentation
One or more persons lecturing or presenting to others, mostly
uninterrupted.
ProfessionallyProduced
Professionally produced audio (e.g., TV show, podcast).
Unspecified
The use case is either unknown or is not covered by any of the other
values.
VoiceCommand
Transcribe voice commands, such as for controlling a device.
Voicemail
A recorded message intended for another person to listen to.
VoiceSearch
Transcribe spoken questions and queries into text.
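The distinction between Discussion, PhoneCall, and Presentation depends on who is speaking and where. A minimal sketch of that decision follows, assuming a local stand-in enum that only mirrors the value names listed above; it is not the client library's type, and the helper `classify_conversation` and its parameters are hypothetical names introduced for illustration.

```python
from enum import Enum


class InteractionType(Enum):
    """Local stand-in mirroring the documented value names (assumption:
    this is NOT the client library's enum, and the string values are
    illustrative only)."""
    UNSPECIFIED = "Unspecified"
    DICTATION = "Dictation"
    DISCUSSION = "Discussion"
    PHONE_CALL = "PhoneCall"
    PRESENTATION = "Presentation"
    PROFESSIONALLY_PRODUCED = "ProfessionallyProduced"
    VOICE_COMMAND = "VoiceCommand"
    VOICEMAIL = "Voicemail"
    VOICE_SEARCH = "VoiceSearch"


def classify_conversation(active_speakers: int,
                          same_room: bool,
                          mostly_uninterrupted: bool = False) -> InteractionType:
    """Hypothetical helper: pick a value for multi-person audio.

    Follows the descriptions above: lecturing that is mostly
    uninterrupted is a Presentation; two or more active participants are
    a Discussion when in the same room and a PhoneCall otherwise.
    """
    if active_speakers < 1:
        return InteractionType.UNSPECIFIED
    if mostly_uninterrupted:
        return InteractionType.PRESENTATION
    if active_speakers >= 2:
        if same_room:
            return InteractionType.DISCUSSION
        return InteractionType.PHONE_CALL
    # A single speaker could be dictation, a voice command, a voice
    # search, or a voicemail; this sketch does not try to tell them apart.
    return InteractionType.UNSPECIFIED


if __name__ == "__main__":
    print(classify_conversation(3, same_room=True).value)
    print(classify_conversation(2, same_room=False).value)
```

Falling back to Unspecified whenever the inputs do not clearly match a description keeps the sketch from guessing; a real application would likely set the value directly from what it already knows about the recording.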
Last updated 2025-03-21 UTC.