Stay organized with collections
Save and categorize content based on your preferences.
Overview
Speech-to-Text On Device enables speech technology on
embedded devices. This feature allows you to run streaming speech recognition
fully on device, without any connection to a network or Google servers. The
on-device solution offers several benefits for this use case when compared to a
server-side solution: Speech recognition is available even if the device isn't
connected to the network or network connection is limited, and the user's data
doesn't leave the device.
Key capabilities
High quality transcription
Apply Google's algorithms to automatic speech recognition.
Offline
Speech Recognition without internet connection.
Low Latency
Speech Recognition runs fast locally on device.
Efficient models
Deploy efficiently with models that are less than 1 GB in size and consume minimal resources.
Voice Activity Detection
Detects start and end of human speech.
Confidence
Get confidence estimates on the transcription.
Model adaptation
Boost the transcription accuracy of rare and domain-specific words or phrases.
[[["Easy to understand","easyToUnderstand","thumb-up"],["Solved my problem","solvedMyProblem","thumb-up"],["Other","otherUp","thumb-up"]],[["Hard to understand","hardToUnderstand","thumb-down"],["Incorrect information or sample code","incorrectInformationOrSampleCode","thumb-down"],["Missing the information/samples I need","missingTheInformationSamplesINeed","thumb-down"],["Other","otherDown","thumb-down"]],["Last updated 2025-08-29 UTC."],[],[],null,["# Cloud Speech-to-Text On Device\n\n| **Private feature** \n| This product is a private feature. The documentation is publicly available but you must [contact Google](https://cloud.google.com/contact) for full access.\n\nOverview\n--------\n\n**Speech-to-Text On Device** enables speech technology on\nembedded devices. This feature allows you to run streaming speech recognition\nfully on device, without any connection to a network or Google servers. The\non-device solution offers several benefits for this use case when compared to a\nserver-side solution: Speech recognition is available even if the device isn't\nconnected to the network or network connection is limited, and the user's data\ndoesn't leave the device.\n\nWhat's next\n-----------\n\n[Contact Google Sales](https://cloud.google.com/contact) for access to [source](https://libgspeech.googlesource.com/), runtime and models. Then proceed to [get started](/speech-to-text/ondevice/docs/get_started)"]]