Stay organized with collections
Save and categorize content based on your preferences.
Overview
Cloud Speech-to-Text On-Prem enables easy integration of Google speech recognition
technologies into your on-premises solution. The Speech-To-Text (STT) On-Prem solution gives you
full control over your infrastructure and protected speech data in order to
meet data residency and compliance requirements. This best-in-class machine
learning technology gives you access to the next-generation speech recognition
models that are more accurate, smaller in size, and require fewer computing
resources to run than existing solutions.
Speech-to-Text On-Prem is a
Google Cloud Marketplace application
and can be deployed as a container to any GKE cluster. This gives you
flexibility and greater control in deployment, whether you decide to deploy on
Google Cloud with GKE or on-premises with Anthos. This lets you to take
advantage of the simplicity, agility, and cost-effectiveness of Google's
container hosting and management across hybrid environments.
Key capabilities
High quality transcription
Apply Google's advanced deep learning neural network algorithms to automatic speech recognition.
Deployable anywhere
Run in any GKE or Anthos cluster.
Efficient models
Deploy efficiently with models that are less than 1 GB in size and consume minimal resources.
API compatible
Full compatibility with the Speech-to-Text API and its client libraries.
Istio service mesh
Use our pre-built Istio objects to seamlessly scale up to thousands of connections.
Stackdriver integration
Export metadata logs to one centralized location.
Supported languages
Support your global user base with language supports in English, French, German, Spanish, Portuguese, Cantonese, and Japanese.
Reference architecture
Deployment and installation
See the Speech-to-Text On-Prem
pricing page for an outline
of how cost is calculated.
[[["Easy to understand","easyToUnderstand","thumb-up"],["Solved my problem","solvedMyProblem","thumb-up"],["Other","otherUp","thumb-up"]],[["Hard to understand","hardToUnderstand","thumb-down"],["Incorrect information or sample code","incorrectInformationOrSampleCode","thumb-down"],["Missing the information/samples I need","missingTheInformationSamplesINeed","thumb-down"],["Other","otherDown","thumb-down"]],["Last updated 2025-08-29 UTC."],[],[],null,["# Cloud Speech-to-Text On-Prem\n\n| **Private feature** \n| This product is a private feature. The documentation is publicly available but you must [contact Google](https://cloud.google.com/contact) for full access.\n\nOverview\n--------\n\nCloud Speech-to-Text On-Prem enables easy integration of Google speech recognition\ntechnologies into your on-premises solution. The Speech-To-Text (STT) On-Prem solution gives you\nfull control over your infrastructure and protected speech data in order to\nmeet data residency and compliance requirements. This best-in-class machine\nlearning technology gives you access to the next-generation speech recognition\nmodels that are more accurate, smaller in size, and require fewer computing\nresources to run than existing solutions.\n\nSpeech-to-Text On-Prem is a\n[Google Cloud Marketplace](https://cloud.google.com/marketplace/) application\nand can be deployed as a container to any GKE cluster. This gives you\nflexibility and greater control in deployment, whether you decide to deploy on\nGoogle Cloud with GKE or on-premises with Anthos. This lets you to take\nadvantage of the simplicity, agility, and cost-effectiveness of Google's\ncontainer hosting and management across hybrid environments.\n\nReference architecture\n----------------------\n\nDeployment and installation\n---------------------------\n\n1. See the Speech-to-Text On-Prem [pricing page](/speech-to-text/priv/pricing) for an outline of how cost is calculated.\n2. [Contact your seller](https://cloud.google.com/contact) to get access to the solution.\n3. Deploy the application to your cluster.\n4. Configure your chosen client library to access your deployment.\n5. Start transcribing your audio files."]]