Transcribe speech to text by using the gcloud CLI
This page shows you how to send a speech recognition request to
Speech-to-Text using the gcloud tool from the command line.
Speech-to-Text enables easy integration of Google speech recognition
technologies into developer applications. You can send audio data to the
Speech-to-Text API, which then returns a text transcription of that audio
file. For more information about the service, see
Speech-to-Text basics.
Before you begin
Before you can send a request to the Speech-to-Text API, you must complete
the following actions. See the
before you begin page for details.
Enable Speech-to-Text on a GCP project.
Make sure billing is enabled for Speech-to-Text.
Install the Google Cloud CLI.
After installation, initialize the Google Cloud CLI by running the following command: gcloud init
If you're using an external identity provider (IdP), you must first sign in to the gcloud CLI with your federated identity.
(Optional) Create a new Google Cloud Storage bucket to store your audio data.
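As a rough illustration, the setup steps above could be scripted as follows. This is a minimal sketch, not an official procedure: `PROJECT_ID` and `BUCKET` are hypothetical placeholders, the `run` dry-run helper is an illustrative convenience, and the script assumes you have already run `gcloud init` and enabled billing for the project. By default it only prints the commands so you can review them.

```bash
#!/usr/bin/env bash
# Sketch of the prerequisite steps as gcloud commands.
# PROJECT_ID and BUCKET are hypothetical placeholders; use your own values.
PROJECT_ID="${PROJECT_ID:-my-project-id}"
BUCKET="${BUCKET:-my-audio-bucket}"

# Print each command by default; set DRY_RUN=0 to actually execute it.
run() {
  if [ "${DRY_RUN:-1}" = "1" ]; then
    echo "+ $*"
  else
    "$@"
  fi
}

setup_speech_prereqs() {
  # Select the project that subsequent commands apply to.
  run gcloud config set project "$PROJECT_ID"
  # Enable the Speech-to-Text API on that project.
  run gcloud services enable speech.googleapis.com
  # (Optional) Create a bucket to hold your own audio files.
  run gcloud storage buckets create "gs://$BUCKET"
}

setup_speech_prereqs
```

Once the printed commands look right for your project, rerun the script with `DRY_RUN=0` to execute them.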
Make an audio transcription request
Now you can use Speech-to-Text to transcribe an audio file to
text. Use the following code sample to send a
recognize request to the Speech-to-Text API.
Open the command line shell and run the following command.
```bash
gcloud ml speech recognize gs://cloud-samples-tests/speech/brooklyn.flac \
 --language-code=en-US
```

This command requests that Speech-to-Text transcribe
the audio contained in a [FLAC](/speech/docs/encoding) hosted at
a publicly accessible location.

If the request is successful, the server returns a response in
JSON format:

```text
{
  "results": [
    {
      "alternatives": [
        {
          "confidence": 0.9840146,
          "transcript": "how old is the Brooklyn Bridge"
        }
      ]
    }
  ]
}
```

Congratulations! You've sent your first request to Speech-to-Text.

If you receive an error or an empty response from
Speech-to-Text, take a look at the
[troubleshooting](/speech-to-text/docs/troubleshooting) and
[error mitigation](/speech-to-text/docs/error-messages) steps.

Clean up
--------

To avoid incurring charges to your Google Cloud account for
the resources used on this page, follow these steps.

- Use the [Google Cloud console](https://console.cloud.google.com/) to delete your project if you do not need it.

What's next
-----------

- Practice [transcribing short audio files](/speech-to-text/docs/sync-recognize).
- Learn how to [batch long audio files for speech recognition](/speech-to-text/docs/async-recognize).
- Learn how to [transcribe streaming audio](/speech-to-text/docs/streaming-recognize) like from a microphone.
- Get started with the Speech-to-Text in your language of choice by using a [Speech-to-Text client library](/speech-to-text/docs/reference/libraries).
- Work through the [sample applications](/speech-to-text/docs/samples).
- For best performance, accuracy, and other tips, see the [best practices](/speech-to-text/docs/best-practices) documentation.

Last updated 2025-08-29 UTC.
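If you only need the transcript text from the JSON response shown earlier, a small filter can pull it out. The sketch below is an illustration, not part of the gcloud CLI: `extract_transcripts` is a hypothetical helper, and it assumes the response is printed in the multi-line layout shown on this page, with each `"transcript"` field on its own line. It does not unescape quotes inside a transcript.

```bash
#!/usr/bin/env bash
# Hypothetical helper: read a recognize response on stdin and print the
# value of each "transcript" field, one per line. Assumes pretty-printed
# JSON with one field per line; this is not a general JSON parser.
extract_transcripts() {
  sed -n 's/.*"transcript": *"\(.*\)".*/\1/p'
}

# Sample response copied from this page, used here instead of a live call.
sample_response='{
  "results": [
    {
      "alternatives": [
        {
          "confidence": 0.9840146,
          "transcript": "how old is the Brooklyn Bridge"
        }
      ]
    }
  ]
}'

printf '%s\n' "$sample_response" | extract_transcripts
# prints: how old is the Brooklyn Bridge
```

To use it on a live request, pipe the output of the `gcloud ml speech recognize` command into `extract_transcripts`. If `jq` is installed, `jq -r '.results[].alternatives[0].transcript'` is a more robust alternative, because it parses the JSON instead of matching lines.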