Tetap teratur dengan koleksi
Simpan dan kategorikan konten berdasarkan preferensi Anda.
Mentranskripsikan speech to text dengan menggunakan gcloud CLI
Halaman ini menunjukkan cara mengirimkan permintaan pengenalan ucapan ke
Speech-to-Text menggunakan alat
gcloud dari command
line.
Speech-to-Text memudahkan integrasi teknologi pengenalan ucapan Google ke dalam aplikasi developer. Anda dapat mengirim
data audio ke Speech-to-Text API, yang kemudian menampilkan
transkripsi teks dari file audio tersebut. Untuk mengetahui informasi selengkapnya tentang
layanan ini, lihat
Dasar-dasar Speech-to-Text.
Sebelum memulai
Sebelum dapat mengirim permintaan ke Speech-to-Text API, Anda harus sudah menyelesaikan tindakan berikut. Lihat halaman sebelum memulai untuk mengetahui detailnya.
Aktifkan Speech-to-Text di project GCP.
Pastikan penagihan diaktifkan untuk Speech-to-Text.
Menginstal Google Cloud CLI.
Setelah penginstalan,
lakukan inisialisasi Google Cloud CLI dengan menjalankan perintah berikut:
(Opsional) Buat bucket Google Cloud Storage baru untuk menyimpan data audio Anda.
Membuat permintaan transkripsi audio
Kini Anda dapat menggunakan Speech-to-Text untuk mentranskripsikan file audio
ke teks. Gunakan contoh kode berikut untuk mengirim
permintaan recognize ke Speech-to-Text API.
Buka shell command line dan jalankan perintah berikut.
[[["Mudah dipahami","easyToUnderstand","thumb-up"],["Memecahkan masalah saya","solvedMyProblem","thumb-up"],["Lainnya","otherUp","thumb-up"]],[["Sulit dipahami","hardToUnderstand","thumb-down"],["Informasi atau kode contoh salah","incorrectInformationOrSampleCode","thumb-down"],["Informasi/contoh yang saya butuhkan tidak ada","missingTheInformationSamplesINeed","thumb-down"],["Masalah terjemahan","translationIssue","thumb-down"],["Lainnya","otherDown","thumb-down"]],["Terakhir diperbarui pada 2025-08-18 UTC."],[],[],null,["# Quickstart: Transcribe speech to text by using the gcloud CLI\n\nTranscribe speech to text by using the gcloud CLI\n=================================================\n\nThis page shows you how to send a speech recognition request to\nSpeech-to-Text using the\n[`gcloud`](/sdk/gcloud/reference/ml/speech) tool from the command\nline.\n\n\nSpeech-to-Text enables easy integration of Google speech\nrecognition technologies into developer applications. You can send\naudio data to the Speech-to-Text API, which then returns a\ntext transcription of that audio file. For more information about\nthe service, see\n[Speech-to-Text basics](/speech-to-text/docs/basics).\n\nBefore you begin\n----------------\n\nBefore you can send a request to the Speech-to-Text API, you must have completed\nthe following actions. See the\n[before you begin](/speech-to-text/docs/before-you-begin) page for details.\n\n- Enable Speech-to-Text on a GCP project.\n 1. Make sure billing is enabled for Speech-to-Text.\n-\n [Install](/sdk/docs/install) the Google Cloud CLI.\n\n After installation,\n [initialize](/sdk/docs/initializing) the Google Cloud CLI by running the following command:\n\n ```bash\n gcloud init\n ```\n\n\n If you're using an external identity provider (IdP), you must first\n [sign in to the gcloud CLI with your federated identity](/iam/docs/workforce-log-in-gcloud).\n- (Optional) Create a new Google Cloud Storage bucket to store your audio data.\n\nMake an audio transcription request\n-----------------------------------\n\nNow you can use Speech-to-Text to transcribe an audio file\nto text. Use the following code sample to send a\n[`recognize`](/speech-to-text/docs/reference/rest/v1/speech/recognize) request to the Speech-to-Text API.\n\nOpen the command line shell and run the following command. \n\n```bash\ngcloud ml speech recognize gs://cloud-samples-tests/speech/brooklyn.flac \\\n --language-code=en-US\n```\n\nThis command requests that Speech-to-Text transcribe\nthe audio contained in a [FLAC](/speech/docs/encoding) hosted at\na publicly accessible location.\n\nIf the request is successful, the server returns a response in\nJSON format: \n\n```text\n{\n \"results\": [\n {\n \"alternatives\": [\n {\n \"confidence\": 0.9840146,\n \"transcript\": \"how old is the Brooklyn Bridge\"\n }\n ]\n }\n ]\n}\n```\n\nCongratulations! You've sent your first request to Speech-to-Text.\n\n\nIf you receive an error or an empty response from\nSpeech-to-Text, take a look at the\n[troubleshooting](/speech-to-text/docs/troubleshooting) and\n[error mitigation](/speech-to-text/docs/error-messages) steps.\n\nClean up\n--------\n\n\nTo avoid incurring charges to your Google Cloud account for\nthe resources used on this page, follow these steps.\n\n- Use the [Google Cloud console](https://console.cloud.google.com/) to delete your project if you do not need it.\n\nWhat's next\n-----------\n\n\n- Practice [transcribing short audio files](/speech-to-text/docs/sync-recognize).\n- Learn how to [batch long audio files for speech recognition](/speech-to-text/docs/async-recognize).\n- Learn how to [transcribe streaming audio](/speech-to-text/docs/streaming-recognize) like from a microphone.\n- Get started with the Speech-to-Text in your language of choice by using a [Speech-to-Text client library](/speech-to-text/docs/reference/libraries).\n- Work through the [sample applications](/speech-to-text/docs/samples).\n- For best performance, accuracy, and other tips, see the [best practices](/speech-to-text/docs/best-practices) documentation.\n\n\u003cbr /\u003e"]]