Tetap teratur dengan koleksi
Simpan dan kategorikan konten berdasarkan preferensi Anda.
Mentranskripsi ucapan menjadi teks menggunakan API
Halaman ini menjelaskan cara mengirim permintaan pengenalan ucapan ke Speech-to-Text menggunakan antarmuka REST dan perintah curl.
Speech-to-Text memudahkan integrasi teknologi pengenalan ucapan Google ke dalam aplikasi developer. Anda dapat mengirim
data audio ke Speech-to-Text API, yang kemudian menampilkan
transkripsi teks dari file audio tersebut. Untuk mengetahui informasi selengkapnya tentang
layanan ini, lihat
Dasar-dasar Speech-to-Text.
Sebelum memulai
Sebelum dapat mengirim permintaan ke Speech-to-Text API, Anda harus sudah menyelesaikan tindakan berikut. Lihat halaman sebelum memulai untuk mengetahui detailnya.
Aktifkan Speech-to-Text di project GCP.
Pastikan penagihan diaktifkan untuk Speech-to-Text.
Menginstal Google Cloud CLI.
Setelah penginstalan,
lakukan inisialisasi Google Cloud CLI dengan menjalankan perintah berikut:
(Opsional) Buat bucket Google Cloud Storage baru untuk menyimpan data audio Anda.
Membuat permintaan transkripsi audio
Sekarang Anda dapat menggunakan Speech-to-Text untuk mentranskripsi file audio menjadi teks. Gunakan contoh kode berikut untuk mengirim permintaan REST recognize ke Speech-to-Text API.
Buat file permintaan JSON dengan teks berikut, lalu simpan sebagai file teks biasa sync-request.json:
Cuplikan JSON di atas menunjukkan bahwa file audio ini memiliki format encoding FLAC, frekuensi sampel 16.000 Hz, dan disimpan di Google Cloud Storage pada URI yang ditentukan. File audio ini dapat diakses secara publik, jadi Anda tidak memerlukan kredensial autentikasi untuk mengaksesnya.
Gunakan curl untuk membuat permintaan speech:recognize, dengan meneruskan nama file permintaan JSON yang Anda siapkan di langkah 1:
Contoh perintah curl ini menggunakan perintah gcloud auth print-access-token untuk mendapatkan token autentikasi.
Perhatikan bahwa untuk meneruskan nama file ke curl, Anda menggunakan opsi -d (untuk "data") dan mengawali nama file dengan tanda @. File ini harus berada di direktori yang sama dengan tempat Anda menjalankan perintah curl.
Anda akan melihat respons seperti berikut:
{"results":[{"alternatives":[{"transcript":"how old is the Brooklyn Bridge","confidence":0.98267895}]}]}
Selamat! Anda telah mengirimkan permintaan pertama ke Speech-to-Text.
Jika Anda menerima error atau respons kosong dari Speech-to-Text, lihat langkah-langkah pemecahan masalah dan mitigasi error.
Pembersihan
Agar akun Google Cloud Anda tidak dikenai biaya untuk
resource yang digunakan pada halaman ini, ikuti langkah-langkah berikut.
Gunakan Google Cloud console untuk menghapus project jika Anda tidak membutuhkannya.
[[["Mudah dipahami","easyToUnderstand","thumb-up"],["Memecahkan masalah saya","solvedMyProblem","thumb-up"],["Lainnya","otherUp","thumb-up"]],[["Sulit dipahami","hardToUnderstand","thumb-down"],["Informasi atau kode contoh salah","incorrectInformationOrSampleCode","thumb-down"],["Informasi/contoh yang saya butuhkan tidak ada","missingTheInformationSamplesINeed","thumb-down"],["Masalah terjemahan","translationIssue","thumb-down"],["Lainnya","otherDown","thumb-down"]],["Terakhir diperbarui pada 2025-08-18 UTC."],[],[],null,["# Quickstart: Transcribe speech to text by using the API\n\nTranscribe speech to text by using the API\n==========================================\n\nThis page shows you how to send a speech recognition request to\nSpeech-to-Text using the [REST interface](/speech-to-text/docs/reference/rest)\nand the `curl` command.\n\n\nSpeech-to-Text enables easy integration of Google speech\nrecognition technologies into developer applications. You can send\naudio data to the Speech-to-Text API, which then returns a\ntext transcription of that audio file. For more information about\nthe service, see\n[Speech-to-Text basics](/speech-to-text/docs/basics).\n\n\nBefore you begin\n----------------\n\nBefore you can send a request to the Speech-to-Text API, you must have completed\nthe following actions. See the\n[before you begin](/speech-to-text/docs/before-you-begin) page for details.\n\n- Enable Speech-to-Text on a GCP project.\n 1. Make sure billing is enabled for Speech-to-Text.\n-\n [Install](/sdk/docs/install) the Google Cloud CLI.\n\n After installation,\n [initialize](/sdk/docs/initializing) the Google Cloud CLI by running the following command:\n\n ```bash\n gcloud init\n ```\n\n\n If you're using an external identity provider (IdP), you must first\n [sign in to the gcloud CLI with your federated identity](/iam/docs/workforce-log-in-gcloud).\n- (Optional) Create a new Google Cloud Storage bucket to store your audio data.\n\nMake an audio transcription request\n-----------------------------------\n\nNow you can use Speech-to-Text to transcribe an audio file\nto text. Use the following code sample to send a\n[`recognize`](/speech-to-text/docs/reference/rest/v1/speech/recognize) REST request to the Speech-to-Text API.\n\n1. Create a JSON request file with the following text, and save it as a\n `sync-request.json` plain text file:\n\n ```json\n {\n \"config\": {\n \"encoding\":\"FLAC\",\n \"sampleRateHertz\": 16000,\n \"languageCode\": \"en-US\",\n \"enableWordTimeOffsets\": false\n },\n \"audio\": {\n \"uri\":\"gs://cloud-samples-tests/speech/brooklyn.flac\"\n }\n }\n \n ```\n\n This JSON snippet indicates that the audio file has a FLAC encoding format,\n a sample rate of 16000 Hz, and that the audio file is stored on Google Cloud\n Storage at the given URI. The audio file is publicly accessible, so you\n don't need authentication credentials to access the file.\n2. Use `curl` to make a `speech:recognize`\n request, passing it the filename of the JSON request you set up in step 1:\n\n The sample `curl` command uses the `gcloud auth print-access-token` command to get an authentication token. \n\n ```sh\n curl -s -H \"Content-Type: application/json\" \\\n -H \"Authorization: Bearer \"$(gcloud auth print-access-token) \\\n https://speech.googleapis.com/v1/speech:recognize \\\n -d @sync-request.json\n \n ```\n\n Note that to pass a filename to `curl` you use the\n `-d` option (for \"data\") and precede the filename with an\n `@` sign. This file should be in the same directory in which you\n execute the `curl` command.\n\n You should see a response similar to the following: \n\n ```json\n {\n \"results\": [\n {\n \"alternatives\": [\n {\n \"transcript\": \"how old is the Brooklyn Bridge\",\n \"confidence\": 0.98267895\n }\n ]\n }\n ]\n }\n \n ```\n\nCongratulations! You've sent your first request to Speech-to-Text.\n\n\nIf you receive an error or an empty response from\nSpeech-to-Text, take a look at the\n[troubleshooting](/speech-to-text/docs/troubleshooting) and\n[error mitigation](/speech-to-text/docs/error-messages) steps.\n\nClean up\n--------\n\n\nTo avoid incurring charges to your Google Cloud account for\nthe resources used on this page, follow these steps.\n\n- Use the [Google Cloud console](https://console.cloud.google.com/) to delete your project if you do not need it.\n\nWhat's next\n-----------\n\n\n- Practice [transcribing short audio files](/speech-to-text/docs/sync-recognize).\n- Learn how to [batch long audio files for speech recognition](/speech-to-text/docs/async-recognize).\n- Learn how to [transcribe streaming audio](/speech-to-text/docs/streaming-recognize) like from a microphone.\n- Get started with the Speech-to-Text in your language of choice by using a [Speech-to-Text client library](/speech-to-text/docs/reference/libraries).\n- Work through the [sample applications](/speech-to-text/docs/samples).\n- For best performance, accuracy, and other tips, see the [best practices](/speech-to-text/docs/best-practices) documentation.\n\n\u003cbr /\u003e"]]