Tetap teratur dengan koleksi
Simpan dan kategorikan konten berdasarkan preferensi Anda.
Speech-to-Text menyediakan model latest_short untuk mengenali ucapan yang
terdiri dari satu ucapan. Model ini berfungsi ketika pengguna
mengeluarkan satu perintah suara, bukan monolog atau dikte berdurasi panjang.
Saat pengenal dengan model latest_short digunakan untuk permintaan
pengenalan, Speech-to-Text akan berhenti melakukan pengenalan setelah mendeteksi
bahwa ucapan telah selesai. Speech-to-Text akan menampilkan respons peristiwa
aktivitas ucapan dengan jenis END_OF_SINGLE_UTTERANCE diikuti dengan
hasil transkripsi.
Ucapan tunggal dan StreamingRecognize
Jika model latest_shortRecognizer
dipilih untuk permintaan StreamingRecognize, Speech-to-Text akan menutup
streaming secara otomatis setelah ucapan berakhir.
Dengan peristiwa aktivitas suara
Jika peristiwa aktivitas suara juga telah diaktifkan untuk
permintaan StreamingRecognize, Speech-to-Text akan tetap menampilkan peristiwa
aktivitas suara mulai/akhir ucapan.
Waktu tunggu aktivitas suara untuk memulai ucapan akan tetap diterapkan. Waktu tunggu aktivitas suara untuk akhir ucapan tidak akan diterapkan karena streaming akan diakhiri segera setelah ucapan berakhir.
[[["Mudah dipahami","easyToUnderstand","thumb-up"],["Memecahkan masalah saya","solvedMyProblem","thumb-up"],["Lainnya","otherUp","thumb-up"]],[["Sulit dipahami","hardToUnderstand","thumb-down"],["Informasi atau kode contoh salah","incorrectInformationOrSampleCode","thumb-down"],["Informasi/contoh yang saya butuhkan tidak ada","missingTheInformationSamplesINeed","thumb-down"],["Masalah terjemahan","translationIssue","thumb-down"],["Lainnya","otherDown","thumb-down"]],["Terakhir diperbarui pada 2025-09-02 UTC."],[],[],null,["# Single utterance behavior\n\n| **Preview**\n|\n|\n| This product or feature is subject to the \"Pre-GA Offerings Terms\" in the General Service Terms section\n| of the [Service Specific Terms](/terms/service-terms#1).\n|\n| Pre-GA products and features are available \"as is\" and might have limited support.\n|\n| For more information, see the\n| [launch stage descriptions](/products#product-launch-stages).\n\nSpeech-to-Text provides the `latest_short` model for recognizing speech that\nconsists of single utterances. This may be useful for applications where users\nare issuing single voice commands as opposed to long-form monologue or\ndictation.\n\nWhen a recognizer with the `latest_short` model is used for a recognition\nrequest, Speech-to-Text will stop performing recognition once it detects\nan utterance has finished. Speech-to-Text will return a speech activity\nevent response with the type `END_OF_SINGLE_UTTERANCE` followed by the\ntranscription results.\n\nSingle utterance and StreamingRecognize\n---------------------------------------\n\nIn the case where a `latest_short` model [`Recognizer`](/speech-to-text/v2/docs/reference/rest/v2/projects.locations.recognizers#Recognizer)\nis selected for a `StreamingRecognize` request, Speech-to-Text will close\nthe stream automatically after the utterance has ended.\n\n### With voice activity events\n\nIn the case where voice activity events have also been enabled for a\nStreamingRecognize request, Speech-to-Text will still return speech\nbegin/end voice activity events.\nVoice activity timeouts for speech begin will still be applied. Voice activity\ntimeouts for speech end will not be applied, since the stream will be closed as\nsoon as the utterance ends."]]