SpeechRecognizer
A brief introduction to SpeechRecognizer.
Property | Value |
---|---|
Google Cloud Service Name | Speech-to-Text |
Google Cloud Service Documentation | /speech-to-text/docs/ |
Google Cloud REST Resource Name | v2.projects.locations.recognizers |
Google Cloud REST Resource Documentation | /speech-to-text/docs/reference/rest/v2/projects.locations.recognizers |
Config Connector Resource Short Names | gcpspeechrecognizer gcpspeechrecognizers speechrecognizer |
Config Connector Service Name | speech.googleapis.com |
Config Connector Resource Fully Qualified Name | speechrecognizers.speech.cnrm.cloud.google.com |
Can Be Referenced by IAMPolicy/IAMPolicyMember | No |
Config Connector Default Average Reconcile Interval In Seconds | 600 |
Custom Resource Definition Properties
Spec
Schema
annotations:
string: string
defaultRecognitionConfig:
languageCodes:
- string
model: string
displayName: string
location: string
projectRef:
external: string
kind: string
name: string
namespace: string
resourceID: string
Fields | |
---|---|
Optional |
Allows users to store small amounts of arbitrary data. Both the key and the value must be 63 characters or less each. At most 100 annotations. |
Optional |
Default configuration to use for requests with this Recognizer. This can be overwritten by inline configuration in the [RecognizeRequest.config][google.cloud.speech.v2.RecognizeRequest.config] field. |
Optional |
Optional. The language of the supplied audio as a [BCP-47](https://www.rfc-editor.org/rfc/bcp/bcp47.txt) language tag. Language tags are normalized to BCP-47 before they are used eg "en-us" becomes "en-US". Supported languages for each model are listed in the [Table of Supported Models](https://cloud.google.com/speech-to-text/v2/docs/speech-to-text-supported-languages). If additional languages are provided, recognition result will contain recognition in the most likely language detected. The recognition result will include the language tag of the language detected in the audio. |
Optional |
|
Optional |
Optional. Which model to use for recognition requests. Select the model best suited to your domain to get best results. Guidance for choosing which model to use can be found in the [Transcription Models Documentation](https://cloud.google.com/speech-to-text/v2/docs/transcription-model) and the models supported in each region can be found in the [Table Of Supported Models](https://cloud.google.com/speech-to-text/v2/docs/speech-to-text-supported-languages). |
Optional |
User-settable, human-readable name for the Recognizer. Must be 63 characters or less. |
Required |
Immutable. |
Required |
The Project that this resource belongs to. |
Optional |
The `projectID` field of a project, when not managed by Config Connector. |
Optional |
The kind of the Project resource; optional but must be `Project` if provided. |
Optional |
The `name` field of a `Project` resource. |
Optional |
The `namespace` field of a `Project` resource. |
Optional |
The SpeechRecognizer name. If not given, the metadata.name will be used. |
Status
Schema
conditions:
- lastTransitionTime: string
message: string
reason: string
status: string
type: string
externalRef: string
observedGeneration: integer
observedState:
createTime: string
defaultRecognitionConfig:
adaptation:
customClasses:
- createTime: string
deleteTime: string
etag: string
expireTime: string
kmsKeyName: string
kmsKeyVersionName: string
name: string
reconciling: boolean
state: string
uid: string
updateTime: string
phraseSets:
- inlinePhraseSet:
createTime: string
deleteTime: string
etag: string
expireTime: string
kmsKeyName: string
kmsKeyVersionName: string
name: string
reconciling: boolean
state: string
uid: string
updateTime: string
deleteTime: string
etag: string
expireTime: string
kmsKeyName: string
kmsKeyVersionName: string
reconciling: boolean
state: string
uid: string
updateTime: string
Fields | |
---|---|
conditions |
Conditions represent the latest available observations of the object's current state. |
conditions[] |
|
conditions[].lastTransitionTime |
Last time the condition transitioned from one status to another. |
conditions[].message |
Human-readable message indicating details about last transition. |
conditions[].reason |
Unique, one-word, CamelCase reason for the condition's last transition. |
conditions[].status |
Status is the status of the condition. Can be True, False, Unknown. |
conditions[].type |
Type is the type of the condition. |
externalRef |
A unique specifier for the SpeechRecognizer resource in GCP. |
observedGeneration |
ObservedGeneration is the generation of the resource that was most recently observed by the Config Connector controller. If this is equal to metadata.generation, then that means that the current reported status reflects the most recent desired state of the resource. |
observedState |
ObservedState is the state of the resource as most recently observed in GCP. |
observedState.createTime |
Output only. Creation time. |
observedState.defaultRecognitionConfig |
Default configuration to use for requests with this Recognizer. This can be overwritten by inline configuration in the [RecognizeRequest.config][google.cloud.speech.v2.RecognizeRequest.config] field. |
observedState.defaultRecognitionConfig.adaptation |
Speech adaptation context that weights recognizer predictions for specific words and phrases. |
observedState.defaultRecognitionConfig.adaptation.customClasses |
A list of inline CustomClasses. Existing CustomClass resources can be referenced directly in a PhraseSet. |
observedState.defaultRecognitionConfig.adaptation.customClasses[] |
|
observedState.defaultRecognitionConfig.adaptation.customClasses[].createTime |
Output only. Creation time. |
observedState.defaultRecognitionConfig.adaptation.customClasses[].deleteTime |
Output only. The time at which this resource was requested for deletion. |
observedState.defaultRecognitionConfig.adaptation.customClasses[].etag |
Output only. This checksum is computed by the server based on the value of other fields. This may be sent on update, undelete, and delete requests to ensure the client has an up-to-date value before proceeding. |
observedState.defaultRecognitionConfig.adaptation.customClasses[].expireTime |
Output only. The time at which this resource will be purged. |
observedState.defaultRecognitionConfig.adaptation.customClasses[].kmsKeyName |
Output only. The [KMS key name](https://cloud.google.com/kms/docs/resource-hierarchy#keys) with which the CustomClass is encrypted. The expected format is `projects/{project}/locations/{location}/keyRings/{key_ring}/cryptoKeys/{crypto_key}`. |
observedState.defaultRecognitionConfig.adaptation.customClasses[].kmsKeyVersionName |
Output only. The [KMS key version name](https://cloud.google.com/kms/docs/resource-hierarchy#key_versions) with which the CustomClass is encrypted. The expected format is `projects/{project}/locations/{location}/keyRings/{key_ring}/cryptoKeys/{crypto_key}/cryptoKeyVersions/{crypto_key_version}`. |
observedState.defaultRecognitionConfig.adaptation.customClasses[].name |
Output only. Identifier. The resource name of the CustomClass. Format: `projects/{project}/locations/{location}/customClasses/{custom_class}`. |
observedState.defaultRecognitionConfig.adaptation.customClasses[].reconciling |
Output only. Whether or not this CustomClass is in the process of being updated. |
observedState.defaultRecognitionConfig.adaptation.customClasses[].state |
Output only. The CustomClass lifecycle state. |
observedState.defaultRecognitionConfig.adaptation.customClasses[].uid |
Output only. System-assigned unique identifier for the CustomClass. |
observedState.defaultRecognitionConfig.adaptation.customClasses[].updateTime |
Output only. The most recent time this resource was modified. |
observedState.defaultRecognitionConfig.adaptation.phraseSets |
A list of inline or referenced PhraseSets. |
observedState.defaultRecognitionConfig.adaptation.phraseSets[] |
|
observedState.defaultRecognitionConfig.adaptation.phraseSets[].inlinePhraseSet |
An inline defined PhraseSet. |
observedState.defaultRecognitionConfig.adaptation.phraseSets[].inlinePhraseSet.createTime |
Output only. Creation time. |
observedState.defaultRecognitionConfig.adaptation.phraseSets[].inlinePhraseSet.deleteTime |
Output only. The time at which this resource was requested for deletion. |
observedState.defaultRecognitionConfig.adaptation.phraseSets[].inlinePhraseSet.etag |
Output only. This checksum is computed by the server based on the value of other fields. This may be sent on update, undelete, and delete requests to ensure the client has an up-to-date value before proceeding. |
observedState.defaultRecognitionConfig.adaptation.phraseSets[].inlinePhraseSet.expireTime |
Output only. The time at which this resource will be purged. |
observedState.defaultRecognitionConfig.adaptation.phraseSets[].inlinePhraseSet.kmsKeyName |
Output only. The [KMS key name](https://cloud.google.com/kms/docs/resource-hierarchy#keys) with which the PhraseSet is encrypted. The expected format is `projects/{project}/locations/{location}/keyRings/{key_ring}/cryptoKeys/{crypto_key}`. |
observedState.defaultRecognitionConfig.adaptation.phraseSets[].inlinePhraseSet.kmsKeyVersionName |
Output only. The [KMS key version name](https://cloud.google.com/kms/docs/resource-hierarchy#key_versions) with which the PhraseSet is encrypted. The expected format is `projects/{project}/locations/{location}/keyRings/{key_ring}/cryptoKeys/{crypto_key}/cryptoKeyVersions/{crypto_key_version}`. |
observedState.defaultRecognitionConfig.adaptation.phraseSets[].inlinePhraseSet.name |
Output only. Identifier. The resource name of the PhraseSet. Format: `projects/{project}/locations/{location}/phraseSets/{phrase_set}`. |
observedState.defaultRecognitionConfig.adaptation.phraseSets[].inlinePhraseSet.reconciling |
Output only. Whether or not this PhraseSet is in the process of being updated. |
observedState.defaultRecognitionConfig.adaptation.phraseSets[].inlinePhraseSet.state |
Output only. The PhraseSet lifecycle state. |
observedState.defaultRecognitionConfig.adaptation.phraseSets[].inlinePhraseSet.uid |
Output only. System-assigned unique identifier for the PhraseSet. |
observedState.defaultRecognitionConfig.adaptation.phraseSets[].inlinePhraseSet.updateTime |
Output only. The most recent time this resource was modified. |
observedState.deleteTime |
Output only. The time at which this Recognizer was requested for deletion. |
observedState.etag |
Output only. This checksum is computed by the server based on the value of other fields. This may be sent on update, undelete, and delete requests to ensure the client has an up-to-date value before proceeding. |
observedState.expireTime |
Output only. The time at which this Recognizer will be purged. |
observedState.kmsKeyName |
Output only. The [KMS key name](https://cloud.google.com/kms/docs/resource-hierarchy#keys) with which the Recognizer is encrypted. The expected format is `projects/{project}/locations/{location}/keyRings/{key_ring}/cryptoKeys/{crypto_key}`. |
observedState.kmsKeyVersionName |
Output only. The [KMS key version name](https://cloud.google.com/kms/docs/resource-hierarchy#key_versions) with which the Recognizer is encrypted. The expected format is `projects/{project}/locations/{location}/keyRings/{key_ring}/cryptoKeys/{crypto_key}/cryptoKeyVersions/{crypto_key_version}`. |
observedState.reconciling |
Output only. Whether or not this Recognizer is in the process of being updated. |
observedState.state |
Output only. The Recognizer lifecycle state. |
observedState.uid |
Output only. System-assigned unique identifier for the Recognizer. |
observedState.updateTime |
Output only. The most recent time this Recognizer was modified. |
Sample YAML(s)
Typical Use Case
# Copyright 2025 Google LLC
#
# Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
apiVersion: speech.cnrm.cloud.google.com/v1beta1
kind: SpeechRecognizer
metadata:
name: speechrecognizer-sample
spec:
projectRef:
external: "projects/${PROJECT_ID?}"
location: global
displayName: "Sample Speech Recognizer"
defaultRecognitionConfig:
model: long
languageCodes:
- en-US