本頁面由 Cloud Translation API 翻譯而成。

使用醫療模型辨識語音

除了其他標準和強化版語音辨識模型，Speech-to-Text 還提供兩種醫療模型。醫療模型專門用於辨識醫療環境中常見的字詞，例如診斷、藥物、症狀、治療和病況。如要辨識這類音訊資料，可以使用這些模型來提升轉錄結果的準確度。

我們提供兩種醫療模型，分別適用於特定用途：

medical_conversation：適用於醫療服務提供者 (例如醫生或護士) 與病患之間的對話。如果醫療服務提供者和病患都會說話，請使用這個模型。系統會自動偵測每位說話者說的字詞，並在傳回的轉錄稿中加上標籤。
medical_dictation：適用於單一醫療服務提供者口述的筆記，例如醫生口述病患的血液檢測結果。

請只搭配下列 Speech-to-Text 功能使用醫療模型。這份清單中未列出的功能無法與任一醫療模型搭配使用。自動標點符號功能預設為啟用。

醫療對話模型支援下列功能：

說話者分段標記

醫療聽寫模型支援下列功能：

口說標點符號
格式化指令
朗讀標題

傳送轉錄要求

REST

以下程式碼範例會使用 medical_conversation 模型，轉錄公開 Cloud Storage bucket 中的音訊檔案。

使用任何要求資料之前，請先替換以下項目：

LANGUAGE_CODE：音訊片段中使用的語言 BCP-47 代碼。醫療模型僅支援 en-US。
ENCODING：要轉錄的音訊編碼。如果您使用公開音訊樣本，編碼為 LINEAR16。
PROJECT_ID：專案的英數字元 ID。 Google Cloud

HTTP 方法和網址：

POST https://speech.googleapis.com/v1/speech:recognize

JSON 要求主體：

{
  "config": {
    "languageCode": "LANGUAGE_CODE",
    "encoding": "ENCODING",
    "model": "medical_conversation"
  },
  "audio": {
    "uri": "gs://cloud-samples-data/speech/medical_conversation_2.wav"
  }
}

如要傳送要求，請展開以下其中一個選項：

curl (Linux、macOS 或 Cloud Shell)

注意： 下列指令假設您已執行 gcloud init 或 gcloud auth login，透過使用者帳戶登入 gcloud CLI，或使用 Cloud Shell，自動登入 gcloud CLI。您可以執行 gcloud auth list 查看目前有效的帳戶。

將要求主體儲存在名為 request.json 的檔案中，然後執行下列指令：

curl -X POST \
     -H "Authorization: Bearer $(gcloud auth print-access-token)" \
     -H "x-goog-user-project: PROJECT_ID" \
     -H "Content-Type: application/json; charset=utf-8" \
     -d @request.json \
     "https://speech.googleapis.com/v1/speech:recognize"

PowerShell (Windows)

注意： 下列指令假設您已執行 gcloud init 或 gcloud auth login，透過使用者帳戶登入 gcloud CLI。您可以執行 gcloud auth list 查看目前有效的帳戶。

將要求主體儲存在名為 request.json 的檔案中，然後執行下列指令：

$cred = gcloud auth print-access-token
$headers = @{ "Authorization" = "Bearer $cred"; "x-goog-user-project" = "PROJECT_ID" }

Invoke-WebRequest `
    -Method POST `
    -Headers $headers `
    -ContentType: "application/json; charset=utf-8" `
    -InFile request.json `
    -Uri "https://speech.googleapis.com/v1/speech:recognize" | Select-Object -Expand Content

您應該會收到如下的 JSON 回應：

  "results": [
    {
      "alternatives": [
        {
          "transcript": "Um-hum . Yeah. Hello , good morning . Good
          morning . So , tell me what's going on . Uh , sure , so , um , I
          woke up probably three or four days ago , which , uh , wheezing and short of breath .
          Okay , any cough or chest pain ? I cough infrequently , but no ,
          uh , chest pain . Have you been exposed to anyone with covid ?
          Uh , no , and I also took a test , which was negative . Uh , is it getting
          worse , or better ? Uh , it has been getting a lot worse"
        }
      ]
    },
    {
      "alternatives": [
        {
          "transcript": "Okay . Was there something that triggered this exposure to cold , for
          example ? Um , I had a gone hiking , and I got caught in the rain the day
          before this all started ."
        }
      ]
    }
  ]
}

口說標點符號

醫療聽寫模型支援口說標點符號，方便你撰寫醫療筆記。這項功能預設為啟用，且無法停用。語音轉錄稿中的標點符號會以方括號標示。舉例來說，傳回的轉錄稿可能如下所示：

Patient could be showing signs of trauma [question mark] They said they were [quote] having elevated heart rate [unquote]。

Speech-to-Text 支援下列口說標點符號：

英文句號鍵
逗號鍵
冒號鍵
帽子
斜線鍵
破折號
連字號鍵
問號
分號鍵
引號
unquote
後引號
左括弧
右括弧
右括號

格式設定指令

醫療聽寫模型支援語音指令，可格式化附註。這項功能預設為啟用，且無法停用。語音轉錄稿會以方括號標示口語指令。舉例來說，傳回的轉錄稿可能如下所示：

[next line] Patient says they are experiencing fever [next point]。

語音轉文字功能支援下列語音指令：

下一個點
下一個號碼
下一段
帽子
大寫
換行符號
下一個項目
下一個問題
下一個問題編號
下一列
下一個區段
下一個號碼
scratch
scratch that
結束語音輸入

語音標題

醫療聽寫模型支援為聽寫的附註加上口語標題。這項功能預設為啟用，且無法停用。轉錄稿中的標題會以方括號標示，並以大寫字母開頭。舉例來說，傳回的轉錄稿可能如下所示：

[CURRENT MEDICATIONS] Patient is currently taking no medications。

語音轉文字功能支援下列口語標題：

主要申訴
目前服用的藥物
出院用藥
出院計畫
家族病史
發現項目
系統審查
現病史
適應症
LABS
過去手術史
身體檢查
系統審查
RADIOLOGY

使用醫療模型辨識語音 透過集合功能整理內容 你可以依據偏好儲存及分類內容。

傳送轉錄要求

REST

curl (Linux、macOS 或 Cloud Shell)

PowerShell (Windows)

口說標點符號

格式設定指令

語音標題

使用醫療模型辨識語音