此页面由 Cloud Translation API 翻译。

使用 RAG 生成有依据的回答

在 AI 应用中体验检索增强生成 (RAG) 功能时，您可以根据以下依据来源生成基于事实的提示回答：

Google 搜索：如果您想将模型与世界知识、各种主题或互联网上的最新信息相关联，请使用“依托 Google 搜索进行接地”功能。使用 Google 搜索建立依据支持动态检索，让您可以选择仅在必要时使用 Google 搜索生成接地结果。因此，动态检索配置会评估提示是否需要了解近期活动，并启用“依托 Google 搜索进行接地”功能。如需了解详情，请参阅动态检索。
重要提示：如果您在回答中收到 Google 搜索建议，则该回答属于“接地结果”，受服务条款部分中的“依托 Google 搜索进行接地”条款的约束。如需使用 Google 搜索建议，请参阅使用 Google 搜索建议。
内嵌文本：使用内嵌文本进行接地，以将答案基于请求中提供的称为事实文本的文本片段。事实文本是指用户提供的陈述，在给定请求中被认为是事实。模型不会检查事实文本的真实性。
Vertex AI Search 数据存储区：如果您想将模型连接到 Vertex AI Search 数据存储区中的企业文档，请使用依托 Vertex AI Search 进行接地。

本页介绍了如何使用以下方法，根据这些建立依据来源生成有依据的回答：

单轮回答生成
- 内嵌文本和 Vertex AI Search 数据存储区
- Google 搜索
多轮回答生成

此外，您还可以选择以流式传输方式获取模型回答。通过流式传输生成接地回答是一项实验性功能。

您可以使用其他方法生成有依据的回答，以满足您的应用需求。如需了解详情，请参阅用于打造搜索和 RAG 体验的 Vertex AI API。

术语

在使用基于事实的回答生成方法之前，最好先了解输入和输出、如何构建请求以及与 RAG 相关的术语。

RAG 术语

RAG 是一种方法，可让大语言模型 (LLM) 生成基于您所选数据源的回答。RAG 分为两个阶段：

检索：快速获取最相关的事实可能是一个常见的搜索问题。借助 RAG，您可以快速检索生成回答所需的重要事实。
生成：LLM 使用检索到的事实生成有依据的回答。

因此，有依据的回答生成方法会从依据来源检索事实，并生成有依据的回答。

输入数据

基于事实的回答生成方法需要在请求中提供以下输入内容：

角色：给定文本的发送者，可以是用户 (user) 或模型 (model)。
文本：当角色为 user 时，文本是提示；当角色为 model 时，文本是接地回答。您在请求中指定角色和文本的方式取决于以下因素：
- 对于单轮回答生成，用户在请求中发送提示文本，模型在回答中发送回答文本。
- 对于多轮回答生成，请求包含所有先前轮次的提示-回答对以及用户在当前轮次中的提示文本。因此，在此类请求中，提示文本的角色为 user，回答文本的角色为 model。
系统指令：提示的序言，用于控制模型的行为并相应地修改输出。例如，您可以为生成的回答添加角色，也可以指示模型以某种方式设置输出文本的格式。对于多轮回答生成，您必须在每一轮中提供系统指令。如需了解详情，请参阅使用系统指令。
依据来源：回答的依据来源，可以是以下一项或多项：
- Google 搜索：依托 Google 搜索结果为回答接地。当接地源为 Google 搜索时，您可以指定动态检索配置，并设置动态检索阈值。如需了解详情，请参阅动态检索。
  重要提示：如果您在回答中收到 Google 搜索建议，则该回答属于“接地结果”，受服务条款部分中的“依托 Google 搜索进行接地”条款的约束。如需使用 Google 搜索建议，请参阅使用 Google 搜索建议。
- 内嵌文本：根据请求中提供的事实文本来确定答案。事实文本是指用户提供的陈述，在给定请求中被认为是事实。模型不会检查事实文本的真实性。您可以在每个内嵌文本来源中提供最多 100 个事实文本。可以使用元属性（例如标题、作者和 URI）来支持事实文本。在引用支持答案的块时，系统会在响应中返回这些元属性。
- Vertex AI Search 数据存储区：以 Vertex AI Search 数据存储区中的文档为回答提供依据。您无法将网站搜索数据存储区指定为依据来源。
在给定的请求中，您可以同时提供内嵌文本源和 Vertex AI Search 数据存储区源。您无法将 Google 搜索与上述任一来源结合使用。因此，如果您想根据 Google 搜索结果来生成答案，则必须发送单独的请求，并将 Google 搜索指定为唯一的依据来源。

您可以按任意顺序提供最多 10 个依据来源。例如，假设您按以下顺序提供以下数量的基础来源，以获得总共 10 个基础来源：
- 三个内嵌文本来源，每个来源最多可包含 100 个事实文本
- 六个 Vertex AI Search 数据存储区
- 一个内嵌文本来源，最多包含 100 个事实文本
系统会按照来源在请求中的指定顺序为每个来源分配一个索引。例如，如果您在请求中指定了多种来源，则系统会按以下表格所示分配来源索引：

依据来源索引

内嵌文本 #1 0

内嵌文本 #2 1

Vertex AI Search 数据存储区 1 2

内嵌文本 #3 3

Vertex AI Search 数据存储区 2 4

此索引会在响应中引用，有助于追溯来源。
生成规范：模型配置的规范，包含以下信息：
- 模型 ID：指定用于生成回答的 Vertex AI Gemini 模型。如需查看可用于生成有依据的回答的模型列表，请参阅支持的模型。
- 模型参数：指定您可以为所选模型设置的参数。这些参数包括：语言、温度、Top-P 和 Top-K。如需详细了解这些参数，请参阅 Gemini 模型参数。
语言代码：生成的答案的语言通常设置为与提示的语言一致。如果提示中没有单一语言（例如，如果提示非常简短，并且可以用多种语言表达），则语言代码字段会决定回答的语言。

如需查看语言代码列表，请参阅语言。
纬度和经度：指定用户的纬度和经度。如果查询包含特定于位置的问题，例如“查找我附近的咖啡店”，则使用这些字段。如果无法确定查询语言，并且未设置语言代码，则系统会使用纬度和经度来确定回答的语言。

依据来源	索引
内嵌文本 #1	0
内嵌文本 #2	1
Vertex AI Search 数据存储区 1	2
内嵌文本 #3	3
Vertex AI Search 数据存储区 2	4

输出数据

模型生成的回答称为候选回答，其中包含以下数据。输出中可能不会显示所有字段。

角色：接地回答的发送者。回答始终包含接地回答文本。因此，回答中的角色始终是模型。
文本：接地的回答。
依据得分：一个介于 [0, 1] 范围内的浮点值，用于指示答案在给定来源中的依据程度。
接地元数据：有关接地来源的元数据。接地元数据包含以下信息：
- 支持块：支持回答的块列表。每个支持块都会分配一个支持块索引，该索引在跟踪来源时非常有用。每个支持块包含以下内容：
  - 块文本：从答案或部分答案（称为声明文本）的来源中逐字引用的一部分文本。此字段可能不会始终出现在响应中。
  - 来源：请求中分配给来源的索引。
  - 来源元数据：有关块的元数据。根据来源的不同，来源元数据可以是以下任一类型：
    - 对于内嵌来源，元数据可以是请求中指定的其他详细信息，例如标题、作者或 URI。
    - 对于 Vertex AI Search 数据存储区，元数据可以是文档 ID、文档标题、URI（Cloud Storage 位置）或页码。
    - 对于“使用 Google 搜索建立依据”，当生成有依据的回答时，元数据会包含一个 URI，该 URI 会重定向到用于生成有依据的回答的内容的发布商。元数据还包含发布商的网域。在生成有根据的回答后，所提供的 URI 最长可在 30 天内保持可访问状态。
    重要提示：最终用户必须直接访问所提供的 URI，不得通过自动化方式以编程方式查询该 URI。如果检测到自动化访问，Grounding with Google Search 服务可能会停止提供重定向 URI。如需重新启动重定向 URI，请与您的客户工程师联系。
- 接地支持：回答中声明的接地信息。基础支持包含以下信息：
  - 声明文本：通过支持块文本证实的回答或回答的一部分。
  - 支持块索引：支持块索引，按支持块在支持块列表中的显示顺序分配。
  - 网络搜索查询：Google 搜索建议的建议搜索查询。
  - 搜索建议：如果您在回答中收到 Google 搜索建议，则该回答属于“接地结果”，受“依托 Google 搜索进行接地”服务条款的约束。如需了解详情，请参阅服务条款。 searchEntryPoint 字段中的 renderedContent 字段是用于实现 Google 搜索建议的提供的代码。如需使用 Google 搜索建议，请参阅使用 Google 搜索建议。

在单个对话轮次中生成接地回答

本部分介绍如何根据以下来源生成回答：

内嵌文本和 Vertex AI Search 数据存储区
Google 搜索

以内嵌文本和 Vertex AI Search 数据存储区中的数据作为回答依据

以下示例展示了如何通过指定内嵌文本和 Vertex AI Search 数据存储区作为接地源来发送提示文本。您无法将网站搜索数据存储区指定为依据来源。此示例使用 generateGroundedContent 方法。

REST

在以下 curl 请求中发送提示。

curl -X POST \
-H "Authorization: Bearer $(gcloud auth print-access-token)" \
-H "Content-Type: application/json" \
"https://discoveryengine.googleapis.com/v1/projects/PROJECT_NUMBER/locations/global:generateGroundedContent" \
-d '
{
"contents": [
 {
   "role": "user",
   "parts": [
     {
       "text": "PROMPT_TEXT"
     }
   ]
 }
],
"systemInstruction": {
   "parts": {
       "text": "SYSTEM_INSTRUCTION"
   }
},
"groundingSpec": {
 "groundingSources": [
   {
     "inlineSource": {
       "groundingFacts": [
         {
           "factText": "FACT_TEXT_1",
           "attributes": {
             "title": "TITLE_1",
             "uri": "URI_1",
             "author": "AUTHOR_1"
           }
         }
       ]
     }
   },
   {
     "inlineSource": {
       "groundingFacts": [
         {
           "factText": "FACT_TEXT_2",
           "attributes": {
             "title": "TITLE_2",
             "uri": "URI_2"
           }
         },
         {
           "factText": "FACT_TEXT_3",
           "attributes": {
             "title": "TITLE_3",
             "uri": "URI_3"
           }
         }
       ]
     }
   },
   {
     "searchSource": {
       "servingConfig": "projects/PROJECT_NUMBER/locations/global/collections/default_collection/engines/APP_ID_1/servingConfigs/default_search"
     }
   },
   {
     "searchSource": {
       "servingConfig": "projects/PROJECT_NUMBER/locations/global/collections/default_collection/engines/APP_ID_2/servingConfigs/default_search"
     }
   }
  ]
},
"generationSpec": {
  "modelId": "MODEL_ID",
  "temperature": TEMPERATURE,
  "topP": TOP_P,
  "topK": TOP_K
},
"user_context": {
  "languageCode: "LANGUAGE_CODE",
  "latLng": {
    "latitude": LATITUDE,
    "longitude": LONGITUDE
 },
}
}'

替换以下内容：

PROJECT_NUMBER：您的 Google Cloud 项目的编号。
PROMPT_TEXT：用户的提示。
SYSTEM_INSTRUCTION：一个可选字段，用于提供序言或一些其他背景信息。
FACT_TEXT_N：用于确定答案基础的内嵌文本。您最多可以提供 100 条事实陈述。
TITLE_N：一个可选字段，用于为内嵌文本设置 title 元属性。
URI_N：一个可选字段，用于为内嵌文本设置 URI 元属性。
AUTHOR_N：一个可选字段，用于为内嵌文本设置作者元属性。
APP_ID_N：Vertex AI Search 应用的 ID。
MODEL_ID：一个可选字段，用于设置您希望用来生成有事实依据的答案的 Gemini 模型的模型 ID。如需查看可用模型 ID 的列表，请参阅支持的模型。
TEMPERATURE：一个可选字段，用于设置采样时使用的温度。Google 建议将温度设为 0.0。如需了解详情，请参阅 Gemini 模型参数。
TOP_P：一个可选字段，用于设置模型的 top-P 值。如需了解详情，请参阅 Gemini 模型参数。
TOP_K：一个可选字段，用于设置模型的 Top-K 值。如需了解详情，请参阅 Gemini 模型参数。
LANGUAGE_CODE：一个可选字段，可用于设置生成的答案和返回的块文本的语言。如果无法从查询中确定语言，则使用此字段。默认值为 en。如需查看语言代码列表，请参阅语言。
LATITUDE：用于设置纬度的可选字段。以十进制度输入值，例如 -25.34。
LONGITUDE：用于设置经度的可选字段。以十进制度输入值，例如 131.04。

响应

您应该会收到类似以下截断的 JSON 响应。如需了解响应，请参阅输出数据。

{
 "candidates": [
   {
     "content": {
       "role": "model",
       "parts": [
         {
           "text": "ANSWER_TEXT"
         }
       ]
     },
     "groundingScore": GROUNDING_SCORE,
     "groundingMetadata": {
       "supportChunks": [
         {
           "chunkText": "CHUNK_TEXT_FROM_A_DOCUMENT_IN_A_DATA_STORE ",
           "source": "4",
           "sourceMetadata": {
             "title": "DOCUMENT_TITLE",
             "uri": "gs://PATH/TO/DOCUMENT.pdf",
             "document_id": "DOCUMENT_ID",
             "page_identifier": "PAGE_NUMBER"
           }
         },
         {
           "chunkText": "CHUNK_TEXT_FROM_FACT_TEXT_1",
           "source": "0",
           "sourceMetadata": {
             "title": "TITLE_1",
             "uri": "URI_1",
             "author": "AUTHOR_1"
           }
         }
       ],
       "groundingSupport": [
         {
           "claimText": "CLAIM_TEXT_1",
           "supportChunkIndices": [
             0,
             1
           ]
         }
       ]
     }
   }
 ]
}

以内嵌文本和 Vertex AI Search 为依据生成单轮回答的示例

在以下示例中，请求指定了以下接地源：一个内嵌文本事实和一个 Vertex AI Search 数据存储区。此示例使用 generateGroundedContent 方法。此示例还使用系统指令来让回答以笑脸表情符号结尾。

REST

curl -X POST \
-H "Authorization: Bearer $(gcloud auth print-access-token)" \
-H "Content-Type: application/json" \
"https://discoveryengine.googleapis.com/v1/projects/123456/locations/global:generateGroundedContent" \
-d '
{
  "contents": [
    {
      "role": "user",
      "parts": [
        {
          "text": "How did Google do in 2020? Where can I find BigQuery docs?"
        }
      ]
    }
  ],
  "systemInstruction": {
      "parts": {
          "text": "Add a smiley emoji after the answer."
      }
  },
  "groundingSpec": {
    "groundingSources": [
      {
        "inline_source": {
          "grounding_facts": [
            {
              "fact_text": "The BigQuery documentation can be found at https://cloud.google.com/bigquery/docs/introduction",
              "attributes": {
                "title": "BigQuery Overview",
                "uri": "https://cloud.google.com/bigquery/docs/introduction"
              }
            }
          ]
        }
      },
      {
        "searchSource": {
          "servingConfig": "projects/123456/locations/global/collections/default_collection/engines/app_id_example/servingConfigs/default_search"
        }
      }
    ]
  },
  "generationSpec": {
    "modelId": "gemini-1.5-flash"
  },
  "user_context": {
    "languageCode: "en",
    "latLng": {
       "latitude": 37.422131,
       "longitude": -122.084801
    }
  }
}'

响应

您应该会收到类似以下截断的 JSON 响应。如需了解响应，请参阅输出数据。

{
 "candidates": [
   {
     "content": {
       "role": "model",
       "parts": [
         {
           "text": "Google's revenue increased by 23% in 2020, reaching $182.5 billion. Google Cloud revenue was $13.1 billion for 2020. You can find BigQuery documentation at https://cloud.google.com/bigquery/docs/introduction. 😊 \n"
         }
       ]
     },
     "groundingScore": 0.86738646,
     "groundingMetadata": {
       "supportChunks": [
         {
           "chunkText": "Alphabet Announces Fourth Quarter and Fiscal Year 2020 Results\nMOUNTAIN VIEW, Calif. – February 2, 2021 – Alphabet Inc. (NASDAQ: GOOG, GOOGL) today announced\nfinancial results for the quarter and fiscal year ended December 31, 2020. Sundar Pichai, CEO of Google and Alphabet, said: "Our strong results this quarter reflect the helpfulness of our\nproducts and services to people and businesses, as well as the accelerating transition to online services and the\ncloud. Google succeeds when we help our customers and partners succeed, and we see significant opportunities to\nforge meaningful partnerships as businesses increasingly look to a digital future." Ruth Porat, CFO of Google and Alphabet, said: "Our strong fourth quarter performance, with revenues of $56.9\nbillion, was driven by Search and YouTube, as consumer and business activity recovered from earlier in the year. Google Cloud revenues were $13.1 billion for 2020, with significant ongoing momentum, and we remain focused on\ndelivering value across the growth opportunities we see." New reporting segment structure and operating results\nWe are now reporting results for three segments: Google Services, Google Cloud, and Other Bets. \n...\nIn 2020, we entered into derivatives that hedged the changes in fair value of certain marketable equity securities, which\nresulted in a $497 million net loss for the quarter ended December 31, 2020. The offsetting recognized gains on the\nmarketable equity securities are reflected in Gain (loss) on equity securities, net. Segment results\nThe following table presents our revenues and operating income (loss) (in millions; unaudited): Quarter Fiscal Year Q4 2019 Q1 2020 Q2 2020 Q3 2020 Q4 2020 2018 2019 2020 Revenues:\nGoogle Services $ 43,198 $ 38,198 $ 34,991 $ 42,573 $ 52,873 $ 130,524 $ 151,825 $ 168,635 Google Cloud 2,614 2,777 3,007 3,444 3,831 5,838 8,918 13,059 Other Bets 172 135 148 178 196 595 659 657 Hedging gains (losses) 91 49 151 (22) (2) (138) 455 176 Total revenues $ 46,075 $ 41,159 $ 38,297 $ 46,173 $ 56,898 $ 136,819 $ 161,857 $ 182,527 Quarter Fiscal Year Q4 2019 Q1 2020 Q2 2020 Q3 2020 Q4 2020 2018 2019 2020 Operating income (loss):\nGoogle Services $ 13,488 $ 11,548 $ 9,539 $ 14,453 $ 19,066 $ 43,137 $ 48,999 $ 54,606 Google Cloud (1,194) (1,730) (1,426) (1,208) (1,243) (4,348) (4,645) (5,607) Other Bets (2,026) (1,121) (1,116) (1,103) (1,136) \n...\nQ4 2020 financial highlights\nThe following table summarizes our consolidated financial results for the quarters ended December 31, 2019 and\n2020 (in millions, except for per share information and percentages; unaudited). Quarter Ended December 31,\n2019 2020 Revenues $ 46,075 $ 56,898 Increase in revenues year over year 17 % 23 % Increase in constant currency revenues year over year(1) 19 % 23 % Operating income $ 9,266 $ 15,651 Operating margin 20 % 28 % Other income (expense), net $ 1,438 $ 3,038 Net income $ 10,671 $ 15,227 Diluted EPS $ 15.35 $ 22.30 (1) Non-GAAP measure. See the table captioned "Reconciliation from GAAP revenues to non-GAAP constant currency\nrevenues" for more details. Q4 2020 supplemental information (in millions, except for number of employees; unaudited)\nRevenues, Traffic Acquisition Costs (TAC) and number of employees\nThe following table summarizes our revenues, total TAC and number of employees. Quarter Ended December 31,\n2019 2020 Google Search & other $ 27,185 $ 31,903 YouTube ads 4,717 6,885 Google Network Members' properties 6,032 7,411 Google advertising 37,934 46,199 Google other 5,264 6,674 Google Services total 43,198 52,873 Google Cloud 2,614 3,831 Other Bets 172 196 Hedging gains (losses) 91 (2) Total revenues $ 46,075 $ 56,898 Total TAC $ 8,501 $ 10,466 Number of employees 118,899 135,301 ",
           "source": "1",
           "sourceMetadata": {
             "title": "GOOG Exhibit 99.1 Q4'20",
             "page_identifier": "2",
             "uri": "gs://cloud-samples-data/gen-app-builder/search/alphabet-investor-pdfs/2020Q4_alphabet_earnings_release.pdf",
             "document_id": "projects/123456/locations/global/collections/default_collection/dataStores/data_store_id_example/branches/0/documents/217e8bedecfe08e3c43f5b289af15243"
           }
         },
         {
           "chunkText": "Alphabet Announces Fourth Quarter and Fiscal Year 2020 Results\nMOUNTAIN VIEW, Calif. – February 2, 2021 – Alphabet Inc. (NASDAQ: GOOG, GOOGL) today announced\nfinancial results for the quarter and fiscal year ended December 31, 2020. Sundar Pichai, CEO of Google and Alphabet, said: "Our strong results this quarter reflect the helpfulness of our\nproducts and services to people and businesses, as well as the accelerating transition to online services and the\ncloud. Google succeeds when we help our customers and partners succeed, and we see significant opportunities to\nforge meaningful partnerships as businesses increasingly look to a digital future." Ruth Porat, CFO of Google and Alphabet, said: "Our strong fourth quarter performance, with revenues of $56.9\nbillion, was driven by Search and YouTube, as consumer and business activity recovered from earlier in the year. Google Cloud revenues were $13.1 billion for 2020, with significant ongoing momentum, and we remain focused on\ndelivering value across the growth opportunities we see." New reporting segment structure and operating results\nWe are now reporting results for three segments: Google Services, Google Cloud, and Other Bets. \n...\nIn 2020, we entered into derivatives that hedged the changes in fair value of certain marketable equity securities, which\nresulted in a $497 million net loss for the quarter ended December 31, 2020. The offsetting recognized gains on the\nmarketable equity securities are reflected in Gain (loss) on equity securities, net. Segment results\nThe following table presents our revenues and operating income (loss) (in millions; unaudited): Quarter Fiscal Year Q4 2019 Q1 2020 Q2 2020 Q3 2020 Q4 2020 2018 2019 2020 Revenues:\nGoogle Services $ 43,198 $ 38,198 $ 34,991 $ 42,573 $ 52,873 $ 130,524 $ 151,825 $ 168,635 Google Cloud 2,614 2,777 3,007 3,444 3,831 5,838 8,918 13,059 Other Bets 172 135 148 178 196 595 659 657 Hedging gains (losses) 91 49 151 (22) (2) (138) 455 176 Total revenues $ 46,075 $ 41,159 $ 38,297 $ 46,173 $ 56,898 $ 136,819 $ 161,857 $ 182,527 Quarter Fiscal Year Q4 2019 Q1 2020 Q2 2020 Q3 2020 Q4 2020 2018 2019 2020 Operating income (loss):\nGoogle Services $ 13,488 $ 11,548 $ 9,539 $ 14,453 $ 19,066 $ 43,137 $ 48,999 $ 54,606 Google Cloud (1,194) (1,730) (1,426) (1,208) (1,243) (4,348) (4,645) (5,607) Other Bets (2,026) (1,121) (1,116) (1,103) (1,136) \n...\nQ4 2020 financial highlights\nThe following table summarizes our consolidated financial results for the quarters ended December 31, 2019 and\n2020 (in millions, except for per share information and percentages; unaudited). Quarter Ended December 31,\n2019 2020 Revenues $ 46,075 $ 56,898 Increase in revenues year over year 17 % 23 % Increase in constant currency revenues year over year(1) 19 % 23 % Operating income $ 9,266 $ 15,651 Operating margin 20 % 28 % Other income (expense), net $ 1,438 $ 3,038 Net income $ 10,671 $ 15,227 Diluted EPS $ 15.35 $ 22.30 (1) Non-GAAP measure. See the table captioned "Reconciliation from GAAP revenues to non-GAAP constant currency\nrevenues" for more details. Q4 2020 supplemental information (in millions, except for number of employees; unaudited)\nRevenues, Traffic Acquisition Costs (TAC) and number of employees\nThe following table summarizes our revenues, total TAC and number of employees. Quarter Ended December 31,\n2019 2020 Google Search & other $ 27,185 $ 31,903 YouTube ads 4,717 6,885 Google Network Members' properties 6,032 7,411 Google advertising 37,934 46,199 Google other 5,264 6,674 Google Services total 43,198 52,873 Google Cloud 2,614 3,831 Other Bets 172 196 Hedging gains (losses) 91 (2) Total revenues $ 46,075 $ 56,898 Total TAC $ 8,501 $ 10,466 Number of employees 118,899 135,301 ",
           "source": "1",
           "sourceMetadata": {
             "document_id": "projects/123456/locations/global/collections/default_collection/dataStores/data_store_id_example/branches/0/documents/217e8bedecfe08e3c43f5b289af15243",
             "page_identifier": "2",
             "title": "GOOG Exhibit 99.1 Q4'20",
             "uri": "gs://cloud-samples-data/gen-app-builder/search/alphabet-investor-pdfs/2020Q4_alphabet_earnings_release.pdf"
           }
         },
         {
           "chunkText": "The BigQuery documentation can be found at https://cloud.google.com/bigquery/docs/introduction ",
           "source": "0",
           "sourceMetadata": {
             "uri": "https://cloud.google.com/bigquery/docs/introduction",
             "title": "BigQuery Overview"
           }
         }
       ],
       "groundingSupport": [
         {
           "claimText": "Google's revenue increased by 23% in 2020, reaching $182.5 billion.",
           "supportChunkIndices": [
             0
           ]
         },
         {
           "claimText": "Google Cloud revenue was $13.1 billion for 2020.",
           "supportChunkIndices": [
             1
           ]
         },
         {
           "claimText": "You can find BigQuery documentation at https://cloud.google.com/bigquery/docs/introduction.😊 ",
           "supportChunkIndices": [
             2
           ]
         }
       ]
     }
   }
 ]
}

Python

from google.cloud import discoveryengine_v1 as discoveryengine

# TODO(developer): Uncomment these variables before running the sample.
# project_number = "YOUR_PROJECT_NUMBER"
# engine_id = "YOUR_ENGINE_ID"

client = discoveryengine.GroundedGenerationServiceClient()

request = discoveryengine.GenerateGroundedContentRequest(
    # The full resource name of the location.
    # Format: projects/{project_number}/locations/{location}
    location=client.common_location_path(project=project_number, location="global"),
    generation_spec=discoveryengine.GenerateGroundedContentRequest.GenerationSpec(
        model_id="default",
    ),
    # Conversation between user and model
    contents=[
        discoveryengine.GroundedGenerationContent(
            role="user",
            parts=[
                discoveryengine.GroundedGenerationContent.Part(
                    text="How did Google do in 2020? Where can I find BigQuery docs?"
                )
            ],
        )
    ],
    system_instruction=discoveryengine.GroundedGenerationContent(
        parts=[
            discoveryengine.GroundedGenerationContent.Part(
                text="Add a smiley emoji after the answer."
            )
        ],
    ),
    # What to ground on.
    grounding_spec=discoveryengine.GenerateGroundedContentRequest.GroundingSpec(
        grounding_sources=[
            discoveryengine.GenerateGroundedContentRequest.GroundingSource(
                inline_source=discoveryengine.GenerateGroundedContentRequest.GroundingSource.InlineSource(
                    grounding_facts=[
                        discoveryengine.GroundingFact(
                            fact_text=(
                                "The BigQuery documentation can be found at https://cloud.google.com/bigquery/docs/introduction"
                            ),
                            attributes={
                                "title": "BigQuery Overview",
                                "uri": "https://cloud.google.com/bigquery/docs/introduction",
                            },
                        ),
                    ]
                ),
            ),
            discoveryengine.GenerateGroundedContentRequest.GroundingSource(
                search_source=discoveryengine.GenerateGroundedContentRequest.GroundingSource.SearchSource(
                    # The full resource name of the serving config for a Vertex AI Search App
                    serving_config=f"projects/{project_number}/locations/global/collections/default_collection/engines/{engine_id}/servingConfigs/default_search",
                ),
            ),
        ]
    ),
)
response = client.generate_grounded_content(request)

# Handle the response
print(response)

依托 Google 搜索生成接地的回答

您可以根据公开提供的 Web 数据生成回答。

动态检索

您可以在请求中使用动态检索功能，选择何时关闭“依托 Google 搜索进行接地”功能。当提示不需要依托 Google 搜索进行接地的回答，并且支持的模型也能够基于自身知识在不进行接地的情况下提供回答时，就可以利用这项功能。这有助于您更有效地管理延迟时间、回答质量和费用。

动态检索预测得分和阈值

当您发送请求以生成有事实依据的回答时，AI 应用会为提示分配一个预测得分。预测得分是一个介于 [0,1] 范围内的浮点值。其值取决于提示是否能通过依托 Google 搜索中的最新信息来生成回答。因此，如果提示要求回答基于网络上最新的事实，则预测得分较高；如果提示只需要模型生成的回答，则预测得分较低。

以下是一些提示及其预测得分的示例。

提示	预测分数	评论
“写一首关于牡丹的诗”	0.13	模型可以依靠自身知识，回答不需要接地
“为 2 岁儿童推荐玩具”	0.36	模型可以依靠自身知识，回答不需要接地
“你能提供一份富有亚洲风味的鳄梨酱食谱吗？”	0.55	Google 搜索可以提供有依据的回答，但并非必须进行接地；模型知识可能足够
“什么是 AI 应用？AI 应用中，利用 Google 搜索建立回答依据的机制如何计费？	0.72	需要 Google 搜索来生成有理有据的回答
“谁赢得了最新的 F1 大奖赛？”	0.97	需要 Google 搜索来生成有理有据的回答

在基于事实依据的回答生成请求中，您可以指定具有阈值的动态检索配置。阈值是一个在 [0,1] 范围内的浮点值，默认为 0.7。如果阈值为零，则回答始终“依托 Google 搜索进行接地”。对于阈值的所有其他值，以下内容适用：

如果预测得分大于或等于阈值，则系统会根据 Google 搜索结果生成回答。阈值越低，意味着有更多提示的回答是使用 Google 搜索接地生成的。
如果预测得分低于阈值，模型可能仍会生成回答，但不会依托 Google 搜索进行接地。

为了找到适合您业务需求的理想阈值，您可以创建一组您预计会遇到的代表性查询。然后，您可以根据响应中的预测得分对查询进行排序，并为您的使用场景选择合适的阈值。

依托 Google 搜索为回答接地

以下示例展示了如何通过指定 Google 搜索作为依据来源，根据提示生成有依据的回答。此示例使用 generateGroundedContent 方法。

REST

在以下 curl 请求中发送提示。

curl -X POST \
-H "Authorization: Bearer $(gcloud auth print-access-token)" \
-H "Content-Type: application/json" \
"https://discoveryengine.googleapis.com/v1/projects/PROJECT_NUMBER/locations/global:generateGroundedContent" \
-d '
{
"contents": [
 {
   "role": "user",
   "parts": [
     {
       "text": "PROMPT_TEXT"
     }
   ]
 }
],
"systemInstruction": {
   "parts": {
       "text": "SYSTEM_INSTRUCTION"
   }
},
"groundingSpec": {
 "groundingSources": [
 {
     "googleSearchSource": {
          "dynamicRetrievalConfig": {
              "predictor":{
                  "threshold": DYNAMIC_RETRIEVAL_THRESHOLD
              }
          }
     }
 }
]
},
"generationSpec": {
 "modelId": "MODEL_ID",
 "temperature": TEMPERATURE,
 "topP": TOP_P,
 "topK": TOP_K
},
"user_context": {
 "languageCode: "LANGUAGE_CODE",
 "latLng": {
   "latitude": LATITUDE,
   "longitude": LONGITUDE
 },
}
}'

替换以下内容：

PROJECT_NUMBER：您的 Google Cloud 项目的编号。
PROMPT_TEXT：用户的提示。
SYSTEM_INSTRUCTION：一个可选字段，用于提供序言或一些其他背景信息。
DYNAMIC_RETRIEVAL_THRESHOLD：可选字段，用于设置调用动态检索配置的阈值。该值是一个介于 [0,1] 范围内的浮点值。如果您添加了 dynamicRetrievalConfig 字段，但未设置 predictor 或 threshold 字段，则阈值默认为 0.7。如果您不设置 dynamicRetrievalConfig 字段，则回答始终会依托 Google 搜索进行接地。
MODEL_ID：一个可选字段，用于设置您希望用来生成有事实依据的答案的 Gemini 模型的模型 ID。如需查看可用模型 ID 的列表，请参阅支持的模型。
TEMPERATURE：一个可选字段，用于设置采样时使用的温度。Google 建议将温度设为 0.0。如需了解详情，请参阅 Gemini 模型参数。
TOP_P：一个可选字段，用于设置模型的 top-P 值。如需了解详情，请参阅 Gemini 模型参数。
TOP_K：一个可选字段，用于设置模型的 Top-K 值。如需了解详情，请参阅 Gemini 模型参数。
LANGUAGE_CODE：一个可选字段，可用于设置生成的答案和返回的块文本的语言。如果无法从查询中确定语言，则使用此字段。默认值为 en。如需查看语言代码列表，请参阅语言。
LATITUDE：用于设置纬度的可选字段。以十进制度输入值，例如 -25.34。
LONGITUDE：用于设置经度的可选字段。以十进制度输入值，例如 131.04。

响应

您应该会收到类似以下截断的 JSON 响应。如需了解响应，请参阅输出数据。

{
 "candidates": [
   {
     "content": {
       "role": "model",
       "parts": [
         {
           "text": "ANSWER_TEXT"
         }
       ]
     },
     "groundingScore":GROUNDING_SCORE,
     "groundingMetadata": {
       "supportChunks": [
         {
          "source": "0",
          "sourceMetadata": {
            "uri": "REDIRECTION_URI",
            "domain": "PUBLISHER_DOMAIN"
          }
         }
       ],
       "groundingSupport": [
         {
           "claimText": "CLAIM_TEXT_1",
           "supportScore": SUPPORT_SCORE,
           "supportChunkIndices": [
             0
           ]
         },
         {
           "claimText": "CLAIM_TEXT_2",
           "supportScore": SUPPORT_SCORE,
           "supportChunkIndices": [
             0
           ]
         }
       ],
       "webSearchQueries": [
         "QUERY_BUILT_FROM_USER_PROMPT"
       ],
       "searchEntryPoint": {
         "renderedContent": "RENDERED_CONTENT"
       },
     }
   }
 ]
}
{
 "candidates": [
   {
     "content": {
       "role": "model",
       "parts": [
         {
           "text": "ANSWER_TEXT"
         }
       ]
     },
     "groundingScore":GROUNDING_SCORE,
     "groundingMetadata": {
       "supportChunks": [
         {}
       ],
       "groundingSupport": [
         {
           "claimText": "CLAIM_TEXT_1",
           "supportScore": SUPPORT_SCORE,
           "supportChunkIndices": [
             0
           ]
         },
         {
           "claimText": "CLAIM_TEXT_2",
           "supportScore": SUPPORT_SCORE,
           "supportChunkIndices": [
             0
           ]
         }
       ],
       "webSearchQueries": [
         "QUERY_BUILT_FROM_USER_PROMPT"
       ],
       "searchEntryPoint": {
         "renderedContent": "RENDERED_CONTENT"
       },
       "retrievalMetadata": [
         {
           "source": "GOOGLE_SEARCH",
           "dynamicRetrievalMetadata": {
             "predictorMetadata": {
               "version": "V1_INDEPENDENT",
               "prediction": PREDICTION_SCORE
             }
           }
         }
       ]
     }
   }
 ]
}

Python

from google.cloud import discoveryengine_v1 as discoveryengine

# TODO(developer): Uncomment these variables before running the sample.
# project_number = "YOUR_PROJECT_NUMBER"

client = discoveryengine.GroundedGenerationServiceClient()

request = discoveryengine.GenerateGroundedContentRequest(
    # The full resource name of the location.
    # Format: projects/{project_number}/locations/{location}
    location=client.common_location_path(project=project_number, location="global"),
    generation_spec=discoveryengine.GenerateGroundedContentRequest.GenerationSpec(
        model_id="default",
    ),
    # Conversation between user and model
    contents=[
        discoveryengine.GroundedGenerationContent(
            role="user",
            parts=[
                discoveryengine.GroundedGenerationContent.Part(
                    text="How much is Google stock?"
                )
            ],
        )
    ],
    system_instruction=discoveryengine.GroundedGenerationContent(
        parts=[
            discoveryengine.GroundedGenerationContent.Part(text="Be comprehensive.")
        ],
    ),
    # What to ground on.
    grounding_spec=discoveryengine.GenerateGroundedContentRequest.GroundingSpec(
        grounding_sources=[
            discoveryengine.GenerateGroundedContentRequest.GroundingSource(
                google_search_source=discoveryengine.GenerateGroundedContentRequest.GroundingSource.GoogleSearchSource(
                    # Optional: For Dynamic Retrieval
                    dynamic_retrieval_config=discoveryengine.GenerateGroundedContentRequest.DynamicRetrievalConfiguration(
                        predictor=discoveryengine.GenerateGroundedContentRequest.DynamicRetrievalConfiguration.DynamicRetrievalPredictor(
                            threshold=0.7
                        )
                    )
                )
            ),
        ]
    ),
)
response = client.generate_grounded_content(request)

# Handle the response
print(response)

基于 Google 搜索结果生成单轮回答的示例

在以下示例中，请求将 Google 搜索指定为依据来源。此示例使用 generateGroundedContent 方法。此示例还使用系统指令来让回答以笑脸表情符号结尾。

REST

curl -X POST \
-H "Authorization: Bearer $(gcloud auth print-access-token)" \
-H "Content-Type: application/json" \
"https://discoveryengine.googleapis.com/v1/projects/123456/locations/global:generateGroundedContent" \
-d '
{
"contents": [{
  "role": "user",
  "parts": [{
    "text": "What is ai applications?"
}]
}],
"systemInstruction": {
   "parts": {
      "text": "Add a smiley emoji after the answer."
   }
},
"groundingSpec": {
  "groundingSources": [
  {
      "googleSearchSource": {
        "dynamicRetrievalConfig": {
               "predictor":{
                   "threshold": 0.6
               }
           }
      }
  }
 ]
},
"generationSpec": {
  "modelId": "gemini-1.5-flash"
}
}
'

响应

您应该会收到类似以下截断的 JSON 响应。如需了解响应，请参阅输出数据。

{
 "candidates": [
   {
     "content": {
       "role": "model",
       "parts": [
         {
           "text": "AI Applications is a platform developed by Google Cloud that simplifies the creation and deployment of generative AI agents. It offers both no-code and code-first approaches, allowing developers of all skill levels to build AI-powered agents. \n\nHere are some key features of AI Applications:\n\n* **No-code interface:**  Use natural language to design and build agents without writing code.\n* **Code-first approach:**  Utilize powerful orchestration and customization capabilities, including LangChain on Vertex AI.\n* **Enterprise-grade security and compliance:**  Built-in security, compliance, and governance features align with industry certifications like HIPAA, ISO 27000-series, SOC-1/2/3, VPC-SC, and CMEK.\n* **Integration with enterprise data:**  Easily ground your agents in enterprise data using Vertex AI Search and Retrieval Augmented Generation (RAG) APIs.\n* **Pre-built templates:**  Rapidly prototype and experiment with pre-built templates for conversational AI and process automation agents.\n* **Advanced integrations:**  Supports integrations with frameworks like LlamaIndex and LangChain for enhanced AI capabilities.\n* **Natural language understanding (NLU):**  Accurate query responses and support for multiple languages.\n\nAI Applications is designed to help developers create AI agents that can:\n\n* Answer complex questions\n* Provide support and personalize user experiences\n* Automate tasks and processes\n* Interact with backend systems\n\nOverall, AI Applications is a powerful tool that makes it easier for developers to build and deploy generative AI agents, regardless of their experience level. 😊 \n"
         }
       ]
     },
     "groundingScore": 0.80400103,
     "groundingMetadata": {
       "supportChunks": [
         {
          "source": "0",
          "sourceMetadata": {
          "uri": "https://vertexaisearch.cloud.google.com/grounding-api-redirect/{unique_string}",
          "domain": "example.com"
         }
        }
       ],
       "groundingSupport": [
         {
           "claimText": "AI Applications is a platform developed by Google Cloud that simplifies the creation and deployment of generative AI agents.",
           "supportScore": 0.9541752,
           "supportChunkIndices": [
             0
           ]
         },
         {
           "claimText": "It offers both no-code and code-first approaches, allowing developers of all skill levels to build AI-powered agents.",
           "supportScore": 0.9648506,
           "supportChunkIndices": [
             0
           ]
         },
         {
           "claimText": "* **No-code interface:**  Use natural language to design and build agents without writing code.",
           "supportScore": 0.77115613,
           "supportChunkIndices": [
             0
           ]
         },
         {
           "claimText": "* **Code-first approach:**  Utilize powerful orchestration and customization capabilities, including LangChain on Vertex AI.",
           "supportScore": 0.8540146,
           "supportChunkIndices": [
             0
           ]
         },
         {
           "claimText": "* **Enterprise-grade security and compliance:**  Built-in security, compliance, and governance features align with industry certifications like HIPAA, ISO 27000-series, SOC-1/2/3, VPC-SC, and CMEK.",
           "supportScore": 0.9574074,
           "supportChunkIndices": [
             0
           ]
         },
         {
           "claimText": "* **Integration with enterprise data:**  Easily ground your agents in enterprise data using Vertex AI Search and Retrieval Augmented Generation (RAG) APIs.",
           "supportScore": 0.9533333,
           "supportChunkIndices": [
             0
           ]
         },
         {
           "claimText": "* **Pre-built templates:**  Rapidly prototype and experiment with pre-built templates for conversational AI and process automation agents.",
           "supportScore": 0.9457701,
           "supportChunkIndices": [
             0
           ]
         },
         {
           "claimText": "* **Advanced integrations:**  Supports integrations with frameworks like LlamaIndex and LangChain for enhanced AI capabilities.",
           "supportScore": 0.9541752,
           "supportChunkIndices": [
             0
           ]
         },
         {
           "claimText": "* **Natural language understanding (NLU):**  Accurate query responses and support for multiple languages.",
           "supportScore": 0.97726375,
           "supportChunkIndices": [
             0
           ]
         },
         {
           "claimText": "* Provide support and personalize user experiences",
           "supportScore": 0.8540146,
           "supportChunkIndices": [
             0
           ]
         },
         {
           "claimText": "* Automate tasks and processes",
           "supportScore": 0.82046676,
           "supportChunkIndices": [
             0
           ]
         }
       ],
       "webSearchQueries": [
         "what is ai applications"
       ],
       "searchEntryPoint": {
         "renderedContent": "\u003cstyle\u003e\n.container {\n  align-items: center;\n  border-radius: 8px;\n  display: flex;\n  font-family: Google Sans, Roboto, sans-serif;\n  font-size: 14px;\n  line-height: 20px;\n  padding: 8px 12px;\n}\n.chip {\n  display: inline-block;\n  border: solid 1px;\n  border-radius: 16px;\n  min-width: 14px;\n  padding: 5px 16px;\n  text-align: center;\n  user-select: none;\n  margin: 0 8px;\n  -webkit-tap-highlight-color: transparent;\n}\n.carousel {\n  overflow: auto;\n  scrollbar-width: none;\n  white-space: nowrap;\n  margin-right: -12px;\n}\n.headline {\n  display: flex;\n  margin-right: 4px;\n}\n.gradient-container {\n  position: relative;\n}\n.gradient {\n  position: absolute;\n  transform: translate(3px, -9px);\n  height: 36px;\n  width: 9px;\n}\n@media (prefers-color-scheme: light) {\n  .container {\n    background-color: #fafafa;\n    box-shadow: 0 0 0 1px #0000000f;\n  }\n  .headline-label {\n    color: #1f1f1f;\n  }\n  .chip {\n    background-color: #ffffff;\n    border-color: #d2d2d2;\n    color: #5e5e5e;\n    text-decoration: none;\n  }\n  .chip:hover {\n    background-color: #f2f2f2;\n  }\n  .chip:focus {\n    background-color: #f2f2f2;\n  }\n  .chip:active {\n    background-color: #d8d8d8;\n    border-color: #b6b6b6;\n  }\n  .logo-dark {\n    display: none;\n  }\n  .gradient {\n    background: linear-gradient(90deg, #fafafa 15%, #fafafa00 100%);\n  }\n}\n@media (prefers-color-scheme: dark) {\n  .container {\n    background-color: #1f1f1f;\n    box-shadow: 0 0 0 1px #ffffff26;\n  }\n  .headline-label {\n    color: #fff;\n  }\n  .chip {\n    background-color: #2c2c2c;\n    border-color: #3c4043;\n    color: #fff;\n    text-decoration: none;\n  }\n  .chip:hover {\n    background-color: #353536;\n  }\n  .chip:focus {\n    background-color: #353536;\n  }\n  .chip:active {\n    background-color: #464849;\n    border-color: #53575b;\n  }\n  .logo-light {\n    display: none;\n  }\n  .gradient {\n    background: linear-gradient(90deg, #1f1f1f 15%, #1f1f1f00 100%);\n  }\n}\n\u003c/style\u003e\n\u003cdiv class=\"container\"\u003e\n  \u003cdiv class=\"headline\"\u003e\n    \u003csvg class=\"logo-light\" width=\"18\" height=\"18\" viewBox=\"9 9 35 35\" fill=\"none\" xmlns=\"http://www.w3.org/2000/svg\"\u003e\n      \u003cpath fill-rule=\"evenodd\" clip-rule=\"evenodd\" d=\"M42.8622 27.0064C42.8622 25.7839 42.7525 24.6084 42.5487 23.4799H26.3109V30.1568H35.5897C35.1821 32.3041 33.9596 34.1222 32.1258 35.3448V39.6864H37.7213C40.9814 36.677 42.8622 32.2571 42.8622 27.0064V27.0064Z\" fill=\"#4285F4\"/\u003e\n      \u003cpath fill-rule=\"evenodd\" clip-rule=\"evenodd\" d=\"M26.3109 43.8555C30.9659 43.8555 34.8687 42.3195 37.7213 39.6863L32.1258 35.3447C30.5898 36.3792 28.6306 37.0061 26.3109 37.0061C21.8282 37.0061 18.0195 33.9811 16.6559 29.906H10.9194V34.3573C13.7563 39.9841 19.5712 43.8555 26.3109 43.8555V43.8555Z\" fill=\"#34A853\"/\u003e\n      \u003cpath fill-rule=\"evenodd\" clip-rule=\"evenodd\" d=\"M16.6559 29.8904C16.3111 28.8559 16.1074 27.7588 16.1074 26.6146C16.1074 25.4704 16.3111 24.3733 16.6559 23.3388V18.8875H10.9194C9.74388 21.2072 9.06992 23.8247 9.06992 26.6146C9.06992 29.4045 9.74388 32.022 10.9194 34.3417L15.3864 30.8621L16.6559 29.8904V29.8904Z\" fill=\"#FBBC05\"/\u003e\n      \u003cpath fill-rule=\"evenodd\" clip-rule=\"evenodd\" d=\"M26.3109 16.2386C28.85 16.2386 31.107 17.1164 32.9095 18.8091L37.8466 13.8719C34.853 11.082 30.9659 9.3736 26.3109 9.3736C19.5712 9.3736 13.7563 13.245 10.9194 18.8875L16.6559 23.3388C18.0195 19.2636 21.8282 16.2386 26.3109 16.2386V16.2386Z\" fill=\"#EA4335\"/\u003e\n    \u003c/svg\u003e\n    \u003csvg class=\"logo-dark\" width=\"18\" height=\"18\" viewBox=\"0 0 48 48\" xmlns=\"http://www.w3.org/2000/svg\"\u003e\n      \u003ccircle cx=\"24\" cy=\"23\" fill=\"#FFF\" r=\"22\"/\u003e\n      \u003cpath d=\"M33.76 34.26c2.75-2.56 4.49-6.37 4.49-11.26 0-.89-.08-1.84-.29-3H24.01v5.99h8.03c-.4 2.02-1.5 3.56-3.07 4.56v.75l3.91 2.97h.88z\" fill=\"#4285F4\"/\u003e\n      \u003cpath d=\"M15.58 25.77A8.845 8.845 0 0 0 24 31.86c1.92 0 3.62-.46 4.97-1.31l4.79 3.71C31.14 36.7 27.65 38 24 38c-5.93 0-11.01-3.4-13.45-8.36l.17-1.01 4.06-2.85h.8z\" fill=\"#34A853\"/\u003e\n      \u003cpath d=\"M15.59 20.21a8.864 8.864 0 0 0 0 5.58l-5.03 3.86c-.98-2-1.53-4.25-1.53-6.64 0-2.39.55-4.64 1.53-6.64l1-.22 3.81 2.98.22 1.08z\" fill=\"#FBBC05\"/\u003e\n      \u003cpath d=\"M24 14.14c2.11 0 4.02.75 5.52 1.98l4.36-4.36C31.22 9.43 27.81 8 24 8c-5.93 0-11.01 3.4-13.45 8.36l5.03 3.85A8.86 8.86 0 0 1 24 14.14z\" fill=\"#EA4335\"/\u003e\n    \u003c/svg\u003e\n    \u003cdiv class=\"gradient-container\"\u003e\u003cdiv class=\"gradient\"\u003e\u003c/div\u003e\u003c/div\u003e\n  \u003c/div\u003e\n  \u003cdiv class=\"carousel\"\u003e\n    \u003ca class=\"chip\" href=\"https://www.google.com/search?q=what+is+ai-applications&client=app-vertex-grounding&safesearch=active\"\u003ewhat is ai applications\u003c/a\u003e\n  \u003c/div\u003e\n\u003c/div\u003e\n"
       },
       "retrievalMetadata": [
         {
           "source": "GOOGLE_SEARCH",
           "dynamicRetrievalMetadata": {
             "predictorMetadata": {
               "version": "V1_INDEPENDENT",
               "prediction": 0.671875
             }
           }
         }
       ]
     }
   }
 ]
}

在多个回合中生成有依据的回答

在多回合回答生成中，您必须在每个请求中发送用户与模型在之前所有回合中交换的所有文本。这样可确保连续性并保持上下文，以便为最新提示生成回答。

如需通过多轮回答生成功能获取有依据的回答，请执行以下操作：

REST

以下示例展示了如何通过多个轮次发送后续提示文本。这些示例使用 generateGroundedContent 方法，并依托 Google 搜索来生成答案。您可以使用类似的步骤，通过其他依据来源生成有依据的回答。

在以下 curl 请求中发送第一个提示。
```
curl -X POST \
-H "Authorization: Bearer $(gcloud auth print-access-token)" \
-H "Content-Type: application/json" \
"https://discoveryengine.googleapis.com/v1/projects/PROJECT_NUMBER/locations/global:generateGroundedContent" \
-d '
{
"contents": [
 {
   "role": "user",
   "parts": [
     {
       "text": "PROMPT_TEXT_TURN_1"
     }
   ]
 }
],
"systemInstruction": {
   "parts": {
       "text": "SYSTEM_INSTRUCTION_TURN_1"
   }
},
"groundingSpec": {
 "groundingSources": [
   {
     "googleSearchSource": {}
   }
 ]
},
"generationSpec": {
 "modelId": "MODEL_ID",
 "temperature": TEMPERATURE,
 "topP": TOP_P,
 "topK": TOP_K
},
"user_context": {
 "languageCode: "LANGUAGE_CODE",
 "latLng": {
   "latitude": LATITUDE,
   "longitude": LONGITUDE
 },
}
}'
```
替换以下内容：
- PROJECT_NUMBER：您的 Google Cloud 项目的编号。
- PROMPT_TEXT_TURN_1：第一轮中用户的提示文本。
- SYSTEM_INSTRUCTION_TURN_1：一个可选字段，用于提供序言或一些其他背景信息。对于多轮回答生成，您必须在每一轮中提供系统指令。
- MODEL_ID：一个可选字段，用于设置您希望用来生成有事实依据的答案的 Gemini 模型的模型 ID。如需查看可用模型 ID 的列表，请参阅支持的模型。
- TEMPERATURE：一个可选字段，用于设置采样时使用的温度。Google 建议将温度设为 0.0。如需了解详情，请参阅 Gemini 模型参数。
- TOP_P：一个可选字段，用于设置模型的 top-P 值。如需了解详情，请参阅 Gemini 模型参数。
- TOP_K：一个可选字段，用于设置模型的 Top-K 值。如需了解详情，请参阅 Gemini 模型参数。
- LANGUAGE_CODE：一个可选字段，可用于设置生成的答案和返回的块文本的语言。如果无法从查询中确定语言，则使用此字段。默认值为 en。如需查看语言代码列表，请参阅语言。
- LATITUDE：用于设置纬度的可选字段。以十进制度输入值，例如 -25.34。
- LONGITUDE：用于设置经度的可选字段。以十进制度输入值，例如 131.04。
响应

您应该会收到类似以下截断的 JSON 响应。如需了解响应，请参阅输出数据。
```
{
"candidates": [
 {
   "content": {
     "role": "model",
     "parts": [
       {
         "text": "ANSWER_TEXT_TURN_1"
       }
     ]
   },
   "groundingScore": GROUNDING_SCORE,
   "groundingMetadata": {
     "supportChunks": [],
     "groundingSupport": [
       {
         "claimText": "CLAIM_TEXT_1",
         "supportChunkIndices": [
           0,
           1
         ]
       },
       {
         "claimText": "CLAIM_TEXT_2",
         "supportChunkIndices": [
           1
         ]
       }
     ],
     "webSearchQueries": [
       "QUERY_BUILT_FROM_USER_PROMPT"
     ],
     "searchEntryPoint": {
       "renderedContent": "RENDERED_CONTENT"
     }
   }
 }
]
} 
```
发送第二个提示作为后续提示。添加用户的第一个提示，然后添加模型针对该提示给出的相应回答，以提供上下文。
```
curl -X POST \
-H "Authorization: Bearer $(gcloud auth print-access-token)" \
-H "Content-Type: application/json" \
"https://discoveryengine.googleapis.com/v1/projects/PROJECT_NUMBER/locations/global:generateGroundedContent" \
-d '
{
"contents": [
 {
   "role": "user",
   "parts": [
     {
       "text": "PROMPT_TEXT_TURN_1"
     }
   ]
 },
 {
   "role": "model",
   "parts": [
     {
       "text": "ANSWER_TEXT_TURN_1"
     }
   ]
 },
 {
   "role": "user",
   "parts": [
     {
       "text": "PROMPT_TEXT_TURN_2"
     }
   ]
 }
],
"systemInstruction": {
   "parts": {
       "text": "SYSTEM_INSTRUCTION_TURN_2"
   }
},
"groundingSpec": {
 "groundingSources": [
   {
     "googleSearchSource": {}
   }
 ]
},
"generationSpec": {
 "modelId": "MODEL_ID",
 "temperature": TEMPERATURE,
 "topP": TOP_P,
 "topK": TOP_K
},
"user_context": {
 "languageCode: "LANGUAGE_CODE",
 "latLng": {
   "latitude": LATITUDE,
   "longitude": LONGITUDE
 },
}
}'
```
替换以下内容：
- PROJECT_NUMBER：您的 Google Cloud 项目的编号。
- PROMPT_TEXT_TURN_1：第一轮中用户的提示文本。
- ANSWER_TEXT_TURN_1：第一轮中模型给出的回答文本。
- PROMPT_TEXT_TURN_2：用户在第二轮对话中输入的提示文本。
- SYSTEM_INSTRUCTION_TURN_2：一个可选字段，用于提供序言或一些其他背景信息。对于多轮回答生成，您必须在每一轮中提供系统指令。
- MODEL_ID：一个可选字段，用于设置您希望用来生成有事实依据的答案的 Gemini 模型的模型 ID。如需查看可用模型 ID 的列表，请参阅支持的模型。
- TEMPERATURE：一个可选字段，用于设置采样时使用的温度。Google 建议将温度设为 0.0。如需了解详情，请参阅 Gemini 模型参数。
- TOP_P：一个可选字段，用于设置模型的 top-P 值。如需了解详情，请参阅 Gemini 模型参数。
- TOP_K：一个可选字段，用于设置模型的 Top-K 值。如需了解详情，请参阅 Gemini 模型参数。
- LANGUAGE_CODE：一个可选字段，可用于设置生成的答案和返回的块文本的语言。如果无法从查询中确定语言，则使用此字段。默认值为 en。如需查看语言代码列表，请参阅语言。
- LATITUDE：用于设置纬度的可选字段。以十进制度输入值，例如 -25.34。
- LONGITUDE：用于设置经度的可选字段。以十进制度输入值，例如 131.04。
响应

您应该会收到类似以下截断的 JSON 响应。如需了解响应，请参阅输出数据。
```
{
"candidates": [
 {
   "content": {
     "role": "model",
     "parts": [
       {
         "text": "ANSWER_TEXT_TURN_2"
       }
     ]
   },
   "groundingScore": GROUNDING_SCORE,
   "groundingMetadata": {
     "supportChunks": [],
     "groundingSupport": [
       {
         "claimText": "CLAIM_TEXT_1",
         "supportChunkIndices": [
           0
         ]
       },
       {
         "claimText": "CLAIM_TEXT_2",
         "supportChunkIndices": [
           1,
           2
         ]
       }
     ],
     "webSearchQueries": [
       "QUERY_BUILT_FROM_USER_PROMPT"
     ],
     "searchEntryPoint": {
       "renderedContent": "RENDERED_CONTENT"
     }
   }
 }
]
}   
```
重复此流程，以获得更多后续回答。在每个回合中，添加用户之前的所有提示，然后添加模型给出的相应回答。

多轮回答生成示例

在以下示例中，请求指定了三个内嵌事实文本作为依据来源，以生成两个轮次的回答。此示例使用 generateGroundedContent 方法。此示例还使用系统指令在第一轮中以笑脸表情符号结束回答。

REST

在以下 curl 请求中发送第一个提示。

curl -X POST \
-H "Authorization: Bearer $(gcloud auth print-access-token)" \
-H "Content-Type: application/json" \
"https://discoveryengine.googleapis.com/v1/projects/123456/locations/global:generateGroundedContent" \
-d '
{
"contents": [
 {
   "role": "user",
   "parts": [
     {
       "text": "Summarize what happened in 2023 in one paragraph."
     }
   ]
 }
],
"systemInstruction": {
  "parts": {
      "text": "Add a smiley emoji after the answer."
  }
},
"grounding_spec": {
 "grounding_sources": [
   {
     "inline_source": {
       "grounding_facts": [
         {
           "fact_text": "In 2023, the world population surpassed 8 billion. This milestone marked a significant moment in human history, highlighting both the rapid growth of our species and the challenges of resource management and sustainability in the years to come.",
           "attributes": {
             "title": "title_1",
             "uri": "some-uri-1"
           }
         }
       ]
     }
   },
   {
     "inline_source": {
       "grounding_facts": [
         {
           "fact_text": "In 2023, global e-commerce sales reached an estimated $5.7 trillion. The continued rise of online shopping solidified its position as a dominant force in retail, with major implications for traditional brick-and-mortar stores and the logistics networks supporting worldwide deliveries.",
           "attributes": {
             "title": "title_2",
             "uri": "some-uri-2"
           }
         }
       ]
     }
   },
   {
     "inline_source": {
       "grounding_facts": [
         {
           "fact_text": "In 2023, the global average surface temperature was approximately 0.2 degrees Celsius higher than the 20th-century average. This continued the worrying trend of global warming, underscoring the urgency of worldwide climate initiatives, carbon reduction efforts, and investment in renewable energy sources.",
           "attributes": {
             "title": "title_3",
             "uri": "some-uri-3"
           }
         }
       ]
     }
   }
 ]
},
"generationSpec": {
 "modelId": "gemini-1.5-flash"
}
}'

响应

您应该会收到类似以下截断的 JSON 响应。如需了解响应，请参阅输出数据。

{
"candidates": [
 {
   "content": {
     "role": "model",
     "parts": [
       {
         "text": "In 2023, the global average surface temperature increased, the world population surpassed 8 billion, and global e-commerce sales reached an estimated $5.7 trillion.  😊 \n"
       }
     ]
   },
   "groundingScore": 1,
   "groundingMetadata": {
     "supportChunks": [
       {
         "chunkText": "In 2023, global e-commerce sales reached an estimated $5.7 trillion. The continued rise of online shopping solidified its position as a dominant force in retail, with major implications for traditional brick-and-mortar stores and the logistics networks supporting worldwide deliveries. ",
         "source": "1",
         "sourceMetadata": {
           "uri": "some-uri-2",
           "title": "title_2"
         }
       },
       {
         "chunkText": "In 2023, the world population surpassed 8 billion. This milestone marked a significant moment in human history, highlighting both the rapid growth of our species and the challenges of resource management and sustainability in the years to come. ",
         "source": "0",
         "sourceMetadata": {
           "uri": "some-uri-1",
           "title": "title_1"
         }
       },
       {
         "chunkText": "In 2023, the global average surface temperature was approximately 0.2 degrees Celsius higher than the 20th-century average. This continued the worrying trend of global warming, underscoring the urgency of worldwide climate initiatives, carbon reduction efforts, and investment in renewable energy sources. ",
         "source": "2",
         "sourceMetadata": {
           "title": "title_3",
           "uri": "some-uri-3"
         }
       }
     ],
     "groundingSupport": [
       {
         "claimText": "In 2023, the global average surface temperature increased, the world population surpassed 8 billion, and global e-commerce sales reached an estimated $5.7 trillion.",
         "supportScore": 1,
         "supportChunkIndices": [
           0,
           1,
           2
         ]
       }
     ]
   }
 }
]
}

发送第二个提示作为后续提示。添加用户的第一个提示，然后添加模型针对该提示给出的相应回答，以提供上下文。

curl -X POST \
-H "Authorization: Bearer $(gcloud auth print-access-token)" \
-H "Content-Type: application/json" \
"https://discoveryengine.googleapis.com/v1/projects/123456/locations/global:generateGroundedContent" \
-d '
{
"contents": [
 {
   "role": "user",
   "parts": [
     {
       "text": "Summarize what happened in 2023 in one paragraph."
     }
   ]
 },
 {
   "role": "model",
   "parts": [
     {
       "text": "In 2023, the global average surface temperature increased, the world population surpassed 8 billion, and global e-commerce sales reached an estimated $5.7 trillion.  😊 \n"
     }
   ]
 },
 {
   "role": "user",
   "parts": [
     {
       "text": "Rephrase the answer in an abstracted list."
     }
   ]
 }
],
"grounding_spec": {
 "grounding_sources": [
   {
     "inline_source": {
       "grounding_facts": [
         {
           "fact_text": "In 2023, the world population surpassed 8 billion. This milestone marked a significant moment in human history, highlighting both the rapid growth of our species and the challenges of resource management and sustainability in the years to come.",
           "attributes": {
             "title": "title_1",
             "uri": "some-uri-1"
           }
         }
       ]
     }
   },
   {
     "inline_source": {
       "grounding_facts": [
         {
           "fact_text": "In 2023, global e-commerce sales reached an estimated $5.7 trillion. The continued rise of online shopping solidified its position as a dominant force in retail, with major implications for traditional brick-and-mortar stores and the logistics networks supporting worldwide deliveries.",
           "attributes": {
             "title": "title_2",
             "uri": "some-uri-2"
           }
         }
       ]
     }
   },
   {
     "inline_source": {
       "grounding_facts": [
         {
           "fact_text": "In 2023, the global average surface temperature was approximately 0.2 degrees Celsius higher than the 20th-century average. This continued the worrying trend of global warming, underscoring the urgency of worldwide climate initiatives, carbon reduction efforts, and investment in renewable energy sources.",
           "attributes": {
             "title": "title_3",
             "uri": "some-uri-3"
           }
         }
       ]
     }
   }
 ]
},
"generationSpec": {
 "modelId": "gemini-1.5-flash"
}
}'

响应

您应该会收到类似以下截断的 JSON 响应。如需了解响应，请参阅输出数据。

{
"candidates": [
 {
   "content": {
     "role": "model",
     "parts": [
       {
         "text": "- The global average surface temperature increased in 2023.\n- The world population surpassed 8 billion in 2023.\n- Global e-commerce sales reached an estimated $5.7 trillion in 2023. \n"
       }
     ]
   },
   "groundingScore": 0.99073017,
   "groundingMetadata": {
     "supportChunks": [
       {
         "chunkText": "In 2023, the global average surface temperature was approximately 0.2 degrees Celsius higher than the 20th-century average. This continued the worrying trend of global warming, underscoring the urgency of worldwide climate initiatives, carbon reduction efforts, and investment in renewable energy sources. ",
         "source": "2",
         "sourceMetadata": {
           "uri": "some-uri-3",
           "title": "title_3"
         }
       },
       {
         "chunkText": "In 2023, the world population surpassed 8 billion. This milestone marked a significant moment in human history, highlighting both the rapid growth of our species and the challenges of resource management and sustainability in the years to come. ",
         "source": "0",
         "sourceMetadata": {
           "uri": "some-uri-1",
           "title": "title_1"
         }
       },
       {
         "chunkText": "In 2023, global e-commerce sales reached an estimated $5.7 trillion. The continued rise of online shopping solidified its position as a dominant force in retail, with major implications for traditional brick-and-mortar stores and the logistics networks supporting worldwide deliveries. ",
         "source": "1",
         "sourceMetadata": {
           "title": "title_2",
           "uri": "some-uri-2"
         }
       }
     ],
     "groundingSupport": [
       {
         "claimText": "- The global average surface temperature increased in 2023.",
         "supportScore": 0.9883382,
         "supportChunkIndices": [
           0
         ]
       },
       {
         "claimText": "- The world population surpassed 8 billion in 2023.",
         "supportScore": 0.9919262,
         "supportChunkIndices": [
           1
         ]
       },
       {
         "claimText": "- Global e-commerce sales reached an estimated $5.7 trillion in 2023.",
         "supportScore": 0.9919262,
         "supportChunkIndices": [
           2
         ]
       }
     ]
   }
 }
]
}

流式传输已接地的回答

您可以选择从模型中以流式传输回答。在答案特别长且一次性发送整个回答会导致明显延迟的用例中，此功能非常有用。对答案进行流式传输会将回答分解为多个候选答案组成的数组，其中包含回答文本的连续部分。

如需获取流式传输的基于事实的答案，请执行以下操作：

REST

以下示例展示了如何以流式传输方式获取有事实依据的回答。此示例使用 streamGenerateGroundedContent 方法，并根据 Google 搜索结果生成答案，而无需动态检索配置。您可以使用类似的步骤，通过其他依据来源生成有依据的回答。

在以下 curl 请求中发送提示。

curl -X POST \
-H "Authorization: Bearer $(gcloud auth print-access-token)" \
-H "Content-Type: application/json" \
"https://discoveryengine.googleapis.com/v1alpha/projects/PROJECT_NUMBER/locations/global:streamGenerateGroundedContent" \
-d '
[
{
 "contents": [
   {
     "role": "user",
     "parts": [
       {
         "text": "PROMPT_TEXT"
       }
     ]
   }
 ],
 "systemInstruction": {
     "parts": {
         "text": "SYSTEM_INSTRUCTION"
     }
 },
 "groundingSpec": {
   "groundingSources": [
     {
       "googleSearchSource": {}
     }
   ]
 },
"generationSpec": {
 "modelId": "MODEL_ID",
 "temperature": TEMPERATURE,
 "topP": TOP_P,
 "topK": TOP_K
},
"user_context": {
 "languageCode: "LANGUAGE_CODE",
 "latLng": {
   "latitude": LATITUDE,
   "longitude": LONGITUDE
 },
}
}
]'

替换以下内容：

PROJECT_NUMBER：您的 Google Cloud 项目的编号。
PROMPT_TEXT：用户的提示。
SYSTEM_INSTRUCTION：一个可选字段，用于提供序言或一些其他背景信息。
MODEL_ID：一个可选字段，用于设置您希望用来生成有事实依据的答案的 Gemini 模型的模型 ID。如需查看可用模型 ID 的列表，请参阅支持的模型。
TEMPERATURE：一个可选字段，用于设置采样时使用的温度。Google 建议将温度设为 0.0。如需了解详情，请参阅 Gemini 模型参数。
TOP_P：一个可选字段，用于设置模型的 top-P 值。如需了解详情，请参阅 Gemini 模型参数。
TOP_K：一个可选字段，用于设置模型的 Top-K 值。如需了解详情，请参阅 Gemini 模型参数。
LANGUAGE_CODE：一个可选字段，可用于设置生成的答案和返回的块文本的语言。如果无法从查询中确定语言，则使用此字段。默认值为 en。如需查看语言代码列表，请参阅语言。
LATITUDE：用于设置纬度的可选字段。以十进制度输入值，例如 -25.34。
LONGITUDE：用于设置经度的可选字段。以十进制度输入值，例如 131.04。

响应

您应该会收到类似以下截断的 JSON 响应。如需了解响应，请参阅输出数据。

[{
 "candidates": [
   {
     "content": {
       "role": "model",
       "parts": [
         {
           "text": "ANSWER_TEXT_PART_1"
         }
       ]
     }
   }
 ]
},
{
 "candidates": [
   {
     "content": {
       "role": "model",
       "parts": [
         {
           "text": "ANSWER_TEXT_PART_2"
         }
       ]
     }
   }
 ]
},
{
 "candidates": [
   {
     "content": {
       "role": "model",
       "parts": [
         {
           "text": "ANSWER_TEXT_PART_3"
         }
       ]
     }
   }
 ]
},
{
 "candidates": [
   {
     "groundingMetadata": {
       "supportChunks": [
         {
          "source": "0",
          "sourceMetadata": {
            "uri": "REDIRECTION_URI",
            "domain": "PUBLISHER_DOMAIN"
          }
         }
       ],
       "groundingSupport": [
         {
           "claimText": "CLAIM_TEXT_1",
           "supportChunkIndices": [
             0
           ]
         }
       ],
       "webSearchQueries": [
         "QUERY_BUILT_FROM_USER_PROMPT"
       ],
       "searchEntryPoint": {
         "renderedContent": "RENDERED_CONTENT"
       }
     }
   }
 ]
}]

Python

from google.cloud import discoveryengine_v1 as discoveryengine

# TODO(developer): Uncomment these variables before running the sample.
# project_id = "YOUR_PROJECT_ID"

client = discoveryengine.GroundedGenerationServiceClient()

request = discoveryengine.GenerateGroundedContentRequest(
    # The full resource name of the location.
    # Format: projects/{project_number}/locations/{location}
    location=client.common_location_path(project=project_number, location="global"),
    generation_spec=discoveryengine.GenerateGroundedContentRequest.GenerationSpec(
        model_id="default",
    ),
    # Conversation between user and model
    contents=[
        discoveryengine.GroundedGenerationContent(
            role="user",
            parts=[
                discoveryengine.GroundedGenerationContent.Part(
                    text="Summarize how to delete a data store in Vertex AI Agent Builder?"
                )
            ],
        )
    ],
    grounding_spec=discoveryengine.GenerateGroundedContentRequest.GroundingSpec(
        grounding_sources=[
            discoveryengine.GenerateGroundedContentRequest.GroundingSource(
                google_search_source=discoveryengine.GenerateGroundedContentRequest.GroundingSource.GoogleSearchSource()
            ),
        ]
    ),
)
responses = client.stream_generate_grounded_content(iter([request]))

for response in responses:
    # Handle the response
    print(response)

流式传输有依据的回答的示例

在以下示例中，请求指定 Google 搜索作为基础来源来流式传输答案，而无需动态检索配置。流式回答分布在多个候选回答中。此示例使用 streamGenerateGroundedContent 方法。

REST

curl -X POST \
-H "Authorization: Bearer $(gcloud auth print-access-token)" \
-H "Content-Type: application/json" \
"https://discoveryengine.googleapis.com/v1alpha/projects/123456/locations/global:streamGenerateGroundedContent" \
-d '
[
{
  "contents": [
    {
      "role": "user",
      "parts": [
        {
          "text": "Summarize How to delete a data store in AI Applications?"
        }
      ]
    }
  ],
  "groundingSpec": {
    "groundingSources": [
      {
        "googleSearchSource": {}
      }
    ]
  },
  "generationSpec": {
    "modelId": "gemini-1.5-flash"
  }
}
]'

响应

您应该会收到类似以下截断的 JSON 响应。如需了解响应，请参阅输出数据。

[{
"candidates": [
  {
    "content": {
      "role": "model",
      "parts": [
        {
          "text": "To"
        }
      ]
    }
  }
]
}
,
{
"candidates": [
  {
    "content": {
      "role": "model",
      "parts": [
        {
          "text": " delete a data store in AI Applications, you must first purge all data"
        }
      ]
    }
  }
]
}
,
{
"candidates": [
  {
    "content": {
      "role": "model",
      "parts": [
        {
          "text": " from the data store. "
        }
      ]
    }
  }
]
}
,
{
"candidates": [
  {
    "groundingMetadata": {
      "supportChunks": [
        {
          "source": "0",
          "sourceMetadata": {
            "uri": "https://vertexaisearch.cloud.google.com/grounding-api-redirect/{unique_string}",
            "domain": "cloud.google.com"
          }
        }
      ],
      "groundingSupport": [
        {
          "claimText": "To delete a data store in AI Applications, you must first purge all data from the data store. ",
          "supportChunkIndices": [
            0
          ]
        }
      ],
      "webSearchQueries": [
        "how to delete a data store in ai applications"
      ],
      "searchEntryPoint": {
        "renderedContent": "\u003cstyle\u003e\n.container {\n  align-items: center;\n  border-radius: 8px;\n  display: flex;\n  font-family: Google Sans, Roboto, sans-serif;\n  font-size: 14px;\n  line-height: 20px;\n  padding: 8px 12px;\n}\n.chip {\n  display: inline-block;\n  border: solid 1px;\n  border-radius: 16px;\n  min-width: 14px;\n  padding: 5px 16px;\n  text-align: center;\n  user-select: none;\n  margin: 0 8px;\n  -webkit-tap-highlight-color: transparent;\n}\n.carousel {\n  overflow: auto;\n  scrollbar-width: none;\n  white-space: nowrap;\n  margin-right: -12px;\n}\n.headline {\n  display: flex;\n  margin-right: 4px;\n}\n.gradient-container {\n  position: relative;\n}\n.gradient {\n  position: absolute;\n  transform: translate(3px, -9px);\n  height: 36px;\n  width: 9px;\n}\n@media (prefers-color-scheme: light) {\n  .container {\n    background-color: #fafafa;\n    box-shadow: 0 0 0 1px #0000000f;\n  }\n  .headline-label {\n    color: #1f1f1f;\n  }\n  .chip {\n    background-color: #ffffff;\n    border-color: #d2d2d2;\n    color: #5e5e5e;\n    text-decoration: none;\n  }\n  .chip:hover {\n    background-color: #f2f2f2;\n  }\n  .chip:focus {\n    background-color: #f2f2f2;\n  }\n  .chip:active {\n    background-color: #d8d8d8;\n    border-color: #b6b6b6;\n  }\n  .logo-dark {\n    display: none;\n  }\n  .gradient {\n    background: linear-gradient(90deg, #fafafa 15%, #fafafa00 100%);\n  }\n}\n@media (prefers-color-scheme: dark) {\n  .container {\n    background-color: #1f1f1f;\n    box-shadow: 0 0 0 1px #ffffff26;\n  }\n  .headline-label {\n    color: #fff;\n  }\n  .chip {\n    background-color: #2c2c2c;\n    border-color: #3c4043;\n    color: #fff;\n    text-decoration: none;\n  }\n  .chip:hover {\n    background-color: #353536;\n  }\n  .chip:focus {\n    background-color: #353536;\n  }\n  .chip:active {\n    background-color: #464849;\n    border-color: #53575b;\n  }\n  .logo-light {\n    display: none;\n  }\n  .gradient {\n    background: linear-gradient(90deg, #1f1f1f 15%, #1f1f1f00 100%);\n  }\n}\n\u003c/style\u003e\n\u003cdiv class=\"container\"\u003e\n  \u003cdiv class=\"headline\"\u003e\n    \u003csvg class=\"logo-light\" width=\"18\" height=\"18\" viewBox=\"9 9 35 35\" fill=\"none\" xmlns=\"http://www.w3.org/2000/svg\"\u003e\n      \u003cpath fill-rule=\"evenodd\" clip-rule=\"evenodd\" d=\"M42.8622 27.0064C42.8622 25.7839 42.7525 24.6084 42.5487 23.4799H26.3109V30.1568H35.5897C35.1821 32.3041 33.9596 34.1222 32.1258 35.3448V39.6864H37.7213C40.9814 36.677 42.8622 32.2571 42.8622 27.0064V27.0064Z\" fill=\"#4285F4\"/\u003e\n      \u003cpath fill-rule=\"evenodd\" clip-rule=\"evenodd\" d=\"M26.3109 43.8555C30.9659 43.8555 34.8687 42.3195 37.7213 39.6863L32.1258 35.3447C30.5898 36.3792 28.6306 37.0061 26.3109 37.0061C21.8282 37.0061 18.0195 33.9811 16.6559 29.906H10.9194V34.3573C13.7563 39.9841 19.5712 43.8555 26.3109 43.8555V43.8555Z\" fill=\"#34A853\"/\u003e\n      \u003cpath fill-rule=\"evenodd\" clip-rule=\"evenodd\" d=\"M16.6559 29.8904C16.3111 28.8559 16.1074 27.7588 16.1074 26.6146C16.1074 25.4704 16.3111 24.3733 16.6559 23.3388V18.8875H10.9194C9.74388 21.2072 9.06992 23.8247 9.06992 26.6146C9.06992 29.4045 9.74388 32.022 10.9194 34.3417L15.3864 30.8621L16.6559 29.8904V29.8904Z\" fill=\"#FBBC05\"/\u003e\n      \u003cpath fill-rule=\"evenodd\" clip-rule=\"evenodd\" d=\"M26.3109 16.2386C28.85 16.2386 31.107 17.1164 32.9095 18.8091L37.8466 13.8719C34.853 11.082 30.9659 9.3736 26.3109 9.3736C19.5712 9.3736 13.7563 13.245 10.9194 18.8875L16.6559 23.3388C18.0195 19.2636 21.8282 16.2386 26.3109 16.2386V16.2386Z\" fill=\"#EA4335\"/\u003e\n    \u003c/svg\u003e\n    \u003csvg class=\"logo-dark\" width=\"18\" height=\"18\" viewBox=\"0 0 48 48\" xmlns=\"http://www.w3.org/2000/svg\"\u003e\n      \u003ccircle cx=\"24\" cy=\"23\" fill=\"#FFF\" r=\"22\"/\u003e\n      \u003cpath d=\"M33.76 34.26c2.75-2.56 4.49-6.37 4.49-11.26 0-.89-.08-1.84-.29-3H24.01v5.99h8.03c-.4 2.02-1.5 3.56-3.07 4.56v.75l3.91 2.97h.88z\" fill=\"#4285F4\"/\u003e\n      \u003cpath d=\"M15.58 25.77A8.845 8.845 0 0 0 24 31.86c1.92 0 3.62-.46 4.97-1.31l4.79 3.71C31.14 36.7 27.65 38 24 38c-5.93 0-11.01-3.4-13.45-8.36l.17-1.01 4.06-2.85h.8z\" fill=\"#34A853\"/\u003e\n      \u003cpath d=\"M15.59 20.21a8.864 8.864 0 0 0 0 5.58l-5.03 3.86c-.98-2-1.53-4.25-1.53-6.64 0-2.39.55-4.64 1.53-6.64l1-.22 3.81 2.98.22 1.08z\" fill=\"#FBBC05\"/\u003e\n      \u003cpath d=\"M24 14.14c2.11 0 4.02.75 5.52 1.98l4.36-4.36C31.22 9.43 27.81 8 24 8c-5.93 0-11.01 3.4-13.45 8.36l5.03 3.85A8.86 8.86 0 0 1 24 14.14z\" fill=\"#EA4335\"/\u003e\n    \u003c/svg\u003e\n    \u003cdiv class=\"gradient-container\"\u003e\u003cdiv class=\"gradient\"\u003e\u003c/div\u003e\u003c/div\u003e\n  \u003c/div\u003e\n  \u003cdiv class=\"carousel\"\u003e\n    \u003ca class=\"chip\" href=\"https://www.google.com/search?q=how+to+delete+a+data+store+in+ai+applications&client=app-vertex-grounding&safesearch=active\"\u003ehow to delete a data store in ai applications\u003c/a\u003e\n  \u003c/div\u003e\n\u003c/div\u003e\n"
      }
    }
  }
]
}
,
{
"candidates": [
  {
    "content": {
      "role": "model",
      "parts": [
        {
          "text": "You can purge data from a data store"
        }
      ]
    }
  }
]
}
,
{
"candidates": [
  {
    "content": {
      "role": "model",
      "parts": [
        {
          "text": " using the Google Cloud console or the command line. "
        }
      ]
    }
  }
]
}
,
{
"candidates": [
  {
    "groundingMetadata": {
      "groundingSupport": [
        {
          "claimText": "You can purge data from a data store using the Google Cloud console or the command line. ",
          "supportChunkIndices": [
            0
          ]
        }
      ]
    }
  }
]
}
,
{
"candidates": [
  {
    "content": {
      "role": "model",
      "parts": [
        {
          "text": "Once the data is purged, you can delete the data store. "
        }
      ]
    }
  }
]
}
,
{
"candidates": [
  {
    "groundingMetadata": {
      "groundingSupport": [
        {
          "claimText": "Once the data is purged, you can delete the data store. ",
          "supportChunkIndices": [
            0
          ]
        }
      ]
    }
  }
]
}
,
{
"candidates": [
  {
    "content": {
      "role": "model",
      "parts": [
        {
          "text": "You cannot delete"
        }
      ]
    }
  }
]
}
,
{
"candidates": [
  {
    "content": {
      "role": "model",
      "parts": [
        {
          "text": " a data store that is connected to an app. "
        }
      ]
    }
  }
]
}
,
{
"candidates": [
  {
    "groundingMetadata": {
      "groundingSupport": [
        {
          "claimText": "You cannot delete a data store that is connected to an app. ",
          "supportChunkIndices": [
            0
          ]
        }
      ]
    }
  }
]
}
,
{
"candidates": [
  {
    "content": {
      "role": "model",
      "parts": [
        {
          "text": "You must first delete the app that the data store is connected to. "
        }
      ]
    }
  }
]
}
,
{
"candidates": [
  {
    "groundingMetadata": {
      "groundingSupport": [
        {
          "claimText": "You must first delete the app that the data store is connected to. ",
          "supportChunkIndices": [
            0
          ]
        }
      ]
    }
  }
]
}
,
{
"candidates": [
  {
    "content": {
      "role": "model",
      "parts": [
        {
          "text": "You also"
        }
      ]
    }
  }
]
}
,
{
"candidates": [
  {
    "content": {
      "role": "model",
      "parts": [
        {
          "text": " cannot delete a data store that is in the process of upgrading or downgrading. "
        }
      ]
    }
  }
]
}
,
{
"candidates": [
  {
    "groundingMetadata": {
      "groundingSupport": [
        {
          "claimText": "You also cannot delete a data store that is in the process of upgrading or downgrading. ",
          "supportChunkIndices": [
            0
          ]
        }
      ]
    }
  }
]
}
,
{
"candidates": [
  {
    "content": {
      "role": "model",
      "parts": [
        {
          "text": "You must wait for the upgrade or downgrade to complete before deleting the data store."
        }
      ]
    }
  }
]
}
,
{
"candidates": [
  {
    "content": {
      "role": "model",
      "parts": [
        {
          "text": " \n"
        }
      ]
    }
  }
]
}
,
{
"candidates": [
  {
    "groundingMetadata": {
      "groundingSupport": [
        {
          "claimText": "You must wait for the upgrade or downgrade to complete before deleting the data store. \n",
          "supportChunkIndices": [
            0
          ]
        }
      ]
    }
  }
]
}
]

支持的模型

以下模型支持接地：

Gemini 1.5 Pro（仅包含文本输入）
Gemini 1.5 Flash（仅包含文本输入）

如需详细了解这些 Gemini 模型，请参阅 Gemini 模型版本和生命周期。

调用 generateGroundedContent 方法时，您可以使用以下模型 ID：

模型 ID	自动更新
`default`	是
`gemini-1.5-flash`	是
`gemini-1.5-flash-001`	否
`gemini-1.5-flash-002`	否
`gemini-1.5-pro`	是
`gemini-1.5-pro-001`	否
`gemini-1.5-pro-002`	否

后续步骤

了解如何将接地生成方法与其他 RAG API 搭配使用，以从非结构化数据中生成接地回答。

使用 RAG 生成有依据的回答 使用集合让一切井井有条 根据您的偏好保存内容并对其进行分类。

术语

RAG 术语

输入数据

输出数据

在单个对话轮次中生成接地回答

以内嵌文本和 Vertex AI Search 数据存储区中的数据作为回答依据

REST

响应

以内嵌文本和 Vertex AI Search 为依据生成单轮回答的示例

REST

响应

Python

依托 Google 搜索生成接地的回答

动态检索

动态检索预测得分和阈值

依托 Google 搜索为回答接地

REST

响应

Python

基于 Google 搜索结果生成单轮回答的示例

REST

响应

在多个回合中生成有依据的回答

REST

响应

响应

多轮回答生成示例

REST

响应

响应

流式传输已接地的回答

REST

响应

Python

流式传输有依据的回答的示例

REST

响应

支持的模型

后续步骤

使用 RAG 生成有依据的回答