This page explains how to ground a model's responses using Google Search, which uses publicly-available web data.
Grounding with Google Search
If you want to connect your model with world knowledge, a wide possible range of topics, or up-to-date information on the Internet, then use Grounding with Google Search.
Grounding with Google Search supports dynamic retrieval that gives you the option to generate grounded responses with Google Search. Therefore, the dynamic retrieval configuration evaluates whether a prompt requires knowledge about recent events and enables Grounding with Google Search. For more information, see Dynamic retrieval.
To learn more about model grounding in Vertex AI, see the Grounding overview.
Supported models
This section lists the models that support grounding with Search. To explore how each model generates grounded responses, follow these instructions:
Try a model listed in this table in the Google Cloud console.
Click the Grounding toggle to the on position.
Model | Description | Try a model |
---|---|---|
Gemini 2.5 Pro |
Text, code, images, audio, video, video with audio, PDF Doesn't support dynamic retrieval. For more information, see Considerations. |
Try the Gemini 2.5 Pro model |
Gemini 2.0 Flash |
Text, code, images, audio, video, video with audio, PDF Doesn't support dynamic retrieval. For more information, see Considerations. |
Try the Gemini 2.0 Flash model |
Gemini 2.0 Flash-Lite |
Text, code, images, audio, video, video with audio, PDF Doesn't support dynamic retrieval. For more information, see Considerations. |
Try the Gemini 2.0 Flash-Lite model |
Supported languages
For a listed of supported languages, see Languages.
Ground your model with Google Search
Use the following instructions to ground a model with publicly available web data.
Considerations
To use grounding with Google Search, you must enable Google Search Suggestions. See more at Google Search Suggestions.
For ideal results, use a temperature of
0.0
. To learn more about setting this config, see the Gemini API request body from the model reference.Grounding with Google Search has a limit of one million queries per day. If you require more queries, contact Google Cloud support for assistance.
Only the Gemini 1.0 and Gemini 1.5 models support dynamic retrieval. The Gemini 2.0 models don't support dynamic retrieval.
Gen AI SDK for Python
Install
pip install --upgrade google-genai
Set environment variables to use the Gen AI SDK with Vertex AI:
# Replace the `GOOGLE_CLOUD_PROJECT` and `GOOGLE_CLOUD_LOCATION` values # with appropriate values for your project. export GOOGLE_CLOUD_PROJECT=GOOGLE_CLOUD_PROJECT export GOOGLE_CLOUD_LOCATION=us-central1 export GOOGLE_GENAI_USE_VERTEXAI=True
Console
To use Grounding with Google Search with the Vertex AI Studio, follow these steps:
- Go to Vertex AI Studio to the Create prompt page.
- In the side panel, click the Grounding: Google toggle.
- Click Customize and set Google Search as the source.
- Enter your prompt in the text box and click Submit.
Your prompt responses now ground to Google Search.
REST
Before using any of the request data, make the following replacements:
- LOCATION: The region to process the request.
- PROJECT_ID: Your project ID.
- MODEL_ID: The model ID of the multimodal model. The Gemini 2.0 models don't support dynamic retrieval.
- TEXT: The text instructions to include in the prompt.
- DYNAMIC_THRESHOLD: An optional field to set the threshold
to invoke the dynamic retrieval configuration. It is a floating point
value in the range [0,1]. If you don't set the
dynamicThreshold
field, the threshold value defaults to 0.7.
HTTP method and URL:
POST https://LOCATION-aiplatform.googleapis.com/v1beta1/projects/PROJECT_ID/locations/LOCATION/publishers/google/models/MODEL_ID:generateContent
Request JSON body:
{ "contents": [{ "role": "user", "parts": [{ "text": "TEXT" }] }], "tools": [{ "googleSearchRetrieval": { "dynamicRetrievalConfig": { "mode": "MODE_DYNAMIC", "dynamicThreshold": DYNAMIC_THRESHOLD } } }], "model": "projects/PROJECT_ID/locations/LOCATION/publishers/google/models/MODEL_ID" }
To send your request, expand one of these options:
You should receive a JSON response similar to the following:
{ "candidates": [ { "content": { "role": "model", "parts": [ { "text": "Chicago weather changes rapidly, so layers let you adjust easily. Consider a base layer, a warm mid-layer (sweater-fleece), and a weatherproof outer layer." } ] }, "finishReason": "STOP", "safetyRatings":[ "..." ], "groundingMetadata": { "webSearchQueries": [ "What's the weather in Chicago this weekend?" ], "searchEntryPoint": { "renderedContent": "....................." } "groundingSupports": [ { "segment": { "startIndex": 0, "endIndex": 65, "text": "Chicago weather changes rapidly, so layers let you adjust easily." }, "groundingChunkIndices": [ 0 ], "confidenceScores": [ 0.99 ] }, ] "retrievalMetadata": { "webDynamicRetrievalScore": 0.96879 } } } ], "usageMetadata": { "..." } }
Understand your response
If your model prompt successfully grounds to Google Search from the Vertex AI Studio or from the API, then the responses include metadata with source links (web URLs). However, there are several reasons this metadata might not be provided, and the prompt response won't be grounded. These reasons include low source relevance or incomplete information within the model's response.
Grounded support
Displaying grounded support is highly recommended. They aid users in validating responses from publishers themselves and add avenues for further learning.
Grounded support for responses from Google Search sources should be shown both inline and in aggregate. See the following image as a suggestion on how to do this.
Use of alternative search engine options
Customer's use of Grounding with Google Search does not prevent Customer from offering alternative search engine options, making alternative search options the default option for Customer Applications, or displaying their own or third party search suggestions or search results in Customer Applications, provided that that any such non-Google Search services or associated results are displayed separately from the Grounded Results and Search Suggestions and can't reasonably be attributed to, or confused with results provided by, Google.
What's next
- To learn how to send chat prompt requests, see Multiturn chat.
- To learn about responsible AI best practices and Vertex AI's safety filters, see Safety best practices.
- To learn how to ground the PaLM models, see
Grounding in Vertex AI.