Gemini for Google Cloud and responsible AI
This document describes how Gemini for Google Cloud is designed in view of the
capabilities, limitations, and risks that are associated with generative AI.
Capabilities and risks of large language models
Large language models (LLMs) can perform many useful tasks such as the
following:
Translate language.
Summarize text.
Generate code and creative writing.
Power chatbots and virtual assistants.
Complement search engines and recommendation systems.
At the same time, the evolving technical capabilities of LLMs create the
potential for misapplication, misuse, and unintended or unforeseen consequences.
LLMs can generate output that you don't expect, including text that's offensive,
insensitive, or factually incorrect. Because LLMs are incredibly versatile, it
can be difficult to predict exactly what kinds of unintended or unforeseen
outputs they might produce.
Given these risks and complexities, Gemini for Google Cloud is designed with
Google's AI principles in
mind. However, to work with it safely and responsibly, it's important for users to
understand some of the limitations of Gemini for Google Cloud.
Gemini for Google Cloud limitations
The limitations that you might encounter when using Gemini
for Google Cloud include, but aren't limited to, the following:
Edge cases. Edge cases refer to unusual, rare, or exceptional situations
that aren't well represented in the training data. These cases can lead to
limitations in the output of Gemini models, such as model
overconfidence, misinterpretation of context, or inappropriate outputs.
Model hallucinations, grounding, and factuality. Gemini
models might lack grounding and factuality in real-world knowledge, physical
properties, or accurate understanding. This limitation can lead to model
hallucinations, where Gemini for Google Cloud might
generate outputs that are plausible-sounding but factually incorrect,
irrelevant, inappropriate, or nonsensical. Hallucinations can also include
fabricating links to web pages that don't exist and have never existed. For
more information, see
Write better prompts for Gemini for Google Cloud.
Data quality and tuning. The quality, accuracy, and bias of the prompt
data that's entered into Gemini for Google Cloud
products can have a significant impact on their performance. If users enter
inaccurate or incorrect prompts, Gemini for Google Cloud
might return suboptimal or false responses.
Bias amplification. Language models can inadvertently amplify existing
biases in their training data, leading to outputs that might further reinforce
societal prejudices and unequal treatment of certain groups.
Language quality. While Gemini for Google Cloud
yields impressive multilingual capabilities on the benchmarks that we
evaluated against, the majority of our benchmarks (including all of the
fairness evaluations) are in American English.
Language models might provide inconsistent service quality to different users.
For example, text generation might not be as effective for some dialects or
language varieties because they are underrepresented in the training data.
Performance might be worse for non-English languages or English language
varieties with less representation.
Fairness benchmarks and subgroups. Google Research's fairness analyses of
Gemini models don't provide an exhaustive account of the various
potential risks. For example, we focus on biases along gender, race,
ethnicity, and religion axes, but perform the analysis only on the American
English language data and model outputs.
Limited domain expertise. Gemini models have been trained
on Google Cloud technology, but they might lack the depth of knowledge
that's required to provide accurate and detailed responses on highly
specialized or technical topics, leading to superficial or incorrect
information.
When you use the Gemini pane in the Google Cloud console,
Gemini doesn't have context about your specific environment, so
it can't answer questions such as "When was the last time I created a VM?"
In some cases, Gemini for Google Cloud sends a specific
segment of your context to the model to receive a context-specific
response—for example, when you click the Troubleshooting suggestions
button in the Error Reporting service page.
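Because Gemini can't answer environment-specific questions like the VM example above, you need to look that information up from your own project data. The following sketch shows one illustrative way to do so with the Cloud Logging client library for Python; it isn't part of Gemini for Google Cloud, and the project ID and log filter are assumptions that you would adapt to your environment.

```python
# Illustrative sketch only: query Cloud Audit Logs for the most recent
# VM creation, a question that the Gemini pane can't answer for you.
# The project ID and filter below are placeholder assumptions.
from google.cloud import logging

client = logging.Client(project="your-project-id")  # assumed project ID

# Admin Activity audit logs record compute.instances.insert calls.
log_filter = (
    'resource.type="gce_instance" '
    'AND protoPayload.methodName:"compute.instances.insert" '
    'AND logName:"cloudaudit.googleapis.com%2Factivity"'
)

# List entries newest first and stop after the first match.
for entry in client.list_entries(filter_=log_filter, order_by=logging.DESCENDING):
    print(f"Last VM creation logged at {entry.timestamp}")
    break
else:
    print("No VM creation entries found in the log retention window.")
```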
Gemini safety and toxicity filtering
Gemini for Google Cloud prompts and responses are checked
against a comprehensive list of safety attributes as applicable for each use
case. These safety attributes aim to filter out content that violates our
Acceptable Use Policy. If an output is considered
harmful, the response is blocked.
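This filtering is built into Gemini for Google Cloud and isn't something that you configure. As an illustration of the same idea in your own applications, the following sketch uses the Vertex AI Gemini API (a separate product) to set safety thresholds and detect when a response is blocked; the model name, project ID, region, and thresholds are assumptions, not settings used by Gemini for Google Cloud.

```python
# Illustrative sketch only: configuring safety filtering when calling the
# Gemini API through Vertex AI in your own application. Gemini for
# Google Cloud applies its own built-in filtering that you don't configure.
import vertexai
from vertexai.generative_models import (
    GenerativeModel,
    HarmBlockThreshold,
    HarmCategory,
)

# Placeholder project and region.
vertexai.init(project="your-project-id", location="us-central1")

model = GenerativeModel("gemini-1.5-flash")  # assumed model name

response = model.generate_content(
    "Summarize the error messages in my build log.",
    safety_settings={
        HarmCategory.HARM_CATEGORY_HARASSMENT: HarmBlockThreshold.BLOCK_LOW_AND_ABOVE,
        HarmCategory.HARM_CATEGORY_DANGEROUS_CONTENT: HarmBlockThreshold.BLOCK_LOW_AND_ABOVE,
    },
)

if not response.candidates:
    # The prompt itself was blocked before a response was generated.
    print("Prompt blocked by safety filters.")
elif response.candidates[0].finish_reason.name == "SAFETY":
    # The generated response was blocked as harmful.
    print("Response blocked by safety filters.")
else:
    print(response.candidates[0].text)
```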
[[["Easy to understand","easyToUnderstand","thumb-up"],["Solved my problem","solvedMyProblem","thumb-up"],["Other","otherUp","thumb-up"]],[["Hard to understand","hardToUnderstand","thumb-down"],["Incorrect information or sample code","incorrectInformationOrSampleCode","thumb-down"],["Missing the information/samples I need","missingTheInformationSamplesINeed","thumb-down"],["Other","otherDown","thumb-down"]],["Last updated 2025-08-29 UTC."],[[["\u003cp\u003eGemini for Google Cloud is designed with Google's AI principles to leverage the capabilities of large language models while mitigating potential risks like generating factually incorrect or inappropriate content.\u003c/p\u003e\n"],["\u003cp\u003eLimitations of Gemini for Google Cloud include encountering edge cases, generating outputs that are factually incorrect, and being sensitive to the quality and bias of the prompt data entered by users.\u003c/p\u003e\n"],["\u003cp\u003eGemini models can amplify biases from their training data, and have varying language quality based on the prevalence of a language or dialect in the training data, with a focus on fairness evaluations in American English.\u003c/p\u003e\n"],["\u003cp\u003eThe models can lack depth in highly specialized domains, providing superficial information, and it is not context aware of specific user environments in the Google Cloud console, limiting its ability to answer environment-specific questions.\u003c/p\u003e\n"],["\u003cp\u003eGemini for Google Cloud incorporates safety and toxicity filtering to block harmful content, ensuring responses align with Google's Acceptable Use Policy.\u003c/p\u003e\n"]]],[],null,["# Gemini for Google Cloud and responsible AI\n\nThis document describes how Gemini for Google Cloud is designed in view of the\ncapabilities, limitations, and risks that are associated with generative AI.\n\nCapabilities and risks of large language models\n-----------------------------------------------\n\nLarge language models (LLMs) can perform many useful tasks such as the\nfollowing:\n\n- Translate language.\n- Summarize text.\n- Generate code and creative writing.\n- Power chatbots and virtual assistants.\n- Complement search engines and recommendation systems.\n\nAt the same time, the evolving technical capabilities of LLMs create the\npotential for misapplication, misuse, and unintended or unforeseen consequences.\n\nLLMs can generate output that you don't expect, including text that's offensive,\ninsensitive, or factually incorrect. Because LLMs are incredibly versatile, it\ncan be difficult to predict exactly what kinds of unintended or unforeseen\noutputs they might produce.\n\nGiven these risks and complexities, Gemini for Google Cloud is designed with\n[Google's AI principles](https://ai.google/responsibility/principles/) in\nmind. However, it's important for users to understand some of the limitations of\nGemini for Google Cloud to work safely and responsibly.\n\nGemini for Google Cloud limitations\n-----------------------------------\n\nSome of the limitations that you might encounter using Gemini\nfor Google Cloud include (but aren't limited to) the following:\n\n- **Edge cases.** Edge cases refer to unusual, rare, or exceptional situations\n that aren't well represented in the training data. 
These cases can lead to\n limitations in the output of Gemini models, such as model\n overconfidence, misinterpretation of context, or inappropriate outputs.\n\n- **Model hallucinations, grounding, and factuality.** Gemini\n models might lack grounding and factuality in real-world knowledge, physical\n properties, or accurate understanding. This limitation can lead to model\n hallucinations, where Gemini for Google Cloud might\n generate outputs that are plausible-sounding but factually incorrect,\n irrelevant, inappropriate, or nonsensical. Hallucinations can also include\n fabricating links to web pages that don't exist and have never existed. For\n more information, see\n [Write better prompts for Gemini for Google Cloud](/gemini/docs/discover/write-prompts).\n\n- **Data quality and tuning.** The quality, accuracy, and bias of the prompt\n data that's entered into Gemini for Google Cloud\n products can have a significant impact on its performance. If users enter\n inaccurate or incorrect prompts, Gemini for Google Cloud\n might return suboptimal or false responses.\n\n- **Bias amplification.** Language models can inadvertently amplify existing\n biases in their training data, leading to outputs that might further reinforce\n societal prejudices and unequal treatment of certain groups.\n\n- **Language quality.** While Gemini for Google Cloud\n yields impressive multilingual capabilities on the benchmarks that we\n evaluated against, the majority of our benchmarks (including all of the\n fairness evaluations) are in American English.\n\n Language models might provide inconsistent service quality to different users.\n For example, text generation might not be as effective for some dialects or\n language varieties because they are underrepresented in the training data.\n Performance might be worse for non-English languages or English language\n varieties with less representation.\n- **Fairness benchmarks and subgroups.** Google Research's fairness analyses of\n Gemini models don't provide an exhaustive account of the various\n potential risks. For example, we focus on biases along gender, race,\n ethnicity, and religion axes, but perform the analysis only on the American\n English language data and model outputs.\n\n- **Limited domain expertise.** Gemini models have been trained\n on Google Cloud technology, but it might lack the depth of knowledge\n that's required to provide accurate and detailed responses on highly\n specialized or technical topics, leading to superficial or incorrect\n information.\n\n When you use the **Gemini** pane in the Google Cloud console,\n Gemini is not context aware of your specific environment, so\n it cannot answer questions such as \"When was the last time I created a VM?\"\n\n In some cases, Gemini for Google Cloud sends a specific\n segment of your context to the model to receive a context-specific\n response---for example, when you click the **Troubleshooting suggestions**\n button in the Error Reporting service page.\n\nGemini safety and toxicity filtering\n------------------------------------\n\nGemini for Google Cloud prompts and responses are checked\nagainst a comprehensive list of safety attributes as applicable for each use\ncase. These safety attributes aim to filter out content that violates our\n[Acceptable Use Policy](https://cloud.google.com/terms/aup). 
If an output is considered\nharmful, the response will be blocked.\n\nWhat's next\n-----------\n\n- Learn more about [how Gemini cites sources when helps you generate code](/gemini/docs/discover/works#how-when-gemini-cites-sources)."]]