Starting April 29, 2025, Gemini 1.5 Pro and Gemini 1.5 Flash models are not available in projects that have no prior usage of these models, including new projects. For details, see Model versions and lifecycle.
Stay organized with collections
Save and categorize content based on your preferences.
This page introduces you to RagManagedDb, its underlying technology, and how
RagManagedDb is used in Vertex AI RAG Engine. In addition, this page
describes the different tiers that are available to tune performance, which
might impact your costs, and provides instructions for deleting your
Vertex AI RAG Engine data, which stops billing.
Overview
Vertex AI RAG Engine uses RagManagedDb, which is an enterprise-ready,
fully-managed Google Spanner instance that's used for resource storage
by Vertex AI RAG Engine and is optionally available to be used as
the vector database of
choice for your RAG corpora.
Through Spanner, Vertex AI RAG Engine offers a
consistent, highly available, and highly scalable database to support your
application. To learn more about Google Spanner, see
Spanner.
Vertex AI RAG Engine stores your RAG corpus and RAG file resource
metadata in RagManagedDb, regardless of your choice of vector database. Vector
databases are only used for storage and retrieval of embeddings. In addition to
resource storage, RagManagedDb can also be used to store and manage vector
representations of your documents. The vector database is then used to retrieve
relevant documents based on the document's semantic similarity to a given query.
Manage tiers
Vertex AI RAG Engine lets you scale your RagManagedDb instance based
on your usage and performance requirements using a choice of two tiers, and
optionally, lets you delete your Vertex AI RAG Engine data using
a third tier.
The tier is a project-level setting that's available in the RagEngineConfig
resource that impacts RAG corpora using RagManagedDb. The following tiers
are available in RagEngineConfig:
Scaled tier: This tier offers production-scale performance along with
autoscaling functionality. It's suitable for customers with large amounts of
data or performance-sensitive workloads. Internally, this tier sets the
Spanner instance to autoscaling configuration with a minimum
of 1 node (1,000 processing units) and a maximum of 10 nodes (10,000
processing units).
Basic tier (default): This tier offers a cost-effective and low-compute
tier, which might be suitable for some of the following cases:
Experimenting with RagManagedDb.
Small data size.
Latency-insensitive workload.
Use Vertex AI RAG Engine with only other vector databases.
To offer the Basic tier, RagManagedDb sets the underlying
Spanner instance to a fixed configuration of 100 processing
units, which is equivalent to 0.1 nodes.
Unprovisioned tier: This tier deletes the RagManagedDb and its
underlying Spanner instance. The Unprovisioned tier disables
the Vertex AI RAG Engine service and deletes your data held
within this service regardless of the vector database used for your
RagCorpora. This stops the billing of the service. For more information on
billing, see Vertex AI RAG Engine
billing.
After the data is deleted, the data can't be recovered. To start usingVertex AI RAG Engine again, you must update the tier by
calling the UpdateRagEngineConfig API.
Get the project configuration
The following code samples demonstrate how to use the GetRagEngineConfig API
for each type of tier:
To learn how to use the RAG API v1, the default, see RAG API
v1.
To learn how to use the RAG API v1beta1, see RAG API
v1beta1.
To learn more about RagManagedDb and how to manage your tier configuration
as well as the RAG corpus-level retrieval strategy, see Use RagManagedDb with
Vertex AI RAG Engine.
[[["Easy to understand","easyToUnderstand","thumb-up"],["Solved my problem","solvedMyProblem","thumb-up"],["Other","otherUp","thumb-up"]],[["Hard to understand","hardToUnderstand","thumb-down"],["Incorrect information or sample code","incorrectInformationOrSampleCode","thumb-down"],["Missing the information/samples I need","missingTheInformationSamplesINeed","thumb-down"],["Other","otherDown","thumb-down"]],["Last updated 2025-08-29 UTC."],[],[],null,["# Understanding RagManagedDb\n\n| The [VPC-SC security controls](/vertex-ai/generative-ai/docs/security-controls) and\n| CMEK are supported by Vertex AI RAG Engine. Data residency and AXT security controls aren't\n| supported.\n\nThis page introduces you to `RagManagedDb`, its underlying technology, and how\n`RagManagedDb` is used in Vertex AI RAG Engine. In addition, this page\ndescribes the different tiers that are available to tune performance, which\nmight impact your costs, and provides instructions for deleting your\nVertex AI RAG Engine data, which stops billing.\n\nOverview\n--------\n\nVertex AI RAG Engine uses `RagManagedDb`, which is an enterprise-ready,\nfully-managed Google Spanner instance that's used for resource storage\nby Vertex AI RAG Engine and is optionally available to be used as\nthe [vector database of\nchoice](/vertex-ai/generative-ai/docs/rag-engine/use-ragmanageddb-with-rag) for your RAG corpora.\n\nThrough Spanner, Vertex AI RAG Engine offers a\nconsistent, highly available, and highly scalable database to support your\napplication. To learn more about Google Spanner, see\n[Spanner](/spanner).\n\nVertex AI RAG Engine stores your RAG corpus and RAG file resource\nmetadata in `RagManagedDb`, regardless of your choice of vector database. Vector\ndatabases are only used for storage and retrieval of embeddings. In addition to\nresource storage, `RagManagedDb` can also be used to store and manage vector\nrepresentations of your documents. The vector database is then used to retrieve\nrelevant documents based on the document's semantic similarity to a given query.\n\nManage tiers\n------------\n\nVertex AI RAG Engine lets you scale your `RagManagedDb` instance based\non your usage and performance requirements using a choice of two tiers, and\noptionally, lets you delete your Vertex AI RAG Engine data using\na third tier.\n\nThe tier is a project-level setting that's available in the `RagEngineConfig`\nresource that impacts RAG corpora using `RagManagedDb`. The following tiers\nare available in `RagEngineConfig`:\n\n- **Scaled tier**: This tier offers production-scale performance along with\n autoscaling functionality. It's suitable for customers with large amounts of\n data or performance-sensitive workloads. Internally, this tier sets the\n Spanner instance to autoscaling configuration with a minimum\n of 1 node (1,000 processing units) and a maximum of 10 nodes (10,000\n processing units).\n\n- **Basic tier (default)**: This tier offers a cost-effective and low-compute\n tier, which might be suitable for some of the following cases:\n\n - Experimenting with `RagManagedDb`.\n - Small data size.\n - Latency-insensitive workload.\n - Use Vertex AI RAG Engine with only other vector databases.\n\n To offer the Basic tier, `RagManagedDb` sets the underlying\n Spanner instance to a fixed configuration of 100 processing\n units, which is equivalent to 0.1 nodes.\n- **Unprovisioned tier** : This tier deletes the `RagManagedDb` and its\n underlying Spanner instance. The Unprovisioned tier disables\n the Vertex AI RAG Engine service and deletes your data held\n within this service regardless of the vector database used for your\n `RagCorpora`. This stops the billing of the service. For more information on\n billing, see [Vertex AI RAG Engine\n billing](/vertex-ai/generative-ai/docs/rag-engine/rag_engine_billing).\n\n After the data is deleted, the data can't be recovered. To start usingVertex AI RAG Engine again, you must update the tier by\n calling the `UpdateRagEngineConfig` API.\n\n| **Note:** The Enterprise tier from the `v1beta1` version was renamed to the Scaled tier.\n\nGet the project configuration\n-----------------------------\n\nThe following code samples demonstrate how to use the `GetRagEngineConfig` API\nfor each type of tier:\n\n- [Version 1\n (v1)](/vertex-ai/generative-ai/docs/model-reference/rag-api-v1#get_project_configuration) API\n code samples.\n\n- [v1beta1](/vertex-ai/generative-ai/docs/model-reference/rag-api#get-project-config-for-rag) API\n code samples.\n\nUpdate the project configuration\n--------------------------------\n\nThe following code samples demonstrate how to use the `UpdateRagEngineConfig`\nAPI for each type of tier:\n\n- [Version 1\n (v1)](/vertex-ai/generative-ai/docs/model-reference/rag-api-v1#update_project_configuration)\n API code samples.\n\n- [v1beta1](/vertex-ai/generative-ai/docs/model-reference/rag-api#update-project-config-for-rag)\n API code samples.\n\nWhat's next\n-----------\n\n- To learn how to use the RAG API v1, the default, see [RAG API\n v1](/vertex-ai/generative-ai/docs/model-reference/rag-api-v1).\n- To learn how to use the RAG API v1beta1, see [RAG API\n v1beta1](/vertex-ai/generative-ai/docs/model-reference/rag-api).\n- To learn more about `RagManagedDb` and how to manage your tier configuration as well as the RAG corpus-level retrieval strategy, see [Use RagManagedDb with\n Vertex AI RAG Engine](/vertex-ai/generative-ai/docs/rag-engine/use-ragmanageddb-with-rag)."]]