Store, index, and query vectors
The stock pgvector PostgreSQL
extension
extension is customized for AlloyDB, and referred to as vector.
It supports storing generated embeddings in a vector column. The extension also
adds support for scalar quantization feature to create IVF indexes. You can
also create an IVFFlat index or HSNW index that are available with stock
pgvector.
For more information about storing vectors, see Store vectors.
In addition to the customized vector extension, AlloyDB
includes the alloydb_scann extension that implements a highly efficient
nearest-neighbor index powered by the ScaNN
algorithm.
For more information about creating indexes and querying vectors, see Create indexes and query vectors.
Tune your vector query performance
You can tune your indexes for a balance between query-per-second (QPS) and recall with your queries. For more information about tuning your indexes, see Tune vector query performance.
Generate embeddings and text predictions
AlloyDB AI extends PostgreSQL syntax with two functions for
querying models using the google_ml_integration extension:
- Invoke predictions to call a model using SQL within a transaction. 
- Generate embeddings to have an LLM translate text prompts into numerical vectors. - You can use the - embedding()function to query Vertex AI models, while the- google_ml.embedding()function can be used to query registered Vertex AI, hosted, and third-party models.- You can then apply these vector embeddings as input to - pgvectorfunctions. This includes methods to compare and sort samples of text according to their relative semantic distance.
Use models in the cloud with Vertex AI
You can configure AlloyDB Omni to work with Vertex AI.
This gives your applications the following benefits:
- Your applications can invoke predictions using any model stored in the Vertex AI Model Garden that they have access to. 
- Your applications can generate embeddings using the - text-embeddingEnglish models LLM.