Get text embeddings

This page describes how to create a text embedding using the Text Embedding API.

Vertex AI supports text embeddings in Google Distributed Cloud (GDC) air-gapped through the Text Embedding API.

Text Embedding converts textual data written in any supported language into numerical vectors. These vector representations are designed to capture the semantic meaning and context of the words they represent. Text embedding models can generate optimized embeddings for various task types, such as document retrieval, questions and answers, classification, and fact verification for text.
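Because embeddings capture semantic meaning, texts with similar meanings produce vectors that are close together, typically measured with cosine similarity. The following sketch illustrates the idea with toy three-dimensional vectors standing in for real embeddings, which have hundreds of dimensions:

```python
import math

def cosine_similarity(a, b):
    # Cosine similarity: dot(a, b) / (|a| * |b|), ranging from -1 to 1.
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

# Toy vectors for illustration only; real embedding values come from the API.
cat = [0.9, 0.1, 0.0]
kitten = [0.8, 0.2, 0.1]
car = [0.1, 0.0, 0.9]

print(cosine_similarity(cat, kitten))  # near 1: similar meaning
print(cosine_similarity(cat, car))     # near 0: unrelated
```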

For more information about the key concepts that text embeddings use, see the related conceptual documentation.

Before you begin

Before using Text Embedding in a GDC project, follow these steps:

  1. Set up a project for Vertex AI.
  2. Choose one of the available models for text embeddings, depending on the language and task type.
  3. Enable the Text Embedding or Text Embedding Multilingual API, depending on the model you want to use.
  4. Grant a user or service account the appropriate access to Text Embedding or Text Embedding Multilingual. For more information, see the access documentation for the model you want to use.

  5. Install the Vertex AI client libraries.

  6. Get an authentication token.

You must use the same project for your model requests, the service account, and the IAM role binding.

Get text embeddings for a snippet of text

After meeting the prerequisites, you can use the Text Embedding or Text Embedding Multilingual models to get text embeddings for a snippet of text by using the API or the SDK for Python.

The following examples use the text-embedding-004 model.

You can either make a REST request to the Text Embedding API or interact with the model from a Python script to get a text embedding.

REST

To get text embeddings, send a POST request by specifying the model endpoint.

Follow these steps to make a request:

  1. Save your request content in a JSON file named request.json. The file must look like the following example:

    {
      "instances": [
        {
          "content": "What is life?",
          "task_type": "",
          "title": ""
        }
      ]
    }
    
  2. Make the request using the curl tool:

    curl -X POST \
    -H "Authorization: Bearer TOKEN" \
    -H "Content-Type: application/json; charset=utf-8" \
    -d @request.json \
    "https://ENDPOINT:443/v1/projects/PROJECT/locations/PROJECT/endpoints/MODEL:predict"
    

    Replace the following:

    • TOKEN: the authentication token you obtained.
    • ENDPOINT: the Text Embedding or Text Embedding Multilingual endpoint that you use for your organization. For more information, view service status and endpoints.
    • PROJECT: your project name.
    • MODEL: the model you want to use. The following are the available values:

      • endpoint-text-embedding for the Text Embedding model.
      • endpoint-text-embedding-multilingual for the Text Embedding Multilingual model.

You should receive a JSON response similar to the following:

{"predictions":[[-0.00668720435,3.20804138e-05,-0.0281705819,-0.00954890903,-0.0818724185,0.0150693133,-0.00677698106, …. ,0.0167487375,-0.0534791686,0.00208711182,0.032938987,-0.01491543]],"deployedModelId":"text-embedding","model":"models/text-embedding/1","modelDisplayName":"text-embedding","modelVersionId":"1"}
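For reference, the URL and headers used by the curl command can be assembled in Python as follows. This is a minimal sketch with placeholder values; sending the request still requires an HTTP client such as the third-party `requests` library:

```python
import json

def build_predict_request(endpoint, project, model, token):
    # Assemble the :predict URL and headers for the REST call shown above.
    # Note that the project name appears in both the projects and
    # locations path segments.
    url = (f"https://{endpoint}:443/v1/projects/{project}"
           f"/locations/{project}/endpoints/{model}:predict")
    headers = {
        "Authorization": f"Bearer {token}",
        "Content-Type": "application/json; charset=utf-8",
    }
    return url, headers

# Placeholder values; substitute your own as in the curl example.
url, headers = build_predict_request(
    "ENDPOINT", "my-project", "endpoint-text-embedding", "TOKEN")

# The payload matches the request.json content:
payload = {"instances": [{"content": "What is life?",
                          "task_type": "", "title": ""}]}

print(url)
# e.g. requests.post(url, headers=headers, json=payload)
```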

Python

Follow these steps to get text embeddings from a Python script:

  1. Install the Vertex AI Platform client library.

  2. Save your request content in a JSON file named request.json. The file must look like the following example:

    {
      "instances": [
        {
          "content": "What is life?",
          "task_type": "",
          "title": ""
        }
      ]
    }
    
  3. Install the required Python libraries:

    pip install absl-py
    
  4. Create a Python file named client.py. The file must look like the following example:

    import json
    import os
    from typing import Sequence
    
    import grpc
    from absl import app
    from absl import flags
    
    from google.protobuf import json_format
    from google.protobuf.struct_pb2 import Value
    from google.cloud.aiplatform_v1.services import prediction_service
    
    _INPUT = flags.DEFINE_string("input", None, "input", required=True)
    _HOST = flags.DEFINE_string("host", None, "Prediction endpoint", required=True)
    _ENDPOINT_ID = flags.DEFINE_string("endpoint_id", None, "endpoint id", required=True)
    _TOKEN = flags.DEFINE_string("token", None, "STS token", required=True)
    
    # ENDPOINT_RESOURCE_NAME is a placeholder value that doesn't affect prediction behavior.
    ENDPOINT_RESOURCE_NAME="projects/PROJECT/locations/PROJECT/endpoints/MODEL"
    
    os.environ["GRPC_DEFAULT_SSL_ROOTS_FILE_PATH"] = "CERT_NAME"
    
    # predict_client_secure builds a client that requires TLS
    def predict_client_secure(host):
      with open(os.environ["GRPC_DEFAULT_SSL_ROOTS_FILE_PATH"], 'rb') as f:
          creds = grpc.ssl_channel_credentials(f.read())
    
      channel_opts = ()
      channel_opts += (('grpc.ssl_target_name_override', host),)
      client = prediction_service.PredictionServiceClient(
          transport=prediction_service.transports.grpc.PredictionServiceGrpcTransport(
              channel=grpc.secure_channel(target=host+":443", credentials=creds, options=channel_opts)))
      return client
    
    def predict_func(client, instances, token):
      resp = client.predict(
          endpoint=ENDPOINT_RESOURCE_NAME,
          instances=instances,
          metadata=[ ("x-vertex-ai-endpoint-id", _ENDPOINT_ID.value), ("authorization", "Bearer " + token),])
      print(resp)
    
    def main(argv: Sequence[str]):
      del argv  # Unused.
      with open(_INPUT.value) as json_file:
          data = json.load(json_file)
          instances = [json_format.ParseDict(s, Value()) for s in data["instances"]]
    
      client = predict_client_secure(_HOST.value)
    
      predict_func(client=client, instances=instances, token=_TOKEN.value)
    
    if __name__=="__main__":
      app.run(main)
    

    Replace the following:

    • PROJECT: your project name.
    • MODEL: the model you want to use. The following are the available values:
      • endpoint-text-embedding for the Text Embedding model.
      • endpoint-text-embedding-multilingual for the Text Embedding Multilingual model.
    • CERT_NAME: the name of the Certificate Authority (CA) certificate file, such as org-1-trust-bundle-ca.cert. You only need this value if you are in a development environment. Otherwise, omit it.
  5. Send a request:

    python client.py --token=TOKEN --host=ENDPOINT --input=request.json --endpoint_id=MODEL
    

    Replace the following:

    • TOKEN: the authentication token you obtained.
    • ENDPOINT: the Text Embedding or Text Embedding Multilingual endpoint that you use for your organization. For more information, view service status and endpoints.
    • MODEL: the model you want to use. The following are the available values:

      • endpoint-text-embedding for the Text Embedding model.
      • endpoint-text-embedding-multilingual for the Text Embedding Multilingual model.

You should receive a JSON response similar to the following:

{"predictions":[[-0.00668720435,3.20804138e-05,-0.0281705819,-0.00954890903,-0.0818724185,0.0150693133,-0.00677698106, …. ,0.0167487375,-0.0534791686,0.00208711182,0.032938987,-0.01491543]],"deployedModelId":"text-embedding","model":"models/text-embedding/1","modelDisplayName":"text-embedding","modelVersionId":"1"}
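The `predictions` field of the response holds one embedding vector per input instance. The following sketch parses a response of that shape, using a hypothetical three-dimensional vector in place of the real, much longer one:

```python
import json

# A truncated response of the shape shown above (hypothetical values).
response_text = (
    '{"predictions":[[-0.0066872,0.0000321,-0.0281706]],'
    '"deployedModelId":"text-embedding",'
    '"model":"models/text-embedding/1",'
    '"modelDisplayName":"text-embedding","modelVersionId":"1"}'
)

response = json.loads(response_text)
# One vector per entry in the request's "instances" array.
embedding = response["predictions"][0]
print(len(embedding), embedding[0])
```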