This page describes how to enable Vertex AI APIs on Google Distributed Cloud (GDC) air-gapped so you can start implementing Vertex AI capabilities.
Most tasks to enable or deactivate Vertex AI APIs require administrator access. If you lack the necessary permissions, ask your administrator to enable Vertex AI APIs on your behalf.
Vertex AI on Distributed Cloud includes various APIs that you must enable from the GDC console to let users from your organization interact with ML models and AI capabilities. To learn more about these APIs, see the following documentation:
- Generative AI: Learn about Generative AI capabilities and available models.
- Online Prediction: Learn about online predictions.
- Optical Character Recognition (OCR): Learn about character recognition features.
- Speech-to-Text: Learn about speech recognition features.
- Vertex AI Translation: Learn about translation features.
- Vertex AI Workbench: Learn about Vertex AI Workbench.
Use the GDC console to enable, deactivate, and view the endpoints of Vertex AI APIs.
Before you begin
To get the permissions that you need to enable APIs, ask your Organization IAM Admin or Project IAM Admin to grant you the AI Platform Admin (ai-platform-admin) role in the project namespace.
For information about this role, see Prepare IAM permissions. To learn how to grant permissions to a subject, see Grant and revoke access.
Follow these steps before enabling APIs:
- Set up the GDC domain name system (DNS). If you haven't set up the DNS, work with your Infrastructure Operator (IO) to complete this prerequisite.
- Set up a project to use Vertex AI.
If you plan to bring your own model to use with Online Prediction, or to run artificial intelligence (AI) and machine learning (ML) notebooks in a GPU environment, make sure to allocate GPU machines for the correct cluster types.
Enable pre-trained APIs
You can enable Vertex AI service APIs and models using the GDC console.
After meeting the prerequisites, follow these steps to enable APIs:
- Sign in to the GDC console. If you can't sign in, see Connect to an identity provider.
- In the navigation menu, click Vertex AI > Pre-trained APIs.
- On the Pre-trained APIs page, click Enable on a specific service to enable that API.
- In the confirmation dialog, click Enable. A progress message displays.
The enablement duration varies. It might take between 15 and 45 minutes to finish, depending on the state of the cluster.
To check the status of the pre-trained APIs, view the service status and endpoints.
The VAI-A0001 alert (Enabling State Time Limit Reached) triggers if the services take a long time to be enabled. In this case, your IO must review the VAI-R0001 runbook for details.
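If you track enablement from a script rather than watching the console, the 15 to 45 minute window suggests polling with an upper time limit. The following is a minimal sketch, not a GDC API: `get_status` is a hypothetical callable that you supply (for example, a wrapper around whatever status lookup your environment offers), and the `"Enabled"` status string is an assumption for illustration only.

```python
import time
from typing import Callable


def wait_until_enabled(
    get_status: Callable[[], str],  # hypothetical: returns the current service status
    timeout_s: float = 45 * 60,     # upper end of the documented 15-45 minute range
    poll_interval_s: float = 60.0,
    sleep: Callable[[float], None] = time.sleep,
    clock: Callable[[], float] = time.monotonic,
) -> bool:
    """Poll until the service reports "Enabled" or the time limit passes.

    Returns True if the service became enabled, False if the limit was
    reached first. The "Enabled" value is an assumed status string.
    """
    deadline = clock() + timeout_s
    while True:
        if get_status() == "Enabled":
            return True
        if clock() >= deadline:
            return False
        sleep(poll_interval_s)
```

A `False` result corresponds to the situation in which the enablement time limit alert is relevant, so the caller would escalate to the IO rather than retry indefinitely.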
Deactivate pre-trained APIs
You can deactivate pre-trained APIs using the GDC console.
After meeting the prerequisites, follow these steps to deactivate APIs:
- Sign in to the GDC console.
- In the navigation menu, click Vertex AI > Pre-trained APIs.
- On the Pre-trained APIs page, click Disable on a specific service to deactivate that API.
- In the confirmation dialog, enter disable in the text field to confirm that you want to take that action. Then, click Disable. A progress message displays.
To check the status of the pre-trained APIs, view the service status and endpoints.