Starting April 29, 2025, Gemini 1.5 Pro and Gemini 1.5 Flash models are not available in projects that have no prior usage of these models, including new projects. For details, see Model versions and lifecycle.

Vertex AI GenAI API

Service: aiplatform.googleapis.com

The Service name aiplatform.googleapis.com is needed to create RPC client stubs.

`google.cloud.aiplatform.v1.EvaluationService`

Methods
`EvaluateDataset`	Evaluates a dataset based on a set of given metrics.
`EvaluateInstances`	Evaluates instances based on a given metric.

`google.cloud.aiplatform.v1.GenAiCacheConfigService`

Methods
`GetCacheConfig`	Gets a GenAI cache config.
`UpdateCacheConfig`	Updates a cache config.

`google.cloud.aiplatform.v1.GenAiCacheService`

Methods
`CreateCachedContent`	Creates cached content, this call will initialize the cached content in the data storage, and users need to pay for the cache data storage.
`DeleteCachedContent`	Deletes cached content
`GetCachedContent`	Gets cached content configurations
`ListCachedContents`	Lists cached contents in a project
`UpdateCachedContent`	Updates cached content configurations

`google.cloud.aiplatform.v1.GenAiTuningService`

Methods
`CancelTuningJob`	Cancels a TuningJob.
`CreateTuningJob`	Creates a TuningJob.
`GetTuningJob`	Gets a TuningJob.
`ListTuningJobs`	Lists TuningJobs in a Location.
`RebaseTunedModel`	Rebase a TunedModel.

`google.cloud.aiplatform.v1.LlmBidiService`

Methods
`BidiGenerateContent`	Bidirectional streaming predict.

`google.cloud.aiplatform.v1.PredictionService`

Methods
`ChatCompletions`	Exposes an OpenAI-compatible endpoint for chat completions.
`FetchPredictOperation`	Fetch an asynchronous online prediction operation.
`GenerateContent`	Generate content with multimodal inputs.
`Predict`	Perform an online prediction.
`PredictLongRunning`
`RawPredict`	Perform an online prediction with an arbitrary HTTP payload.
`ServerStreamingPredict`	Perform a server-side streaming online prediction request for Vertex LLM streaming.
`StreamDirectPredict`	Perform a streaming online prediction request to a gRPC model server for Vertex first-party products and frameworks.
`StreamDirectRawPredict`	Perform a streaming online prediction request to a gRPC model server for custom containers.
`StreamGenerateContent`	Generate content with multimodal inputs with streaming support.
`StreamRawPredict`	Perform a streaming online prediction with an arbitrary HTTP payload.
`StreamingPredict`	Perform a streaming online prediction request for Vertex first-party products and frameworks.
`StreamingRawPredict`	Perform a streaming online prediction request through gRPC.

`google.cloud.aiplatform.v1.ReasoningEngineExecutionService`

Methods
`QueryReasoningEngine`	Queries using a reasoning engine.
`StreamQueryReasoningEngine`	Streams queries using a reasoning engine.

`google.cloud.aiplatform.v1.ReasoningEngineService`

Methods
`CreateReasoningEngine`	Creates a reasoning engine.
`DeleteReasoningEngine`	Deletes a reasoning engine.
`GetReasoningEngine`	Gets a reasoning engine.
`ListReasoningEngines`	Lists reasoning engines in a location.
`UpdateReasoningEngine`	Updates a reasoning engine.

`google.cloud.aiplatform.v1.VertexRagDataService`

Methods
`CreateRagCorpus`	Creates a RagCorpus.
`DeleteRagCorpus`	Deletes a RagCorpus.
`DeleteRagFile`	Deletes a RagFile.
`GetRagCorpus`	Gets a RagCorpus.
`GetRagEngineConfig`	Gets a RagEngineConfig.
`GetRagFile`	Gets a RagFile.
`ImportRagFiles`	Import files from Google Cloud Storage or Google Drive into a RagCorpus.
`ListRagCorpora`	Lists RagCorpora in a Location.
`ListRagFiles`	Lists RagFiles in a RagCorpus.
`UpdateRagCorpus`	Updates a RagCorpus.
`UpdateRagEngineConfig`	Updates a RagEngineConfig.

`google.cloud.aiplatform.v1.VertexRagService`

Methods
`AugmentPrompt`	Given an input prompt, it returns augmented prompt from vertex rag store to guide LLM towards generating grounded responses.
`CorroborateContent`	Given an input text, it returns a score that evaluates the factuality of the text.
`RetrieveContexts`	Retrieves relevant contexts for a query.

`google.cloud.aiplatform.v1beta1.EvaluationService`

Methods
`EvaluateDataset`	Evaluates a dataset based on a set of given metrics.
`EvaluateInstances`	Evaluates instances based on a given metric.

`google.cloud.aiplatform.v1beta1.ExtensionExecutionService`

Methods
`ExecuteExtension`	Executes the request against a given extension.
`QueryExtension`	Queries an extension with a default controller.

`google.cloud.aiplatform.v1beta1.ExtensionRegistryService`

Methods
`DeleteExtension`	Deletes an Extension.
`GetExtension`	Gets an Extension.
`ImportExtension`	Imports an Extension.
`ListExtensions`	Lists Extensions in a location.
`UpdateExtension`	Updates an Extension.

`google.cloud.aiplatform.v1beta1.GenAiCacheConfigService`

Methods
`GetCacheConfig`	Gets a GenAI cache config.
`UpdateCacheConfig`	Updates a cache config.

`google.cloud.aiplatform.v1beta1.GenAiCacheService`

Methods
`CreateCachedContent`	Creates cached content, this call will initialize the cached content in the data storage, and users need to pay for the cache data storage.
`DeleteCachedContent`	Deletes cached content
`GetCachedContent`	Gets cached content configurations
`ListCachedContents`	Lists cached contents in a project
`UpdateCachedContent`	Updates cached content configurations

`google.cloud.aiplatform.v1beta1.GenAiTuningService`

Methods
`CancelTuningJob`	Cancels a TuningJob.
`CreateTuningJob`	Creates a TuningJob.
`GetTuningJob`	Gets a TuningJob.
`ListTuningJobs`	Lists TuningJobs in a Location.
`RebaseTunedModel`	Rebase a TunedModel.

`google.cloud.aiplatform.v1beta1.LlmBidiService`

Methods
`BidiGenerateContent`	Bidirectional streaming predict.

`google.cloud.aiplatform.v1beta1.MemoryBankService`

Methods
`CreateMemory`	Create a Memory.
`DeleteMemory`	Delete a Memory.
`GenerateMemories`	Generate memories.
`GetMemory`	Get a Memory.
`ListMemories`	List Memories.
`RetrieveMemories`	Retrieve memories.
`UpdateMemory`	Update a Memory.

`google.cloud.aiplatform.v1beta1.PredictionService`

Methods
`ChatCompletions`	Exposes an OpenAI-compatible endpoint for chat completions.
`CountTokens`	Perform a token counting.
`FetchPredictOperation`	Fetch an asynchronous online prediction operation.
`GenerateContent`	Generate content with multimodal inputs.
`Predict`	Perform an online prediction.
`PredictLongRunning`
`RawPredict`	Perform an online prediction with an arbitrary HTTP payload.
`ServerStreamingPredict`	Perform a server-side streaming online prediction request for Vertex LLM streaming.
`StreamDirectPredict`	Perform a streaming online prediction request to a gRPC model server for Vertex first-party products and frameworks.
`StreamDirectRawPredict`	Perform a streaming online prediction request to a gRPC model server for custom containers.
`StreamGenerateContent`	Generate content with multimodal inputs with streaming support.
`StreamRawPredict`	Perform a streaming online prediction with an arbitrary HTTP payload.
`StreamingPredict`	Perform a streaming online prediction request for Vertex first-party products and frameworks.
`StreamingRawPredict`	Perform a streaming online prediction request through gRPC.

`google.cloud.aiplatform.v1beta1.ReasoningEngineExecutionService`

Methods
`QueryReasoningEngine`	Queries using a reasoning engine.
`StreamQueryReasoningEngine`	Streams queries using a reasoning engine.

`google.cloud.aiplatform.v1beta1.ReasoningEngineService`

Methods
`CreateReasoningEngine`	Creates a reasoning engine.
`DeleteReasoningEngine`	Deletes a reasoning engine.
`GetReasoningEngine`	Gets a reasoning engine.
`ListReasoningEngines`	Lists reasoning engines in a location.
`UpdateReasoningEngine`	Updates a reasoning engine.

`google.cloud.aiplatform.v1beta1.SessionService`

Methods
`AppendEvent`	Appends an event to a given session.
`CreateSession`	Creates a new `Session`.
`DeleteSession`	Deletes details of the specific `Session`.
`GetSession`	Gets details of the specific `Session`.
`ListEvents`	Lists `Events` in a given session.
`ListSessions`	Lists `Sessions` in a given reasoning engine.
`UpdateSession`	Updates the specific `Session`.

`google.cloud.aiplatform.v1beta1.VertexRagDataService`

Methods
`CreateRagCorpus`	Creates a RagCorpus.
`DeleteRagCorpus`	Deletes a RagCorpus.
`DeleteRagFile`	Deletes a RagFile.
`GetRagCorpus`	Gets a RagCorpus.
`GetRagEngineConfig`	Gets a RagEngineConfig.
`GetRagFile`	Gets a RagFile.
`ImportRagFiles`	Import files from Google Cloud Storage or Google Drive into a RagCorpus.
`ListRagCorpora`	Lists RagCorpora in a Location.
`ListRagFiles`	Lists RagFiles in a RagCorpus.
`UpdateRagCorpus`	Updates a RagCorpus.
`UpdateRagEngineConfig`	Updates a RagEngineConfig.

`google.cloud.aiplatform.v1beta1.VertexRagService`

Methods
`AugmentPrompt`	Given an input prompt, it returns augmented prompt from vertex rag store to guide LLM towards generating grounded responses.
`CorroborateContent`	Given an input text, it returns a score that evaluates the factuality of the text.
`RetrieveContexts`	Retrieves relevant contexts for a query.

`google.iam.v1.IAMPolicy`

Methods
`GetIamPolicy`	Gets the access control policy for a resource.
`SetIamPolicy`	Sets the access control policy on the specified resource.
`TestIamPermissions`	Returns permissions that a caller has on the specified resource.

`google.longrunning.Operations`

Methods
`CancelOperation`	Starts asynchronous cancellation on a long-running operation.
`DeleteOperation`	Deletes a long-running operation.
`GetOperation`	Gets the latest state of a long-running operation.
`ListOperations`	Lists operations that match the specified filter in the request.
`WaitOperation`	Waits until the specified long-running operation is done or reaches at most a specified timeout, returning the latest state.

Vertex AI GenAI API Stay organized with collections Save and categorize content based on your preferences.

Service: aiplatform.googleapis.com

Vertex AI GenAI API