Vertex AI GenAI API

Service: aiplatform.googleapis.com

The service name aiplatform.googleapis.com is needed to create RPC client stubs.
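
As a minimal sketch of how the service name feeds into a stub, the snippet below builds the two strings a gRPC channel and call typically need: the channel target (host and port) and the fully qualified method path in the standard gRPC "/package.Service/Method" form. The helper names grpc_target and grpc_method_path are hypothetical, and port 443 is an assumption (the usual TLS port). Generated client libraries normally assemble these for you.

```python
# Sketch only: helper names are hypothetical, not part of any Google library.
SERVICE_NAME = "aiplatform.googleapis.com"


def grpc_target(service_name: str = SERVICE_NAME, port: int = 443) -> str:
    """Return the host:port string a gRPC channel would connect to.

    Port 443 is an assumption (standard TLS port).
    """
    return f"{service_name}:{port}"


def grpc_method_path(service: str, method: str) -> str:
    """Return the gRPC method path in the form /package.Service/Method."""
    return f"/{service}/{method}"


target = grpc_target()
path = grpc_method_path(
    "google.cloud.aiplatform.v1.PredictionService", "GenerateContent"
)
print(target)
print(path)
```

The service and method names passed in are taken from the listing in this reference; any other pair from the listing can be substituted the same way.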

google.cloud.aiplatform.v1.EvaluationService

Methods
EvaluateInstances Evaluates instances based on a given metric.

google.cloud.aiplatform.v1.GenAiTuningService

Methods
CancelTuningJob Cancels a TuningJob.
CreateTuningJob Creates a TuningJob.
GetTuningJob Gets a TuningJob.
ListTuningJobs Lists TuningJobs in a Location.
RebaseTunedModel Rebases a TunedModel.

google.cloud.aiplatform.v1.PredictionService

Methods
ChatCompletions Exposes an OpenAI-compatible endpoint for chat completions.
GenerateContent Generates content with multimodal inputs.
Predict Performs an online prediction.
ServerStreamingPredict Performs a server-side streaming online prediction request for Vertex LLM streaming.
StreamDirectPredict Performs a streaming online prediction request to a gRPC model server for Vertex first-party products and frameworks.
StreamDirectRawPredict Performs a streaming online prediction request to a gRPC model server for custom containers.
StreamGenerateContent Generates content with multimodal inputs, with streaming support.
StreamingPredict Performs a streaming online prediction request for Vertex first-party products and frameworks.
StreamingRawPredict Performs a streaming online prediction request through gRPC.

google.cloud.aiplatform.v1beta1.EvaluationService

Methods
EvaluateInstances Evaluates instances based on a given metric.

google.cloud.aiplatform.v1beta1.ExtensionExecutionService

Methods
ExecuteExtension Executes the request against a given extension.
QueryExtension Queries an extension with a default controller.

google.cloud.aiplatform.v1beta1.ExtensionRegistryService

Methods
DeleteExtension Deletes an Extension.
GetExtension Gets an Extension.
ImportExtension Imports an Extension.
ListExtensions Lists Extensions in a location.
UpdateExtension Updates an Extension.

google.cloud.aiplatform.v1beta1.GenAiCacheConfigService

Methods
GetCacheConfig Gets a GenAI cache config.
UpdateCacheConfig Updates a cache config.

google.cloud.aiplatform.v1beta1.GenAiCacheService

Methods
CreateCachedContent Creates cached content. This call initializes the cached content in data storage, and users are billed for the cache data storage.
DeleteCachedContent Deletes cached content.
GetCachedContent Gets cached content configurations.
ListCachedContents Lists cached contents in a project.
UpdateCachedContent Updates cached content configurations.

google.cloud.aiplatform.v1beta1.GenAiTuningService

Methods
CancelTuningJob Cancels a TuningJob.
CreateTuningJob Creates a TuningJob.
GetTuningJob Gets a TuningJob.
ListTuningJobs Lists TuningJobs in a Location.
RebaseTunedModel Rebases a TunedModel.

google.cloud.aiplatform.v1beta1.PredictionService

Methods
ChatCompletions Exposes an OpenAI-compatible endpoint for chat completions.
CountTokens Performs token counting.
GenerateContent Generates content with multimodal inputs.
Predict Performs an online prediction.
ServerStreamingPredict Performs a server-side streaming online prediction request for Vertex LLM streaming.
StreamDirectPredict Performs a streaming online prediction request to a gRPC model server for Vertex first-party products and frameworks.
StreamDirectRawPredict Performs a streaming online prediction request to a gRPC model server for custom containers.
StreamGenerateContent Generates content with multimodal inputs, with streaming support.
StreamingPredict Performs a streaming online prediction request for Vertex first-party products and frameworks.
StreamingRawPredict Performs a streaming online prediction request through gRPC.

google.cloud.aiplatform.v1beta1.ReasoningEngineExecutionService

Methods
QueryReasoningEngine Queries using a reasoning engine.

google.cloud.aiplatform.v1beta1.ReasoningEngineService

Methods
CreateReasoningEngine Creates a reasoning engine.
DeleteReasoningEngine Deletes a reasoning engine.
GetReasoningEngine Gets a reasoning engine.
ListReasoningEngines Lists reasoning engines in a location.
UpdateReasoningEngine Updates a reasoning engine.

google.cloud.aiplatform.v1beta1.VertexRagDataService

Methods
CreateRagCorpus Creates a RagCorpus.
DeleteRagCorpus Deletes a RagCorpus.
DeleteRagFile Deletes a RagFile.
GetRagCorpus Gets a RagCorpus.
GetRagFile Gets a RagFile.
ImportRagFiles Imports files from Google Cloud Storage or Google Drive into a RagCorpus.
ListRagCorpora Lists RagCorpora in a Location.
ListRagFiles Lists RagFiles in a RagCorpus.
UpdateRagCorpus Updates a RagCorpus.

google.cloud.aiplatform.v1beta1.VertexRagService

Methods
AugmentPrompt Given an input prompt, returns an augmented prompt from the Vertex RAG store to guide the LLM toward generating grounded responses.
CorroborateContent Given an input text, returns a score that evaluates the factuality of the text.
RetrieveContexts Retrieves relevant contexts for a query.

google.iam.v1.IAMPolicy

Methods
GetIamPolicy Gets the access control policy for a resource.
SetIamPolicy Sets the access control policy on the specified resource.
TestIamPermissions Returns permissions that a caller has on the specified resource.

google.longrunning.Operations

Methods
CancelOperation Starts asynchronous cancellation on a long-running operation.
DeleteOperation Deletes a long-running operation.
GetOperation Gets the latest state of a long-running operation.
ListOperations Lists operations that match the specified filter in the request.
WaitOperation Waits until the specified long-running operation is done or reaches at most a specified timeout, returning the latest state.
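
The operation methods above are often combined into a client-side polling loop. The following is a minimal sketch of that pattern, not the service's implementation: it assumes a caller-supplied get_operation callable (hypothetical) that returns a dict with a boolean "done" key, mirroring the done field of a long-running operation. Like WaitOperation, it returns the latest state once the operation is done or the timeout is reached.

```python
import time
from typing import Any, Callable, Dict

Operation = Dict[str, Any]  # assumed shape: {"name": str, "done": bool, ...}


def wait_for_operation(
    get_operation: Callable[[str], Operation],  # hypothetical fetch function
    name: str,
    timeout_s: float = 60.0,
    poll_interval_s: float = 1.0,
) -> Operation:
    """Poll until the operation is done or the timeout elapses.

    Returns the most recently fetched state in either case.
    """
    deadline = time.monotonic() + timeout_s
    op = get_operation(name)
    while not op.get("done", False) and time.monotonic() < deadline:
        time.sleep(poll_interval_s)
        op = get_operation(name)
    return op


def make_fake_backend(done_after_calls: int) -> Callable[[str], Operation]:
    """Build a stand-in for GetOperation that reports done after N calls."""
    calls = {"n": 0}

    def get_operation(name: str) -> Operation:
        calls["n"] += 1
        return {"name": name, "done": calls["n"] >= done_after_calls}

    return get_operation


# "operations/example" is a made-up operation name for illustration.
result = wait_for_operation(
    make_fake_backend(3), "operations/example", poll_interval_s=0.0
)
print(result["done"])
```

Passing the fetch function in as a parameter keeps the loop independent of any particular client library, so the same sketch can wrap whatever call actually retrieves the operation state.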