Package google.cloud.aiplatform.v1

Index

EvaluationService

Vertex AI Online Evaluation Service.

EvaluateInstances

rpc EvaluateInstances(EvaluateInstancesRequest) returns (EvaluateInstancesResponse)

Evaluates instances based on a given metric.

GenAiTuningService

A service for creating and managing GenAI Tuning Jobs.

CancelTuningJob

rpc CancelTuningJob(CancelTuningJobRequest) returns (Empty)

Cancels a TuningJob. Starts asynchronous cancellation on the TuningJob. The server makes a best effort to cancel the job, but success is not guaranteed. Clients can use GenAiTuningService.GetTuningJob or other methods to check whether the cancellation succeeded or whether the job completed despite cancellation. On successful cancellation, the TuningJob is not deleted; instead it becomes a job with a TuningJob.error value with a google.rpc.Status.code of 1, corresponding to Code.CANCELLED, and TuningJob.state is set to CANCELLED.

IAM Permissions

Requires the following IAM permission on the name resource:

  • aiplatform.tuningJobs.cancel

For more information, see the IAM documentation.
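The cancellation semantics above can be sketched in plain Python: building the resource name documented by CancelTuningJobRequest, and checking the fields a successfully cancelled job ends up with. This is an illustration only; the actual RPC would be issued through a generated client such as google.cloud.aiplatform_v1.GenAiTuningServiceClient, and the `JOB_STATE_CANCELLED` enum string is assumed here (the text above says only that state is set to CANCELLED).

```python
def tuning_job_name(project: str, location: str, tuning_job: str) -> str:
    """Resource name format documented by CancelTuningJobRequest.name."""
    return f"projects/{project}/locations/{location}/tuningJobs/{tuning_job}"


def is_cancelled(job: dict) -> bool:
    """A successfully cancelled TuningJob is not deleted; its state becomes
    CANCELLED and its error carries google.rpc.Code.CANCELLED (code 1)."""
    return (job.get("state") == "JOB_STATE_CANCELLED"
            and job.get("error", {}).get("code") == 1)


name = tuning_job_name("my-project", "us-central1", "123")
# → "projects/my-project/locations/us-central1/tuningJobs/123"
```

Because cancellation is best-effort and asynchronous, a client would poll GetTuningJob with this name until `is_cancelled` holds or the job reaches a terminal state on its own.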

CreateTuningJob

rpc CreateTuningJob(CreateTuningJobRequest) returns (TuningJob)

Creates a TuningJob. A newly created TuningJob is scheduled to run immediately.

IAM Permissions

Requires the following IAM permission on the parent resource:

  • aiplatform.tuningJobs.create

For more information, see the IAM documentation.

GetTuningJob

rpc GetTuningJob(GetTuningJobRequest) returns (TuningJob)

Gets a TuningJob.

IAM Permissions

Requires the following IAM permission on the name resource:

  • aiplatform.tuningJobs.get

For more information, see the IAM documentation.

ListTuningJobs

rpc ListTuningJobs(ListTuningJobsRequest) returns (ListTuningJobsResponse)

Lists TuningJobs in a Location.

IAM Permissions

Requires the following IAM permission on the parent resource:

  • aiplatform.tuningJobs.list

For more information, see the IAM documentation.

RebaseTunedModel

rpc RebaseTunedModel(RebaseTunedModelRequest) returns (Operation)

Rebases a TunedModel.

IAM Permissions

Requires the following IAM permission on the parent resource:

  • aiplatform.tuningJobs.create

For more information, see the IAM documentation.

PredictionService

A service for online predictions and explanations.

ChatCompletions

rpc ChatCompletions(ChatCompletionsRequest) returns (HttpBody)

Exposes an OpenAI-compatible endpoint for chat completions.

IAM Permissions

Requires the following IAM permission on the endpoint resource:

  • aiplatform.endpoints.predict

For more information, see the IAM documentation.

GenerateContent

rpc GenerateContent(GenerateContentRequest) returns (GenerateContentResponse)

Generate content with multimodal inputs.

IAM Permissions

Requires the following IAM permission on the model resource:

  • aiplatform.endpoints.predict

For more information, see the IAM documentation.

Predict

rpc Predict(PredictRequest) returns (PredictResponse)

Perform an online prediction.

IAM Permissions

Requires the following IAM permission on the endpoint resource:

  • aiplatform.endpoints.predict

For more information, see the IAM documentation.

ServerStreamingPredict

rpc ServerStreamingPredict(StreamingPredictRequest) returns (StreamingPredictResponse)

Perform a server-side streaming online prediction request for Vertex LLM streaming.

IAM Permissions

Requires the following IAM permission on the endpoint resource:

  • aiplatform.endpoints.predict

For more information, see the IAM documentation.

StreamDirectPredict

rpc StreamDirectPredict(StreamDirectPredictRequest) returns (StreamDirectPredictResponse)

Perform a streaming online prediction request to a gRPC model server for Vertex first-party products and frameworks.

IAM Permissions

Requires the following IAM permission on the endpoint resource:

  • aiplatform.endpoints.predict

For more information, see the IAM documentation.

StreamDirectRawPredict

rpc StreamDirectRawPredict(StreamDirectRawPredictRequest) returns (StreamDirectRawPredictResponse)

Perform a streaming online prediction request to a gRPC model server for custom containers.

IAM Permissions

Requires the following IAM permission on the endpoint resource:

  • aiplatform.endpoints.predict

For more information, see the IAM documentation.

StreamGenerateContent

rpc StreamGenerateContent(GenerateContentRequest) returns (GenerateContentResponse)

Generate content with multimodal inputs with streaming support.

IAM Permissions

Requires the following IAM permission on the model resource:

  • aiplatform.endpoints.predict

For more information, see the IAM documentation.

StreamingPredict

rpc StreamingPredict(StreamingPredictRequest) returns (StreamingPredictResponse)

Perform a streaming online prediction request for Vertex first-party products and frameworks.

IAM Permissions

Requires the following IAM permission on the endpoint resource:

  • aiplatform.endpoints.predict

For more information, see the IAM documentation.

StreamingRawPredict

rpc StreamingRawPredict(StreamingRawPredictRequest) returns (StreamingRawPredictResponse)

Perform a streaming online prediction request through gRPC.

IAM Permissions

Requires the following IAM permission on the endpoint resource:

  • aiplatform.endpoints.predict

For more information, see the IAM documentation.

BleuInput

Input for bleu metric.

Fields
metric_spec BleuSpec

Required. Spec for bleu score metric.

instances[] BleuInstance

Required. Repeated bleu instances.

BleuInstance

Spec for bleu instance.

Fields
prediction string

Required. Output of the evaluated model.

reference string

Required. Ground truth used to compare against the prediction.

BleuMetricValue

Bleu metric value for an instance.

Fields
score float

Output only. Bleu score.

BleuResults

Results for bleu metric.

Fields
bleu_metric_values[] BleuMetricValue

Output only. Bleu metric values.

BleuSpec

Spec for bleu score metric - calculates the precision of n-grams in the prediction as compared to the reference - returns a score ranging from 0 to 1.

Fields
use_effective_order bool

Optional. Whether to use effective order when computing the bleu score.

Blob

Content blob.

When possible, send content as text directly rather than as raw bytes.

Fields
mime_type string

Required. The IANA standard MIME type of the source data.

data bytes

Required. Raw bytes.

CancelTuningJobRequest

Request message for GenAiTuningService.CancelTuningJob.

Fields
name string

Required. The name of the TuningJob to cancel. Format: projects/{project}/locations/{location}/tuningJobs/{tuning_job}

Candidate

A response candidate generated from the model.

Fields
index int32

Output only. Index of the candidate.

content Content

Output only. Content parts of the candidate.

avg_logprobs double

Output only. Average log probability score of the candidate.

logprobs_result LogprobsResult

Output only. Log-likelihood scores for the response tokens and top tokens.

finish_reason FinishReason

Output only. The reason why the model stopped generating tokens. If empty, the model has not stopped generating the tokens.

safety_ratings[] SafetyRating

Output only. List of ratings for the safety of a response candidate.

There is at most one rating per category.

citation_metadata CitationMetadata

Output only. Source attribution of the generated content.

grounding_metadata GroundingMetadata

Output only. Metadata specifying the sources used to ground generated content.

finish_message string

Output only. Describes in more detail the reason the model stopped generating tokens. This is only populated when finish_reason is set.

FinishReason

The reason why the model stopped generating tokens. If empty, the model has not stopped generating the tokens.

Enums
FINISH_REASON_UNSPECIFIED The finish reason is unspecified.
STOP Token generation reached a natural stopping point or a configured stop sequence.
MAX_TOKENS Token generation reached the configured maximum output tokens.
SAFETY Token generation stopped because the content potentially contains safety violations. NOTE: When streaming, content is empty if content filters block the output.
RECITATION Token generation stopped because the content potentially contains copyright violations.
OTHER All other reasons that stopped the token generation.
BLOCKLIST Token generation stopped because the content contains forbidden terms.
PROHIBITED_CONTENT Token generation stopped for potentially containing prohibited content.
SPII Token generation stopped because the content potentially contains Sensitive Personally Identifiable Information (SPII).
MALFORMED_FUNCTION_CALL The function call generated by the model is invalid.

ChatCompletionsRequest

Request message for [PredictionService.ChatCompletions]

Fields
endpoint string

Required. The name of the endpoint requested to serve the prediction. Format: projects/{project}/locations/{location}/endpoints/{endpoint}

http_body HttpBody

Optional. The prediction input. Supports HTTP headers and arbitrary data payload.

Citation

Source attributions for content.

Fields
start_index int32

Output only. Start index into the content.

end_index int32

Output only. End index into the content.

uri string

Output only. URL reference of the attribution.

title string

Output only. Title of the attribution.

license string

Output only. License of the attribution.

publication_date Date

Output only. Publication date of the attribution.

CitationMetadata

A collection of source attributions for a piece of content.

Fields
citations[] Citation

Output only. List of citations.

CoherenceInput

Input for coherence metric.

Fields
metric_spec CoherenceSpec

Required. Spec for coherence score metric.

instance CoherenceInstance

Required. Coherence instance.

CoherenceInstance

Spec for coherence instance.

Fields
prediction string

Required. Output of the evaluated model.

CoherenceResult

Spec for coherence result.

Fields
explanation string

Output only. Explanation for coherence score.

score float

Output only. Coherence score.

confidence float

Output only. Confidence for coherence score.

CoherenceSpec

Spec for coherence score metric.

Fields
version int32

Optional. Which version to use for evaluation.

Content

The base structured datatype containing multi-part content of a message.

A Content includes a role field designating the producer of the Content and a parts field containing the multi-part data of the message turn.

Fields
role string

Optional. The producer of the content. Must be either 'user' or 'model'.

Useful to set for multi-turn conversations, otherwise can be left blank or unset.

parts[] Part

Required. Ordered Parts that constitute a single message. Parts may have different IANA MIME types.
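The role and parts fields above combine into a conversation history; a minimal sketch of a multi-turn `contents` list in REST/JSON form, with illustrative text values:

```python
# Each Content carries a producer role ('user' or 'model') and ordered parts.
contents = [
    {"role": "user", "parts": [{"text": "What is the capital of France?"}]},
    {"role": "model", "parts": [{"text": "The capital of France is Paris."}]},
    {"role": "user", "parts": [{"text": "And its population?"}]},
]

# In a multi-turn conversation the two producers alternate.
roles = [c["role"] for c in contents]
assert all(r in ("user", "model") for r in roles)
```

For a single-turn query, the list would contain just one user Content; role can then be left unset.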

CreateTuningJobRequest

Request message for GenAiTuningService.CreateTuningJob.

Fields
parent string

Required. The resource name of the Location to create the TuningJob in. Format: projects/{project}/locations/{location}

tuning_job TuningJob

Required. The TuningJob to create.

DynamicRetrievalConfig

Describes the options to customize dynamic retrieval.

Fields
mode Mode

The mode of the predictor to be used in dynamic retrieval.

dynamic_threshold float

Optional. The threshold to be used in dynamic retrieval. If not set, a system default value is used.

Mode

The mode of the predictor to be used in dynamic retrieval.

Enums
MODE_UNSPECIFIED Always trigger retrieval.
MODE_DYNAMIC Run retrieval only when system decides it is necessary.

EncryptionSpec

Represents a customer-managed encryption key spec that can be applied to a top-level resource.

Fields
kms_key_name string

Required. The Cloud KMS resource identifier of the customer managed encryption key used to protect a resource. Has the form: projects/my-project/locations/my-region/keyRings/my-kr/cryptoKeys/my-key. The key needs to be in the same region as where the compute resource is created.

EvaluateInstancesRequest

Request message for EvaluationService.EvaluateInstances.

Fields
location string

Required. The resource name of the Location to evaluate the instances. Format: projects/{project}/locations/{location}

Union field metric_inputs. Instances and specs for evaluation metric_inputs can be only one of the following:
exact_match_input ExactMatchInput

Auto metric instances. Instances and metric spec for exact match metric.

bleu_input BleuInput

Instances and metric spec for bleu metric.

rouge_input RougeInput

Instances and metric spec for rouge metric.

fluency_input FluencyInput

LLM-based metric instance. General text generation metrics, applicable to other categories. Input for fluency metric.

coherence_input CoherenceInput

Input for coherence metric.

safety_input SafetyInput

Input for safety metric.

groundedness_input GroundednessInput

Input for groundedness metric.

fulfillment_input FulfillmentInput

Input for fulfillment metric.

summarization_quality_input SummarizationQualityInput

Input for summarization quality metric.

pairwise_summarization_quality_input PairwiseSummarizationQualityInput

Input for pairwise summarization quality metric.

summarization_helpfulness_input SummarizationHelpfulnessInput

Input for summarization helpfulness metric.

summarization_verbosity_input SummarizationVerbosityInput

Input for summarization verbosity metric.

question_answering_quality_input QuestionAnsweringQualityInput

Input for question answering quality metric.

pairwise_question_answering_quality_input PairwiseQuestionAnsweringQualityInput

Input for pairwise question answering quality metric.

question_answering_relevance_input QuestionAnsweringRelevanceInput

Input for question answering relevance metric.

question_answering_helpfulness_input QuestionAnsweringHelpfulnessInput

Input for question answering helpfulness metric.

question_answering_correctness_input QuestionAnsweringCorrectnessInput

Input for question answering correctness metric.

pointwise_metric_input PointwiseMetricInput

Input for pointwise metric.

pairwise_metric_input PairwiseMetricInput

Input for pairwise metric.

tool_call_valid_input ToolCallValidInput

Tool call metric instances. Input for tool call valid metric.

tool_name_match_input ToolNameMatchInput

Input for tool name match metric.

tool_parameter_key_match_input ToolParameterKeyMatchInput

Input for tool parameter key match metric.

tool_parameter_kv_match_input ToolParameterKVMatchInput

Input for tool parameter key value match metric.
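Since metric_inputs is a union, a request carries exactly one metric at a time. A sketch of the request message fields in JSON form using the bleu_input member; the location path and instance strings are illustrative:

```python
# EvaluateInstancesRequest with the bleu_input member of the
# metric_inputs union set.
request = {
    "location": "projects/my-project/locations/us-central1",
    "bleuInput": {
        "metricSpec": {"useEffectiveOrder": True},
        "instances": [
            {"prediction": "the cat sat on the mat",
             "reference": "a cat sat on the mat"},
        ],
    },
}

# Exactly one member of the metric_inputs union may be set per request.
union_members = [k for k in request if k.endswith("Input")]
assert len(union_members) == 1
```

The response mirrors this shape: a bleu_input request yields a bleu_results member with one BleuMetricValue per instance, in order.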

EvaluateInstancesResponse

Response message for EvaluationService.EvaluateInstances.

Fields
Union field evaluation_results. Evaluation results are served in the same order as the instances presented in the request. evaluation_results can be only one of the following:
exact_match_results ExactMatchResults

Auto metric evaluation results. Results for exact match metric.

bleu_results BleuResults

Results for bleu metric.

rouge_results RougeResults

Results for rouge metric.

fluency_result FluencyResult

LLM-based metric evaluation result. General text generation metrics, applicable to other categories. Result for fluency metric.

coherence_result CoherenceResult

Result for coherence metric.

safety_result SafetyResult

Result for safety metric.

groundedness_result GroundednessResult

Result for groundedness metric.

fulfillment_result FulfillmentResult

Result for fulfillment metric.

summarization_quality_result SummarizationQualityResult

Summarization only metrics. Result for summarization quality metric.

pairwise_summarization_quality_result PairwiseSummarizationQualityResult

Result for pairwise summarization quality metric.

summarization_helpfulness_result SummarizationHelpfulnessResult

Result for summarization helpfulness metric.

summarization_verbosity_result SummarizationVerbosityResult

Result for summarization verbosity metric.

question_answering_quality_result QuestionAnsweringQualityResult

Question answering only metrics. Result for question answering quality metric.

pairwise_question_answering_quality_result PairwiseQuestionAnsweringQualityResult

Result for pairwise question answering quality metric.

question_answering_relevance_result QuestionAnsweringRelevanceResult

Result for question answering relevance metric.

question_answering_helpfulness_result QuestionAnsweringHelpfulnessResult

Result for question answering helpfulness metric.

question_answering_correctness_result QuestionAnsweringCorrectnessResult

Result for question answering correctness metric.

pointwise_metric_result PointwiseMetricResult

Generic metrics. Result for pointwise metric.

pairwise_metric_result PairwiseMetricResult

Result for pairwise metric.

tool_call_valid_results ToolCallValidResults

Tool call metrics. Results for tool call valid metric.

tool_name_match_results ToolNameMatchResults

Results for tool name match metric.

tool_parameter_key_match_results ToolParameterKeyMatchResults

Results for tool parameter key match metric.

tool_parameter_kv_match_results ToolParameterKVMatchResults

Results for tool parameter key value match metric.

ExactMatchInput

Input for exact match metric.

Fields
metric_spec ExactMatchSpec

Required. Spec for exact match metric.

instances[] ExactMatchInstance

Required. Repeated exact match instances.

ExactMatchInstance

Spec for exact match instance.

Fields
prediction string

Required. Output of the evaluated model.

reference string

Required. Ground truth used to compare against the prediction.

ExactMatchMetricValue

Exact match metric value for an instance.

Fields
score float

Output only. Exact match score.

ExactMatchResults

Results for exact match metric.

Fields
exact_match_metric_values[] ExactMatchMetricValue

Output only. Exact match metric values.

ExactMatchSpec

This type has no fields.

Spec for exact match metric - returns 1 if prediction and reference match exactly, otherwise 0.

FileData

URI based data.

Fields
mime_type string

Required. The IANA standard MIME type of the source data.

file_uri string

Required. URI.

FluencyInput

Input for fluency metric.

Fields
metric_spec FluencySpec

Required. Spec for fluency score metric.

instance FluencyInstance

Required. Fluency instance.

FluencyInstance

Spec for fluency instance.

Fields
prediction string

Required. Output of the evaluated model.

FluencyResult

Spec for fluency result.

Fields
explanation string

Output only. Explanation for fluency score.

score float

Output only. Fluency score.

confidence float

Output only. Confidence for fluency score.

FluencySpec

Spec for fluency score metric.

Fields
version int32

Optional. Which version to use for evaluation.

FulfillmentInput

Input for fulfillment metric.

Fields
metric_spec FulfillmentSpec

Required. Spec for fulfillment score metric.

instance FulfillmentInstance

Required. Fulfillment instance.

FulfillmentInstance

Spec for fulfillment instance.

Fields
prediction string

Required. Output of the evaluated model.

instruction string

Required. Inference instruction prompt to compare prediction with.

FulfillmentResult

Spec for fulfillment result.

Fields
explanation string

Output only. Explanation for fulfillment score.

score float

Output only. Fulfillment score.

confidence float

Output only. Confidence for fulfillment score.

FulfillmentSpec

Spec for fulfillment metric.

Fields
version int32

Optional. Which version to use for evaluation.

FunctionCall

A predicted [FunctionCall] returned from the model that contains a string representing the [FunctionDeclaration.name] and a structured JSON object containing the parameters and their values.

Fields
name string

Required. The name of the function to call. Matches [FunctionDeclaration.name].

args Struct

Optional. The function parameters and values in JSON object format. See [FunctionDeclaration.parameters] for parameter details.

FunctionCallingConfig

Function calling config.

Fields
mode Mode

Optional. Function calling mode.

allowed_function_names[] string

Optional. Function names to call. Only set when the Mode is ANY. Function names should match [FunctionDeclaration.name]. With mode set to ANY, model will predict a function call from the set of function names provided.

Mode

Function calling mode.

Enums
MODE_UNSPECIFIED Unspecified function calling mode. This value should not be used.
AUTO Default model behavior, model decides to predict either function calls or natural language response.
ANY Model is constrained to always predicting function calls only. If "allowed_function_names" are set, the predicted function calls will be limited to any one of "allowed_function_names", else the predicted function calls will be any one of the provided "function_declarations".
NONE Model will not predict any function calls. Model behavior is same as when not passing any function declarations.
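A ToolConfig using mode ANY can be sketched in REST/JSON form; the function name is illustrative:

```python
# FunctionCallingConfig with mode ANY: the model is constrained to emit a
# function call, and allowed_function_names narrows it to the listed names.
tool_config = {
    "functionCallingConfig": {
        "mode": "ANY",
        # Only honored when mode is ANY.
        "allowedFunctionNames": ["get_current_weather"],
    }
}

assert tool_config["functionCallingConfig"]["mode"] == "ANY"
```

With mode AUTO (the default), allowedFunctionNames would be omitted and the model decides between a function call and a natural-language response.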

FunctionDeclaration

Structured representation of a function declaration as defined by the OpenAPI 3.0 specification. Included in this declaration are the function name and parameters. This FunctionDeclaration is a representation of a block of code that can be used as a Tool by the model and executed by the client.

Fields
name string

Required. The name of the function to call. Must start with a letter or an underscore, and must contain only characters a-z, A-Z, 0-9, underscores, dots, or dashes, with a maximum length of 64.

description string

Optional. Description and purpose of the function. Model uses it to decide how and whether to call the function.

parameters Schema

Optional. Describes the parameters to this function in JSON Schema Object format. Reflects the Open API 3.03 Parameter Object. Key: the name of the parameter (string). Parameter names are case sensitive. Value: the Schema defining the type used for the parameter. For a function with no parameters, this can be left unset. Parameter names must start with a letter or an underscore and must contain only characters a-z, A-Z, 0-9, or underscores, with a maximum length of 64. Example with one required and one optional parameter:

type: OBJECT
properties:
  param1:
    type: STRING
  param2:
    type: INTEGER
required:
  - param1
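The example in the field description corresponds to this Schema object in JSON form (parameter names are illustrative):

```python
# FunctionDeclaration.parameters as a JSON Schema object with one required
# parameter (param1) and one optional parameter (param2).
parameters = {
    "type": "OBJECT",
    "properties": {
        "param1": {"type": "STRING"},
        "param2": {"type": "INTEGER"},
    },
    "required": ["param1"],
}

# Every required name must be a declared property.
assert set(parameters["required"]) <= set(parameters["properties"])
```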

response Schema

Optional. Describes the output from this function in JSON Schema format. Reflects the Open API 3.03 Response Object. The Schema defines the type used for the response value of the function.

FunctionResponse

The result output of a [FunctionCall]: a string representing the [FunctionDeclaration.name] and a structured JSON object containing any output from the function, used as context for the model. This should contain the result of a [FunctionCall] made based on model prediction.

Fields
name string

Required. The name of the function to call. Matches [FunctionDeclaration.name] and [FunctionCall.name].

response Struct

Required. The function response in JSON object format. Use "output" key to specify function output and "error" key to specify error details (if any). If "output" and "error" keys are not specified, then whole "response" is treated as function output.
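A sketch of the round trip in JSON form: a FunctionCall emitted by the model and the matching FunctionResponse sent back. Per the field docs, the response payload should use an "output" key (and optionally "error"); the function name and values are illustrative.

```python
# FunctionCall predicted by the model.
function_call = {"name": "get_current_weather", "args": {"city": "Paris"}}

# FunctionResponse returned by the client after executing the call.
function_response = {
    "name": function_call["name"],  # must match FunctionCall.name
    "response": {"output": {"temperature_c": 18}},
}
```

If neither "output" nor "error" is present, the whole "response" object is treated as the function output.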

GcsDestination

The Google Cloud Storage location where the output is to be written to.

Fields
output_uri_prefix string

Required. Google Cloud Storage URI of the output directory. If the URI doesn't end with '/', a '/' is automatically appended. The directory is created if it doesn't exist.

GenerateContentRequest

Request message for [PredictionService.GenerateContent].

Fields
model string

Required. The fully qualified name of the publisher model or tuned model endpoint to use.

Publisher model format: projects/{project}/locations/{location}/publishers/*/models/*

Tuned model endpoint format: projects/{project}/locations/{location}/endpoints/{endpoint}

contents[] Content

Required. The content of the current conversation with the model.

For single-turn queries, this is a single instance. For multi-turn queries, this is a repeated field that contains conversation history + latest request.

tools[] Tool

Optional. A list of Tools the model may use to generate the next response.

A Tool is a piece of code that enables the system to interact with external systems to perform an action, or set of actions, outside of knowledge and scope of the model.

tool_config ToolConfig

Optional. Tool config. This config is shared for all tools provided in the request.

labels map<string, string>

Optional. The labels with user-defined metadata for the request. It is used for billing and reporting only.

Label keys and values can be no longer than 63 characters (Unicode codepoints) and can only contain lowercase letters, numeric characters, underscores, and dashes. International characters are allowed. Label values are optional. Label keys must start with a letter.
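A partial client-side check of the documented label-key rules can be sketched as follows; it covers only length and the leading-letter requirement, not the full character-class rules (lowercase letters, digits, underscores, dashes, plus international characters):

```python
def label_key_plausible(key: str) -> bool:
    """Partial validation: <= 63 characters and starts with a letter."""
    return 0 < len(key) <= 63 and key[:1].isalpha()


assert label_key_plausible("team")
assert not label_key_plausible("1team")   # must start with a letter
assert not label_key_plausible("x" * 64)  # too long
```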

safety_settings[] SafetySetting

Optional. Per request settings for blocking unsafe content. Enforced on GenerateContentResponse.candidates.

generation_config GenerationConfig

Optional. Generation config.

system_instruction Content

Optional. User-provided system instructions for the model. Note: only text should be used in parts, and content in each part will be in a separate paragraph.
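The fields above combine into a request body; a minimal sketch in JSON form, with an illustrative publisher-model path and values:

```python
# GenerateContentRequest body with model target, system instruction,
# conversation contents, and generation config.
request = {
    "model": ("projects/my-project/locations/us-central1"
              "/publishers/google/models/example-model"),
    "systemInstruction": {"parts": [{"text": "Answer concisely."}]},
    "contents": [
        {"role": "user", "parts": [{"text": "Hello"}]},
    ],
    "generationConfig": {"temperature": 0.2, "maxOutputTokens": 256},
}
```

Only model and contents are required; the remaining fields are optional and may be omitted.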

GenerateContentResponse

Response message for [PredictionService.GenerateContent].

Fields
candidates[] Candidate

Output only. Generated candidates.

model_version string

Output only. The model version used to generate the response.

prompt_feedback PromptFeedback

Output only. Content filter results for a prompt sent in the request. Note: Sent only in the first stream chunk. Only happens when no candidates were generated due to content violations.

usage_metadata UsageMetadata

Usage metadata about the response(s).

PromptFeedback

Content filter results for a prompt sent in the request.

Fields
block_reason BlockedReason

Output only. Blocked reason.

safety_ratings[] SafetyRating

Output only. Safety ratings.

block_reason_message string

Output only. A readable block reason message.

BlockedReason

Blocked reason enumeration.

Enums
BLOCKED_REASON_UNSPECIFIED Unspecified blocked reason.
SAFETY Candidates blocked due to safety.
OTHER Candidates blocked due to other reason.
BLOCKLIST Candidates blocked due to the terms which are included from the terminology blocklist.
PROHIBITED_CONTENT Candidates blocked due to prohibited content.

UsageMetadata

Usage metadata about response(s).

Fields
prompt_token_count int32

Number of tokens in the request. When cached_content is set, this is still the total effective prompt size, meaning it includes the number of tokens in the cached content.

candidates_token_count int32

Number of tokens in the response(s).

total_token_count int32

Total token count for prompt and response candidates.

GenerationConfig

Generation config.

Fields
stop_sequences[] string

Optional. Stop sequences.

response_mime_type string

Optional. Output response MIME type of the generated candidate text. Supported MIME types:

  • text/plain: (default) Text output.
  • application/json: JSON response in the candidates.

The model needs to be prompted to output the appropriate response type; otherwise the behavior is undefined. This is a preview feature.

temperature float

Optional. Controls the randomness of predictions.

top_p float

Optional. If specified, nucleus sampling will be used.

top_k float

Optional. If specified, top-k sampling will be used.

candidate_count int32

Optional. Number of candidates to generate.

max_output_tokens int32

Optional. The maximum number of output tokens to generate per message.

response_logprobs bool

Optional. If true, export the logprobs results in response.

logprobs int32