Overview
The Gemini Cloud Assist Investigations preview feature uses Gemini advanced capabilities to assist in creating and running Serverless for Apache Spark batch workloads. This feature analyzes failed and slow-running workloads to identify root causes and recommend fixes. It creates persistent analysis that you can review, save, and share with Google Cloud support to facilitate collaboration and accelerate issue resolution.
Features
Use this feature to create investigations from the Google Cloud console:
- Add a natural language context description to an issue before creating an investigation.
- Analyze failed and slow batch workloads.
- Get insights into issue root causes with recommended fixes.
- Create Google Cloud support cases with the full investigation context attached.
Before you begin
Enable the Gemini Cloud Assist API.
Roles required to enable APIs
To enable APIs, you need the Service Usage Admin IAM
role (roles/serviceusage.serviceUsageAdmin
), which
contains the serviceusage.services.enable
permission. Learn how to grant
roles.
Start an investigation
To start an investigation, do one of the following:
Option 1: In the Google Cloud console, go to the Batches List Page. For any batch with a
Failed
status, an INVESTIGATE button appears in the Insights by Gemini column. Click the button to start an investigation.Option 2: Open the Batch Details Page of the batch workload to investigate. For both
Succeeded
andFailed
batch workloads, in the Health overview section of the Summary tab, an INVESTIGATE button appears in the Insights by Gemini panel. Click the button to start an investigation.The investigation button text indicates the status of the investigation:
- INVESTIGATE: No investigation has been run for this batch. Click the button to start an investigation.
- VIEW INVESTIGATION: An investigation has been completed. Click the button ` to view the results.
- INVESTIGATING: An investigation is in progress.
View investigation results
Once an investigation is complete, the Investigation details page opens. This page contains the full Gemini analysis, which is organized into the following sections:
- Issue: A collapsed section containing auto-populated details of the batch workload being investigated
- Relevant Observations: A collapsed section that lists key data points and anomalies that Gemini found during its analysis of logs and metrics
- Hypotheses: This is the primary section, which is expanded by default.
It presents a list of potential root causes for the observed issue. Each hypothesis
includes:
- Overview: A description of the possible cause, such as "High Shuffle Write Time and Potential Task Skew."
- Recommended Fixes: A list of actionable steps to address the potential issue.
Take action
After reviewing the hypotheses and recommendations, you can take the following actions:
Apply one or more of the suggested fixes to the job configuration or code, and then rerun the job.
Provide feedback on the helpfulness of the investigation by clicking the thumbs-up or thumbs-down icons at the top of the panel.
Review and escalate investigations
The results of a previously run investigation can be reviewed by clicking the investigation name on the Cloud Assist Investigations page to open the Investigation details page.
If you need further assistance, you can open a Google Cloud support case. This process provides the support engineer with the complete context of the previously performed investigation, including the observations and hypotheses generated by Gemini. This context sharing significantly reduces the need for clarifying communications with the support team, and leads to faster case resolution.
To create a support case from an investigation, in the Investigation details page, click Request support.
Preview status and pricing
There is no charge for Gemini Cloud Assist investigations during public preview. Charges will apply to the feature when it becomes generally available (GA).
For more information about pricing after general availability, see Gemini Cloud Assist Pricing.