Monitor instances and operations

Cloud Monitoring automatically collects and stores information about your Managed Lustre instance.

This document provides a detailed overview of the metrics available for monitoring your Managed Lustre instances on Google Cloud. These metrics help you understand the performance, capacity, and health of your Managed Lustre file systems, so you can identify bottlenecks, troubleshoot issues, and optimize resource utilization.

You can use these metrics in Cloud Monitoring to create custom dashboards, set up alerts, and gain deeper insights into your Managed Lustre instance's behavior.

Cloud Monitoring is automatically enabled for Managed Lustre. There's no charge for the collection of data or to view metrics in the Google Cloud console. API calls may incur charges; see Cloud Monitoring pricing for pricing details.

Required IAM role

You must have the Monitoring Viewer (roles/monitoring.viewer) role, or equivalent permissions, to view metrics in Cloud Monitoring.

Learn how to grant an IAM role.

View metrics

Cloud Monitoring metrics are available from two locations in the Google Cloud console:

  • The Managed Lustre instance details page displays available metrics. In addition to the metrics listed below, it computes the bandwidth of bytes copied and the rate of objects copied.

  • The Cloud Monitoring page provides multiple chart options and customizations.

View metrics on the instance details page

To view a specific instance's metrics:

  1. Go to the Instances page in the Google Cloud console.

    Go to Instances

  2. Click the instance for which to view metrics. The Instance details page appears.

  3. Click the Monitoring tab. The default dashboard is displayed.

View metrics in Cloud Monitoring

To view Managed Lustre metrics in Cloud Monitoring, do the following:

  1. Go to the Metrics Explorer page in the Google Cloud console.

    Go to Monitoring: Metrics Explorer

  2. Follow the instructions in Create charts with Metrics Explorer to select and display your metrics.

Metric details

The following metrics are available for Managed Lustre instances. Each metric is identified by its type (e.g., lustre.googleapis.com/instance/available_bytes), has a display name, a description, and specific labels that provide additional context.

Data is sampled every 60 seconds. After sampling, data may not be visible for up to 180 seconds.

Storage Capacity Metrics

Metrics related to the storage space available and provisioned on your Lustre file system.

For metric labels, the value of target uses the format <fsname>-<TYPE><HEXA> where <HEXA> is the zero-based index of the target in hexadecimal. For example, if your filesystem name is filesys, the 43rd OST is filesys-OST002a, and the 4th MDT is filesys-MDT0003.

Metric Description Details
available_bytes The number of bytes of storage space for a given Object Storage Target (OST) or Metadata Target (MDT) that is available to non-root users. Display Name: Available bytes
Metric Kind: GAUGE
Value Type: INT64
Unit: By
Labels:
component: The target type: ost, mdt, or mgt.
target: The name of the target.
capacity_bytes The number of bytes provisioned for the given target. The total cluster usable data or metadata space for an instance can be obtained by adding the capacity of all targets for a given type of target. Display Name: Capacity bytes
Metric Kind: GAUGE
Value Type: INT64
Unit: Bytes
Labels:
component: The target type: ost, mdt, or mgt.
target: The name of the target.
free_bytes The number of bytes of storage space for a given OST or MDT that is available to root users. Display Name: Free bytes
Metric Kind: GAUGE
Value Type: INT64
Unit: Bytes
Labels:
component: The target type: ost, mdt, or mgt.
target: The name of the target.

Inode (object) Metrics

Metrics related to the number of inodes (objects) available and the maximum capacity.

Metric Description Details
inodes_free The number of inodes (objects) available on the given target. Display Name: Free inodes
Metric Kind: GAUGE
Value Type: INT64
Unit: {inode}
Labels:
component: The target type.
target: The name of the target.
inodes_maximum The maximum number of inodes (objects) the target can hold. Display Name: Maximum inodes
Metric Kind: GAUGE
Value Type: INT64
Unit: {inode}
Labels:
component: The target type.
target: The name of the target.

I/O Performance Metrics

Metrics providing insight into data transfer rates and operation latency.

Operation latency

Metric Description Details
io_time_milliseconds_total The number of read or write operations that have completed within a given time range. Display Name: Operation latency
Metric Kind: CUMULATIVE
Value Type: INT64
Unit: milliseconds
Labels:
component: The target type.
operation: The operation type.
size: The bucketed latency range. For example, 512 includes the count of operations that took between 512 and 1024 milliseconds.
target: The name of the target.
read_bytes_total The number of data bytes read from the given OST. Display Name: Data read bytes
Metric Kind: CUMULATIVE
Value Type: INT64
Unit: Bytes
Labels:
component: The target type: always ost.
operation: The operation type: read. target: The name of the target.
read_samples_total The number of read operations performed on the given OST. Display Name: Data read operations
Metric Kind: CUMULATIVE
Value Type: INT64
Unit: {operation}
Labels:
component: The target type: always ost.
operation: The operation type: read. target: The name of the Managed Lustre target.
write_bytes_total The number of data bytes written to the given OST. Display Name: Data write bytes
Metric Kind: CUMULATIVE
Value Type: INT64
Unit: Bytes
Labels:
component: The target type: always ost.
operation: The operation type: write.
target: The name of the target.
write_samples_total The number of write operations performed on the given OST. Display Name: Data write operations
Metric Kind: CUMULATIVE
Value Type: INT64
Unit: {operation}
Labels:
component: The target type: always ost.
operation: The operation type: write.
target: The name of the target.

Client Connection Metrics

Metrics specifically for understanding client connectivity.

Connected clients

Metric Description Details
connected_clients The number of clients currently connected to the given MDT. Display Name: Connected clients
Metric Kind: GAUGE
Value Type: INT64
Unit: {client}
Labels:
component: The target type. This is always mdt.
target: The name of the MDT.