Automated configuration values for high-performance machine types

This document describes how to automatically set default Cloud Storage FUSE values used for high-performance Compute Engine machine types, which are designed to optimize performance for demanding, high-throughput workloads. Values that are manually set at the time of mount will override these defaults.

Machine types

Configuration values are automated for the following high-performance Compute Engine machine types:

Series type Machine type
A2 machine series
a2-megagpu-16g
a2-ultragpu-8g
A3 machine series
a3-edgegpu-8g
a3-highgpu-8g
a3-megagpu-8g
a3-ultragpu-8g
A4 machine series
4-highgpu-8g-lowmem
TPU v5e
ct5l-hightpu-8t
ct5lp-hightpu-8t
TPU v5p
ct5p-hightpu-4t
ct5p-hightpu-4t-tpu
TPU v6e (Trillium)
ct6e-standard-4t
ct6e-standard-4t-tpu
ct6e-standard-8t
ct6e-standard-8t-tpu

Automated configuration values

When a supported machine type is detected, Cloud Storage FUSE automatically applies the following configuration values:

Cloud Storage FUSE configuration file field Cloud Storage FUSE CLI option Automated configuration value
metadata-cache.negative-ttl-secs --metadata-cache-negative-ttl-secs 0
metadata-cache.ttl-secs1 --metadata-cache-ttl-secs1

-1

metadata-cache.stat-cache-max-size-mb --stat-cache-max-size-mb 1024
metadata-cache.type-cache-max-size-mb --type-cache-max-size-mb 128
implicit-dirs --implicit-dirs true
file-system.rename-dir-limit --rename-dir-limit 200000

1Setting this configuration to -1 significantly boosts performance by always serving files from the cache. Be aware that this configuration bypasses consistency checks, which can lead to serving outdated data. For details on managing data consistency, refer to Overview of caching in Cloud Storage FUSE.

Further performance tuning

When you use a high-performance Google Cloud machine type, the configuration values detailed on this page are automatically applied. However, you can further fine-tune your machine for optimal performance using the Performance tuning best practices guide.

If you're running training, serving, or checkpointing and JIT cache workloads on Google Kubernetes Engine clusters that use Cloud GPUs or Cloud TPU to access large datasets in Cloud Storage, you can streamline your setup by utilizing pre-configured YAML files to mount your Cloud Storage buckets directly into your pods more efficiently. For more information and instructions on how to use pre-configured GKE YAML files, see Use pre-configured GKE YAML files to optimize Cloud Storage FUSE performance.

What's next