gcloud alpha container ai recommender accelerators list
Stay organized with collections Save and categorize content based on your preferences.

NAME: gcloud alpha container ai recommender accelerators list - list compatible accelerator profiles
SYNOPSIS: gcloud alpha container ai recommender accelerators list --model=MODEL [--format=FORMAT; default="table"] [--max-ntpot-milliseconds=MAX_NTPOT_MILLISECONDS] [--model-server=MODEL_SERVER] [--model-server-version=MODEL_SERVER_VERSION] [--filter=EXPRESSION] [--limit=LIMIT] [--page-size=PAGE_SIZE] [--sort-by=[FIELD,…]] [--uri] [GCLOUD_WIDE_FLAG …]
DESCRIPTION: (ALPHA) This command lists all supported accelerators with their performance details. By default, the supported accelerators are displayed in a table format with select information for each accelerator. To see all details, use --format=yaml.
To get supported model, model servers, and model server versions, run gcloud alpha container ai recommender models list, gcloud alpha container ai recommender model-servers list, and gcloud alpha container ai recommender model-server-versions list. Alternatively, run gcloud alpha container ai recommender model-and-server-combinations list to get all supported model and server combinations.
REQUIRED FLAGS: --model=MODEL

The model.
FLAGS: --format=FORMAT; default="table"

The output format. Default is table, which displays select information in a table format. Use --format=yaml to display all details. FORMAT must be one of: table, yaml.

--max-ntpot-milliseconds=MAX_NTPOT_MILLISECONDS

The maximum normalized time per output token (NTPOT) in milliseconds. NTPOT is measured as the request_latency / output_tokens. If this field is set, the command will only return accelerators that can meet the target ntpot milliseconds and display their throughput performance at the target latency. Otherwise, the command will return all accelerators and display their highest throughput performance.

--model-server=MODEL_SERVER

The model server. If not specified, this defaults to any model server.

--model-server-version=MODEL_SERVER_VERSION

The model server version. If not specified, this defaults to the latest version.
LIST COMMAND FLAGS: --filter=EXPRESSION

Apply a Boolean filter EXPRESSION to each resource item to be listed. If the expression evaluates True, then that item is listed. For more details and examples of filter expressions, run $ gcloud topic filters. This flag interacts with other flags that are applied in this order: --flatten, --sort-by, --filter, --limit.

--limit=LIMIT

Maximum number of resources to list. The default is unlimited. This flag interacts with other flags that are applied in this order: --flatten, --sort-by, --filter, --limit.

--page-size=PAGE_SIZE

Some services group resource list output into pages. This flag specifies the maximum number of resources per page. The default is determined by the service if it supports paging, otherwise it is unlimited (no paging). Paging may be applied before or after --filter and --limit depending on the service.

--sort-by=[FIELD,…]

Comma-separated list of resource field key names to sort by. The default order is ascending. Prefix a field with ``~´´ for descending order on that field. This flag interacts with other flags that are applied in this order: --flatten, --sort-by, --filter, --limit.

--uri

Print a list of resource URIs instead of the default output, and change the command output to a list of URIs. If this flag is used with --format, the formatting is applied on this URI list. To display URIs alongside other keys instead, use the uri() transform.
GCLOUD WIDE FLAGS: These flags are available to all commands: --access-token-file, --account, --billing-project, --configuration, --flags-file, --flatten, --format, --help, --impersonate-service-account, --log-http, --project, --quiet, --trace-token, --user-output-enabled, --verbosity.
Run $ gcloud help for details.
NOTES: This command is currently in alpha and might change without notice. If this command fails with API permission errors despite specifying the correct project, you might be trying to access an API with an invitation-only early access allowlist.

gcloud alpha container ai recommender accelerators list Stay organized with collections Save and categorize content based on your preferences.

gcloud alpha container ai recommender accelerators list
Stay organized with collections Save and categorize content based on your preferences.