- NAME
-
- gcloud alpha container ai recommender accelerators list - list compatible accelerator profiles
- SYNOPSIS
-
-
gcloud alpha container ai recommender accelerators list
--model
=MODEL
[--format
=FORMAT
; default="table"] [--max-ntpot-milliseconds
=MAX_NTPOT_MILLISECONDS
] [--model-server
=MODEL_SERVER
] [--model-server-version
=MODEL_SERVER_VERSION
] [GCLOUD_WIDE_FLAG …
]
-
- DESCRIPTION
-
(ALPHA)
This command lists all supported accelerators with their performance details. By default, the supported accelerators are displayed in a table format with select information for each accelerator. To see all details, use --format=yaml.To get supported model, model servers, and model server versions, run
gcloud alpha container ai recommender models list
,gcloud alpha container ai recommender model-servers list
, andgcloud alpha container ai recommender model-server-versions list
. Alternatively, rungcloud alpha container ai recommender model-and-server-combinations list
to get all supported model and server combinations. - REQUIRED FLAGS
-
--model
=MODEL
- The model.
- OPTIONAL FLAGS
-
--format
=FORMAT
; default="table"-
The output format. Default is table, which displays select information in a
table format. Use --format=yaml to display all details.
FORMAT
must be one of:table
,yaml
. --max-ntpot-milliseconds
=MAX_NTPOT_MILLISECONDS
- The maximum normalized time per output token (NTPOT) in milliseconds. NTPOT is measured as the request_latency / output_tokens. If this field is set, the command will only return accelerators that can meet the target ntpot milliseconds and display their throughput performance at the target latency. Otherwise, the command will return all accelerators and display their highest throughput performance.
--model-server
=MODEL_SERVER
- The model server. If not specified, this defaults to any model server.
--model-server-version
=MODEL_SERVER_VERSION
- The model server version. If not specified, this defaults to the latest version.
- GCLOUD WIDE FLAGS
-
These flags are available to all commands:
--access-token-file
,--account
,--billing-project
,--configuration
,--flags-file
,--flatten
,--format
,--help
,--impersonate-service-account
,--log-http
,--project
,--quiet
,--trace-token
,--user-output-enabled
,--verbosity
.Run
$ gcloud help
for details. - NOTES
- This command is currently in alpha and might change without notice. If this command fails with API permission errors despite specifying the correct project, you might be trying to access an API with an invitation-only early access allowlist.
gcloud alpha container ai recommender accelerators list
Except as otherwise noted, the content of this page is licensed under the Creative Commons Attribution 4.0 License, and code samples are licensed under the Apache 2.0 License. For details, see the Google Developers Site Policies. Java is a registered trademark of Oracle and/or its affiliates.
Last updated 2025-04-01 UTC.