Qwen models on Vertex AI offer fully managed and serverless models as APIs. To use a Qwen model on Vertex AI, send a request directly to the Vertex AI API endpoint. Because Qwen models use a managed API, there's no need to provision or manage infrastructure.
You can stream your responses to reduce the end-user latency perception. A streamed response uses server-sent events (SSE) to incrementally stream the response.
Available Qwen models
The following models are available from Qwen to use in Vertex AI. To access a Qwen model, go to its Model Garden model card.
Qwen3 Coder (Qwen3-Coder-480B-A35B-Instruct)
Qwen3 Coder (Qwen3-Coder-480B-A35B-Instruct
) is a large-scale, open-weight model
developed for advanced software development tasks. The model's key feature is
its large context window, allowing it to process and understand large codebases
comprehensively.
Go to the Qwen3-Coder-480B-A35B-Instruct model card
Qwen3 235B (Qwen3-235B-A22B-Instruct-2507)
Qwen3 235B (Qwen3-235B-A22B-Instruct-2507
) is a large 235B parameter model. The model
is distinguished by its "hybrid thinking" capability, which allows users to
dynamically switch between a methodical, step-by-step "thinking" mode for
complex tasks like mathematical reasoning and coding, and a rapid "non-thinking"
mode for general-purpose conversation. Its large context window makes it
suitable for use cases requiring deep reasoning and long-form comprehension.
Go to the Qwen3-235B-A22B-Instruct-2507 model card
Before you begin
To use Qwen models with Vertex AI, you must perform the
following steps. The Vertex AI API
(aiplatform.googleapis.com
) must be enabled to use
Vertex AI. If you already have an existing project with the
Vertex AI API enabled, you can use that project instead of creating a
new project.
- Sign in to your Google Cloud account. If you're new to Google Cloud, create an account to evaluate how our products perform in real-world scenarios. New customers also get $300 in free credits to run, test, and deploy workloads.
-
In the Google Cloud console, on the project selector page, select or create a Google Cloud project.
-
Verify that billing is enabled for your Google Cloud project.
-
Enable the Vertex AI API.
-
In the Google Cloud console, on the project selector page, select or create a Google Cloud project.
-
Verify that billing is enabled for your Google Cloud project.
-
Enable the Vertex AI API.
- Go to one of the following Model Garden model cards, then click Enable.