Fine tune LLMs using GPUs with Cloud Run jobs

You can fine tune a Gemma 2B model with a text-to-sql dataset using a Cloud Run job, then serve the fine tuned model on a Cloud Run service using vLLM.

See a step-by-step instructional codelab at How to fine tune a model using Cloud Run jobs.