Stay organized with collections
Save and categorize content based on your preferences.
The dataset operators prepare the data for training an AutoML model in
Vertex AI. For more details about specific data type dataset information,
see Train and use your own models. For information about
AutoML training in Vertex AI, see the
AutoML training documentation. For information
about Google Cloud Pipeline Components related to AutoML models and workflows, see
Vertex AI AutoML components.
The Google Cloud SDK includes the following operators related to
AutoML dataset resource management:
[[["Easy to understand","easyToUnderstand","thumb-up"],["Solved my problem","solvedMyProblem","thumb-up"],["Other","otherUp","thumb-up"]],[["Hard to understand","hardToUnderstand","thumb-down"],["Incorrect information or sample code","incorrectInformationOrSampleCode","thumb-down"],["Missing the information/samples I need","missingTheInformationSamplesINeed","thumb-down"],["Other","otherDown","thumb-down"]],["Last updated 2025-08-29 UTC."],[],[],null,["# Dataset components\n\nThe dataset operators prepare the data for training an AutoML model in\nVertex AI. For more details about specific data type dataset information,\nsee [Train and use your own models](/vertex-ai/docs/training-overview). For information about\nAutoML training in Vertex AI, see the\n[AutoML training documentation](/vertex-ai/docs/training-overview#automl). For information\nabout Google Cloud Pipeline Components related to AutoML models and workflows, see\n[Vertex AI AutoML components](/vertex-ai/docs/pipelines/vertex-automl-component).\n\nThe Google Cloud SDK includes the following operators related to\nAutoML `dataset` resource management:\n\n\n- [`GetVertexDatasetOp`](https://google-cloud-pipeline-components.readthedocs.io/en/google-cloud-pipeline-components-2.19.0/api/v1/dataset.html#v1.dataset.GetVertexDatasetOp)\n- [`ImageDatasetCreateOp`](https://google-cloud-pipeline-components.readthedocs.io/en/google-cloud-pipeline-components-2.19.0/api/v1/dataset.html#v1.dataset.ImageDatasetCreateOp)\n- [`ImageDatasetExportDataOp`](https://google-cloud-pipeline-components.readthedocs.io/en/google-cloud-pipeline-components-2.19.0/api/v1/dataset.html#v1.dataset.ImageDatasetExportDataOp)\n- [`ImageDatasetImportDataOp`](https://google-cloud-pipeline-components.readthedocs.io/en/google-cloud-pipeline-components-2.19.0/api/v1/dataset.html#v1.dataset.ImageDatasetImportDataOp)\n- [`TabularDatasetCreateOp`](https://google-cloud-pipeline-components.readthedocs.io/en/google-cloud-pipeline-components-2.19.0/api/v1/dataset.html#v1.dataset.TabularDatasetCreateOp)\n- [`TabularDatasetExportDataOp`](https://google-cloud-pipeline-components.readthedocs.io/en/google-cloud-pipeline-components-2.19.0/api/v1/dataset.html#v1.dataset.TabularDatasetExportDataOp)\n- [`TextDatasetCreateOp`](https://google-cloud-pipeline-components.readthedocs.io/en/google-cloud-pipeline-components-2.19.0/api/v1/dataset.html#v1.dataset.TextDatasetCreateOp)\n- [`TextDatasetExportDataOp`](https://google-cloud-pipeline-components.readthedocs.io/en/google-cloud-pipeline-components-2.19.0/api/v1/dataset.html#v1.dataset.TextDatasetExportDataOp)\n- [`TextDatasetImportDataOp`](https://google-cloud-pipeline-components.readthedocs.io/en/google-cloud-pipeline-components-2.19.0/api/v1/dataset.html#v1.dataset.TextDatasetImportDataOp)\n- [`TimeSeriesDatasetCreateOp`](https://google-cloud-pipeline-components.readthedocs.io/en/google-cloud-pipeline-components-2.19.0/api/v1/dataset.html#v1.dataset.TimeSeriesDatasetCreateOp)\n- [`TimeSeriesDatasetExportDataOp`](https://google-cloud-pipeline-components.readthedocs.io/en/google-cloud-pipeline-components-2.19.0/api/v1/dataset.html#v1.dataset.TimeSeriesDatasetExportDataOp)\n- [`VideoDatasetCreateOp`](https://google-cloud-pipeline-components.readthedocs.io/en/google-cloud-pipeline-components-2.19.0/api/v1/dataset.html#v1.dataset.VideoDatasetCreateOp)\n- [`VideoDatasetExportDataOp`](https://google-cloud-pipeline-components.readthedocs.io/en/google-cloud-pipeline-components-2.19.0/api/v1/dataset.html#v1.dataset.VideoDatasetExportDataOp)\n- [`VideoDatasetImportDataOp`](https://google-cloud-pipeline-components.readthedocs.io/en/google-cloud-pipeline-components-2.19.0/api/v1/dataset.html#v1.dataset.VideoDatasetImportDataOp)\n\n\u003cbr /\u003e\n\nAPI reference\n-------------\n\n- For dataset component reference, see the\n [Google Cloud SDK reference for Dataset components](https://google-cloud-pipeline-components.readthedocs.io/en/google-cloud-pipeline-components-2.19.0/api/v1/dataset.html).\n\n- For Vertex AI API reference, see the following API reference pages:\n\n - [`Dataset` resource](/vertex-ai/docs/reference/rest/v1/projects.locations.datasets)\n\n - [`TrainingPipeline` resource](/vertex-ai/docs/reference/rest/v1/projects.locations.trainingPipelines)\n\nTutorials\n---------\n\n- [Learn how to use the Google Cloud pipeline components to train an image classification model using Vertex AI AutoML.](https://github.com/GoogleCloudPlatform/vertex-ai-samples/blob/main/notebooks/official/pipelines/google_cloud_pipeline_components_automl_images.ipynb)\n- [Learn how to use the Google Cloud pipeline components to train a classification model using tabular data and Vertex AI AutoML.](https://github.com/GoogleCloudPlatform/vertex-ai-samples/blob/main/notebooks/official/pipelines/automl_tabular_classification_beans.ipynb)\n- [Learn how to use the Google Cloud pipeline components to train a linear regression model using tabular data and Vertex AI AutoML.](https://github.com/GoogleCloudPlatform/vertex-ai-samples/blob/main/notebooks/official/pipelines/google_cloud_pipeline_components_automl_tabular.ipynb)\n- [Learn how to use the Google Cloud pipeline components to train a text classification model using Vertex AI AutoML.](https://github.com/GoogleCloudPlatform/vertex-ai-samples/blob/main/notebooks/official/pipelines/google_cloud_pipeline_components_automl_text.ipynb)\n- [Learn how to use the Google Cloud pipeline components to upload and deploy a model.](https://github.com/GoogleCloudPlatform/vertex-ai-samples/blob/main/notebooks/official/pipelines/google_cloud_pipeline_components_model_train_upload_deploy.ipynb)\n\nVersion history and release notes\n---------------------------------\n\nTo learn more about the version history and changes to the Google Cloud Pipeline Components SDK, see the [Google Cloud Pipeline Components SDK Release Notes](https://google-cloud-pipeline-components.readthedocs.io/en/google-cloud-pipeline-components-2.19.0/release.html).\n\n### Technical support contacts\n\nIf you have any questions, reach out to\n[kubeflow-pipelines-components@google.com](mailto: kubeflow-pipelines-components@google.com)."]]