Full name: projects.locations.processors.dataset.importDocuments
Import documents into a dataset.
HTTP request
POST https://{endpoint}/v1beta3/{dataset}:importDocuments
Where {endpoint} is one of the supported service endpoints.
Path parameters
| Parameters | |
|---|---|
| dataset | 
 Required. The dataset resource name. Format: projects/{project}/locations/{location}/processors/{processor}/dataset It takes the form  | 
Request body
The request body contains data with the following structure:
| JSON representation | 
|---|
| {
  "batchDocumentsImportConfigs": [
    {
      object ( | 
| Fields | |
|---|---|
| batchDocumentsImportConfigs[] | 
 Required. The Cloud Storage uri containing raw documents that must be imported. | 
Response body
If successful, the response body contains an instance of Operation.
Authorization scopes
Requires the following OAuth scope:
- https://www.googleapis.com/auth/cloud-platform
For more information, see the Authentication Overview.
IAM Permissions
Requires the following IAM permission on the dataset resource:
- documentai.datasets.createDocuments
For more information, see the IAM documentation.
BatchDocumentsImportConfig
Config for importing documents. Each batch can have its own dataset split type.
| JSON representation | 
|---|
| { "batchInputConfig": { object ( | 
| Fields | |
|---|---|
| batchInputConfig | 
 The common config to specify a set of documents used as input. | 
| Union field  
 | |
| datasetSplit | 
 Target dataset split where the documents must be stored. | 
| autoSplitConfig | 
 If set, documents will be automatically split into training and test split category with the specified ratio. | 
AutoSplitConfig
The config for auto-split.
| JSON representation | 
|---|
| { "trainingSplitRatio": number } | 
| Fields | |
|---|---|
| trainingSplitRatio | 
 Ratio of training dataset split. |