A singleton resource of
DataStore. It's
empty when
DataStore is
created, which defaults to digital parser. The first call to
DataStoreService.UpdateDocumentProcessingConfig
method will initialize the config.
Attributes
Name
Description
name
str
The full resource name of the Document Processing Config.
Format:
projects/*/locations/*/collections/*/dataStores/*/documentProcessingConfig.
MutableMapping[str, google.cloud.discoveryengine_v1alpha.types.DocumentProcessingConfig.ParsingConfig]
Map from file type to override the default parsing
configuration based on the file type. Supported keys:
- pdf: Override parsing config for PDF files, either
digital parsing, ocr parsing or layout parsing is
supported.
- html: Override parsing config for HTML files, only
digital parsing and layout parsing are supported.
- docx: Override parsing config for DOCX files, only
digital parsing and layout parsing are supported.
- pptx: Override parsing config for PPTX files, only
digital parsing and layout parsing are supported.
- xlsx: Override parsing config for XLSX files, only
digital parsing and layout parsing are supported.
Related configurations applied to a specific type of document
parser.
This message has oneof_ fields (mutually exclusive fields).
For each oneof, at most one member field can be set at the same time.
Setting any member of the oneof automatically clears all other
members.
[[["Easy to understand","easyToUnderstand","thumb-up"],["Solved my problem","solvedMyProblem","thumb-up"],["Other","otherUp","thumb-up"]],[["Hard to understand","hardToUnderstand","thumb-down"],["Incorrect information or sample code","incorrectInformationOrSampleCode","thumb-down"],["Missing the information/samples I need","missingTheInformationSamplesINeed","thumb-down"],["Other","otherDown","thumb-down"]],["Last updated 2025-08-07 UTC."],[],[],null,["# Class DocumentProcessingConfig (0.13.11)\n\nVersion latestkeyboard_arrow_down\n\n- [0.13.11 (latest)](/python/docs/reference/discoveryengine/latest/google.cloud.discoveryengine_v1alpha.types.DocumentProcessingConfig)\n- [0.13.10](/python/docs/reference/discoveryengine/0.13.10/google.cloud.discoveryengine_v1alpha.types.DocumentProcessingConfig)\n- [0.12.3](/python/docs/reference/discoveryengine/0.12.3/google.cloud.discoveryengine_v1alpha.types.DocumentProcessingConfig)\n- [0.11.14](/python/docs/reference/discoveryengine/0.11.14/google.cloud.discoveryengine_v1alpha.types.DocumentProcessingConfig)\n- [0.10.0](/python/docs/reference/discoveryengine/0.10.0/google.cloud.discoveryengine_v1alpha.types.DocumentProcessingConfig)\n- [0.9.1](/python/docs/reference/discoveryengine/0.9.1/google.cloud.discoveryengine_v1alpha.types.DocumentProcessingConfig)\n- [0.8.1](/python/docs/reference/discoveryengine/0.8.1/google.cloud.discoveryengine_v1alpha.types.DocumentProcessingConfig)\n- [0.7.0](/python/docs/reference/discoveryengine/0.7.0/google.cloud.discoveryengine_v1alpha.types.DocumentProcessingConfig)\n- [0.6.0](/python/docs/reference/discoveryengine/0.6.0/google.cloud.discoveryengine_v1alpha.types.DocumentProcessingConfig)\n- [0.5.0](/python/docs/reference/discoveryengine/0.5.0/google.cloud.discoveryengine_v1alpha.types.DocumentProcessingConfig)\n- [0.4.1](/python/docs/reference/discoveryengine/0.4.1/google.cloud.discoveryengine_v1alpha.types.DocumentProcessingConfig)\n- [0.3.1](/python/docs/reference/discoveryengine/0.3.1/google.cloud.discoveryengine_v1alpha.types.DocumentProcessingConfig)\n- [0.2.1](/python/docs/reference/discoveryengine/0.2.1/google.cloud.discoveryengine_v1alpha.types.DocumentProcessingConfig)\n- [0.1.1](/python/docs/reference/discoveryengine/0.1.1/google.cloud.discoveryengine_v1alpha.types.DocumentProcessingConfig) \n\n DocumentProcessingConfig(mapping=None, *, ignore_unknown_fields=False, **kwargs)\n\nA singleton resource of\nDataStore. It's\nempty when\nDataStore is\ncreated, which defaults to digital parser. The first call to\nDataStoreService.UpdateDocumentProcessingConfig\nmethod will initialize the config.\n\nClasses\n-------\n\n### ChunkingConfig\n\n ChunkingConfig(mapping=None, *, ignore_unknown_fields=False, **kwargs)\n\nConfiguration for chunking config.\n\n.. _oneof: \u003chttps://proto-plus-python.readthedocs.io/en/stable/fields.html#oneofs-mutually-exclusive-fields\u003e\n\n### ParsingConfig\n\n ParsingConfig(mapping=None, *, ignore_unknown_fields=False, **kwargs)\n\nRelated configurations applied to a specific type of document\nparser.\n\nThis message has `oneof`_ fields (mutually exclusive fields).\nFor each oneof, at most one member field can be set at the same time.\nSetting any member of the oneof automatically clears all other\nmembers.\n\n.. _oneof: \u003chttps://proto-plus-python.readthedocs.io/en/stable/fields.html#oneofs-mutually-exclusive-fields\u003e\n\n### ParsingConfigOverridesEntry\n\n ParsingConfigOverridesEntry(mapping=None, *, ignore_unknown_fields=False, **kwargs)\n\nThe abstract base class for a message."]]