Class DocumentProcessingConfig (0.11.14)

DocumentProcessingConfig(mapping=None, *, ignore_unknown_fields=False, **kwargs)

A singleton resource of a DataStore. It is empty when the DataStore is created, in which case parsing defaults to the digital parser. The first call to the DataStoreService.UpdateDocumentProcessingConfig method initializes the config.

Attributes

Name Description
name str
The full resource name of the Document Processing Config. Format: projects/*/locations/*/collections/*/dataStores/*/documentProcessingConfig.
chunking_config google.cloud.discoveryengine_v1.types.DocumentProcessingConfig.ChunkingConfig
Whether chunking mode is enabled.
default_parsing_config google.cloud.discoveryengine_v1.types.DocumentProcessingConfig.ParsingConfig
Configuration for the default document parser. If not specified, it is set to the default DigitalParsingConfig, and the default parsing config is applied to all file types for document parsing.
parsing_config_overrides MutableMapping[str, google.cloud.discoveryengine_v1.types.DocumentProcessingConfig.ParsingConfig]
Map from file type to a parsing configuration that overrides the default for that file type. Supported keys:
- pdf: Override parsing config for PDF files. Digital parsing, OCR parsing, or layout parsing is supported.
- html: Override parsing config for HTML files. Only digital parsing and layout parsing are supported.
- docx: Override parsing config for DOCX files. Only digital parsing and layout parsing are supported.
- pptx: Override parsing config for PPTX files. Only digital parsing and layout parsing are supported.
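
The attributes above can be sketched as a plain dict that is later passed as the `mapping` argument. This is a minimal illustration, not the library's own validation. The helper names (`build_config_mapping`, `effective_parsing_config`, `to_message`), the resource name values, and the oneof member names `digital_parsing_config`, `ocr_parsing_config`, and `layout_parsing_config` are assumptions that do not appear on this page. The service resolves which parser applies; `effective_parsing_config` only mirrors the documented precedence on the client side.

```python
# Parser kinds allowed per file type, copied from the "Supported keys" list.
SUPPORTED_OVERRIDES = {
    "pdf": {"digital", "ocr", "layout"},
    "html": {"digital", "layout"},
    "docx": {"digital", "layout"},
    "pptx": {"digital", "layout"},
}


def build_config_mapping(name, overrides):
    """Return a plain dict shaped like the DocumentProcessingConfig fields.

    `overrides` maps a file type ("pdf", "html", ...) to a parser kind
    ("digital", "ocr" or "layout").
    """
    parsing_config_overrides = {}
    for file_type, kind in overrides.items():
        allowed = SUPPORTED_OVERRIDES.get(file_type)
        if allowed is None:
            raise ValueError(f"unsupported file type: {file_type!r}")
        if kind not in allowed:
            raise ValueError(f"{kind!r} parsing is not supported for {file_type!r}")
        # Assumed oneof member names: <kind>_parsing_config.
        parsing_config_overrides[file_type] = {f"{kind}_parsing_config": {}}
    return {
        "name": name,
        # Spell out the documented default (digital parsing) explicitly.
        "default_parsing_config": {"digital_parsing_config": {}},
        "parsing_config_overrides": parsing_config_overrides,
    }


def effective_parsing_config(mapping, file_type):
    """Pick the override for `file_type` if present, else the default."""
    overrides = mapping.get("parsing_config_overrides", {})
    return overrides.get(file_type, mapping["default_parsing_config"])


def to_message(mapping):
    """Convert the dict to the real message (needs google-cloud-discoveryengine)."""
    # Imported lazily so the rest of the sketch runs without the package.
    from google.cloud.discoveryengine_v1.types import DocumentProcessingConfig

    return DocumentProcessingConfig(mapping=mapping)


# Hypothetical resource name following the documented format.
NAME = (
    "projects/my-project/locations/global/collections/default_collection"
    "/dataStores/my-data-store/documentProcessingConfig"
)
config = build_config_mapping(NAME, {"pdf": "ocr", "html": "layout"})
```

With this mapping, PDF files would use the OCR override, HTML files the layout override, and DOCX or PPTX files would fall back to the default digital parser. Calling `to_message(config)` is only expected to work where the client library is installed.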

Classes

ChunkingConfig

ChunkingConfig(mapping=None, *, ignore_unknown_fields=False, **kwargs)

ParsingConfig

ParsingConfig(mapping=None, *, ignore_unknown_fields=False, **kwargs)

Related configurations applied to a specific type of document parser.

This message has oneof fields (mutually exclusive fields). For each oneof, at most one member field can be set at the same time. Setting any member of the oneof automatically clears all other members. See https://proto-plus-python.readthedocs.io/en/stable/fields.html#oneofs-mutually-exclusive-fields for details.
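
The oneof behavior described above can be illustrated with a small stand-in class. This is a toy sketch in plain Python, not the proto-plus implementation and not the real ParsingConfig class. The class name `ParsingConfigSketch`, the `which_oneof` helper, and the member names `digital_parsing_config`, `ocr_parsing_config`, and `layout_parsing_config` are assumptions made for illustration.

```python
class ParsingConfigSketch:
    """Toy stand-in for a message with a single oneof group."""

    # Assumed names of the mutually exclusive members.
    _ONEOF_MEMBERS = (
        "digital_parsing_config",
        "ocr_parsing_config",
        "layout_parsing_config",
    )

    def __init__(self, **kwargs):
        self._active = None  # (member name, value) of the member currently set
        for name, value in kwargs.items():
            setattr(self, name, value)

    def __setattr__(self, name, value):
        if name in self._ONEOF_MEMBERS:
            # Setting one member replaces whichever member was set before.
            object.__setattr__(self, "_active", (name, value))
        else:
            object.__setattr__(self, name, value)

    def __getattr__(self, name):
        # Called only when normal attribute lookup fails.
        if name in type(self)._ONEOF_MEMBERS:
            active = self.__dict__.get("_active")
            if active is not None and active[0] == name:
                return active[1]
            return None  # unset members read as None in this sketch
        raise AttributeError(name)

    def which_oneof(self):
        """Return the name of the member that is set, or None."""
        active = self.__dict__.get("_active")
        return active[0] if active is not None else None


cfg = ParsingConfigSketch(digital_parsing_config={})
cfg.ocr_parsing_config = {}  # clears digital_parsing_config
```

After the second assignment only `ocr_parsing_config` is set, which is the "setting any member clears all other members" rule in miniature.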

ParsingConfigOverridesEntry

ParsingConfigOverridesEntry(mapping=None, *, ignore_unknown_fields=False, **kwargs)

The abstract base class for a message.

Parameters
Name Description
kwargs dict

Keys and values corresponding to the fields of the message.

mapping Union[dict, .Message]

A dictionary or message to be used to determine the values for this message.

ignore_unknown_fields Optional[bool]

If True, do not raise errors for unknown fields. Only applied if mapping is a mapping type or there are keyword parameters.