Discovery Engine V1BETA API - Class Google::Cloud::DiscoveryEngine::V1beta::DocumentProcessingConfig (v0.17.0)

Reference documentation and code samples for the Discovery Engine V1BETA API class Google::Cloud::DiscoveryEngine::V1beta::DocumentProcessingConfig.

A singleton resource of DataStore. If it is empty when the DataStore is created and the DataStore is set to DataStore.ContentConfig.CONTENT_REQUIRED, the default parser is the digital parser.

Inherits

  • Object

Extended By

  • Google::Protobuf::MessageExts::ClassMethods

Includes

  • Google::Protobuf::MessageExts

Methods

#chunking_config

def chunking_config() -> ::Google::Cloud::DiscoveryEngine::V1beta::DocumentProcessingConfig::ChunkingConfig

#chunking_config=

def chunking_config=(value) -> ::Google::Cloud::DiscoveryEngine::V1beta::DocumentProcessingConfig::ChunkingConfig
Parameter
  • value (::Google::Cloud::DiscoveryEngine::V1beta::DocumentProcessingConfig::ChunkingConfig)

#default_parsing_config

def default_parsing_config() -> ::Google::Cloud::DiscoveryEngine::V1beta::DocumentProcessingConfig::ParsingConfig
Returns
  • (::Google::Cloud::DiscoveryEngine::V1beta::DocumentProcessingConfig::ParsingConfig)

#default_parsing_config=

def default_parsing_config=(value) -> ::Google::Cloud::DiscoveryEngine::V1beta::DocumentProcessingConfig::ParsingConfig
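Parameter
  • value (::Google::Cloud::DiscoveryEngine::V1beta::DocumentProcessingConfig::ParsingConfig)
Returns
  • (::Google::Cloud::DiscoveryEngine::V1beta::DocumentProcessingConfig::ParsingConfig)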

#name

def name() -> ::String
Returns
  • (::String) — The full resource name of the Document Processing Config. Format: projects/*/locations/*/collections/*/dataStores/*/documentProcessingConfig.

#name=

def name=(value) -> ::String
Parameter
  • value (::String) — The full resource name of the Document Processing Config. Format: projects/*/locations/*/collections/*/dataStores/*/documentProcessingConfig.
Returns
  • (::String) — The full resource name of the Document Processing Config. Format: projects/*/locations/*/collections/*/dataStores/*/documentProcessingConfig.
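
The name format above can be assembled from its four variable segments. The following is a minimal sketch in plain Ruby; the helper `document_processing_config_name` and the sample IDs are hypothetical and not part of the client library, and the check that each segment contains no `/` is an assumption about how the `*` wildcards are meant to be read.

```ruby
# Hypothetical helper (not provided by the gem): builds a resource name in the
# documented format
# projects/*/locations/*/collections/*/dataStores/*/documentProcessingConfig.
def document_processing_config_name(project:, location:, collection:, data_store:)
  segments = {
    project: project,
    location: location,
    collection: collection,
    data_store: data_store
  }

  # Assumption: each "*" stands for exactly one non-empty path segment.
  segments.each do |key, value|
    text = value.to_s
    if text.empty? || text.include?("/")
      raise ArgumentError, "#{key} must be a non-empty segment without '/'"
    end
  end

  "projects/#{project}/locations/#{location}/collections/#{collection}" \
    "/dataStores/#{data_store}/documentProcessingConfig"
end

# Sample IDs are placeholders.
name = document_processing_config_name(
  project: "my-project",
  location: "global",
  collection: "default_collection",
  data_store: "my-data-store"
)
puts name
```

The resulting string is the kind of value that `name=` accepts. Because the resource is a singleton of its DataStore, the final segment is always the literal `documentProcessingConfig`.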

#parsing_config_overrides

def parsing_config_overrides() -> ::Google::Protobuf::Map{::String => ::Google::Cloud::DiscoveryEngine::V1beta::DocumentProcessingConfig::ParsingConfig}
Returns
  • (::Google::Protobuf::Map{::String => ::Google::Cloud::DiscoveryEngine::V1beta::DocumentProcessingConfig::ParsingConfig}) —

    Map from file type to the parsing configuration that overrides the default for that file type. Supported keys:

    • pdf: Override parsing config for PDF files. Digital parsing, OCR parsing, and layout parsing are supported.
    • html: Override parsing config for HTML files. Only digital parsing and layout parsing are supported.
    • docx: Override parsing config for DOCX files. Only digital parsing and layout parsing are supported.
    • pptx: Override parsing config for PPTX files. Only digital parsing and layout parsing are supported.
    • xlsm: Override parsing config for XLSM files. Only digital parsing and layout parsing are supported.
    • xlsx: Override parsing config for XLSX files. Only digital parsing and layout parsing are supported.

#parsing_config_overrides=

def parsing_config_overrides=(value) -> ::Google::Protobuf::Map{::String => ::Google::Cloud::DiscoveryEngine::V1beta::DocumentProcessingConfig::ParsingConfig}
Parameter
  • value (::Google::Protobuf::Map{::String => ::Google::Cloud::DiscoveryEngine::V1beta::DocumentProcessingConfig::ParsingConfig}) —

    Map from file type to the parsing configuration that overrides the default for that file type. Supported keys:

    • pdf: Override parsing config for PDF files. Digital parsing, OCR parsing, and layout parsing are supported.
    • html: Override parsing config for HTML files. Only digital parsing and layout parsing are supported.
    • docx: Override parsing config for DOCX files. Only digital parsing and layout parsing are supported.
    • pptx: Override parsing config for PPTX files. Only digital parsing and layout parsing are supported.
    • xlsm: Override parsing config for XLSM files. Only digital parsing and layout parsing are supported.
    • xlsx: Override parsing config for XLSX files. Only digital parsing and layout parsing are supported.
Returns
  • (::Google::Protobuf::Map{::String => ::Google::Cloud::DiscoveryEngine::V1beta::DocumentProcessingConfig::ParsingConfig}) —

    Map from file type to the parsing configuration that overrides the default for that file type. Supported keys:

    • pdf: Override parsing config for PDF files. Digital parsing, OCR parsing, and layout parsing are supported.
    • html: Override parsing config for HTML files. Only digital parsing and layout parsing are supported.
    • docx: Override parsing config for DOCX files. Only digital parsing and layout parsing are supported.
    • pptx: Override parsing config for PPTX files. Only digital parsing and layout parsing are supported.
    • xlsm: Override parsing config for XLSM files. Only digital parsing and layout parsing are supported.
    • xlsx: Override parsing config for XLSX files. Only digital parsing and layout parsing are supported.
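
The supported keys and their allowed parser kinds can be checked locally before an override map is built. This is a rough sketch in plain Ruby that only transcribes the list above; the constant, the two helper methods, and the symbols `:digital`, `:ocr`, and `:layout` are illustrative labels chosen for this example, not identifiers from the client library.

```ruby
# Allowed parser kinds per file-type key, transcribed from the supported keys
# listed above. The symbols are illustrative labels, not gem identifiers.
SUPPORTED_OVERRIDE_PARSERS = {
  "pdf"  => %i[digital ocr layout],
  "html" => %i[digital layout],
  "docx" => %i[digital layout],
  "pptx" => %i[digital layout],
  "xlsm" => %i[digital layout],
  "xlsx" => %i[digital layout]
}.freeze

# Hypothetical helper: true when the key is supported and the parser kind is
# allowed for that key.
def override_supported?(file_type, parser)
  SUPPORTED_OVERRIDE_PARSERS.fetch(file_type, []).include?(parser)
end

# Hypothetical helper: returns the entries of a planned override hash that the
# list above does not allow (unknown key or disallowed parser kind).
def unsupported_overrides(overrides)
  overrides.reject { |file_type, parser| override_supported?(file_type, parser) }
end

planned = { "pdf" => :ocr, "html" => :ocr, "txt" => :digital }
p unsupported_overrides(planned)
```

Here the `pdf` entry passes, while `html` with OCR parsing and the unlisted `txt` key are reported. A check like this only mirrors the documentation; the service remains the authority on what it accepts.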