Class: Aws::Comprehend::Types::InputDataConfig

Inherits:
Struct
  • Object
show all
Includes:
Structure
Defined in:
lib/aws-sdk-comprehend/types.rb

Overview

The input properties for an inference job. The document reader config field applies only to non-text inputs for custom analysis.

Constant Summary collapse

SENSITIVE =
[]

Instance Attribute Summary collapse

Instance Attribute Details

#document_reader_configTypes::DocumentReaderConfig

Provides configuration parameters to override the default actions for extracting text from PDF documents and image files.



5065
5066
5067
5068
5069
5070
5071
# File 'lib/aws-sdk-comprehend/types.rb', line 5065

class InputDataConfig < Struct.new(
  :s3_uri,
  :input_format,
  :document_reader_config)
  SENSITIVE = []
  include Aws::Structure
end

#input_formatString

Specifies how the text in an input file should be processed:

  • ‘ONE_DOC_PER_FILE` - Each file is considered a separate document. Use this option when you are processing large documents, such as newspaper articles or scientific papers.

  • ‘ONE_DOC_PER_LINE` - Each line in a file is considered a separate document. Use this option when you are processing many short documents, such as text messages.

Returns:

  • (String)


5065
5066
5067
5068
5069
5070
5071
# File 'lib/aws-sdk-comprehend/types.rb', line 5065

class InputDataConfig < Struct.new(
  :s3_uri,
  :input_format,
  :document_reader_config)
  SENSITIVE = []
  include Aws::Structure
end

#s3_uriString

The Amazon S3 URI for the input data. The URI must be in same Region as the API endpoint that you are calling. The URI can point to a single input file or it can provide the prefix for a collection of data files.

For example, if you use the URI ‘S3://bucketName/prefix`, if the prefix is a single file, Amazon Comprehend uses that file as input. If more than one file begins with the prefix, Amazon Comprehend uses all of them as input.

Returns:

  • (String)


5065
5066
5067
5068
5069
5070
5071
# File 'lib/aws-sdk-comprehend/types.rb', line 5065

class InputDataConfig < Struct.new(
  :s3_uri,
  :input_format,
  :document_reader_config)
  SENSITIVE = []
  include Aws::Structure
end