Class: Aws::Comprehend::Types::InputDataConfig
- Inherits:
-
Struct
- Object
- Struct
- Aws::Comprehend::Types::InputDataConfig
- Includes:
- Structure
- Defined in:
- lib/aws-sdk-comprehend/types.rb
Overview
The input properties for an inference job. The document reader config field applies only to non-text inputs for custom analysis.
Constant Summary collapse
- SENSITIVE =
[]
Instance Attribute Summary collapse
-
#document_reader_config ⇒ Types::DocumentReaderConfig
Provides configuration parameters to override the default actions for extracting text from PDF documents and image files.
-
#input_format ⇒ String
Specifies how the text in an input file should be processed:.
-
#s3_uri ⇒ String
The Amazon S3 URI for the input data.
Instance Attribute Details
#document_reader_config ⇒ Types::DocumentReaderConfig
Provides configuration parameters to override the default actions for extracting text from PDF documents and image files.
5065 5066 5067 5068 5069 5070 5071 |
# File 'lib/aws-sdk-comprehend/types.rb', line 5065 class InputDataConfig < Struct.new( :s3_uri, :input_format, :document_reader_config) SENSITIVE = [] include Aws::Structure end |
#input_format ⇒ String
Specifies how the text in an input file should be processed:
-
‘ONE_DOC_PER_FILE` - Each file is considered a separate document. Use this option when you are processing large documents, such as newspaper articles or scientific papers.
-
‘ONE_DOC_PER_LINE` - Each line in a file is considered a separate document. Use this option when you are processing many short documents, such as text messages.
5065 5066 5067 5068 5069 5070 5071 |
# File 'lib/aws-sdk-comprehend/types.rb', line 5065 class InputDataConfig < Struct.new( :s3_uri, :input_format, :document_reader_config) SENSITIVE = [] include Aws::Structure end |
#s3_uri ⇒ String
The Amazon S3 URI for the input data. The URI must be in same Region as the API endpoint that you are calling. The URI can point to a single input file or it can provide the prefix for a collection of data files.
For example, if you use the URI ‘S3://bucketName/prefix`, if the prefix is a single file, Amazon Comprehend uses that file as input. If more than one file begins with the prefix, Amazon Comprehend uses all of them as input.
5065 5066 5067 5068 5069 5070 5071 |
# File 'lib/aws-sdk-comprehend/types.rb', line 5065 class InputDataConfig < Struct.new( :s3_uri, :input_format, :document_reader_config) SENSITIVE = [] include Aws::Structure end |