Class: ReductoAI::Resources::Extract

Inherits:

Object

Defined in: lib/reducto_ai/resources/extract.rb

Overview

Note:

Extraction operations consume credits based on document complexity and schema size.

Extract resource for structured data extraction.

Extracts specific information from documents based on a schema or instructions. Returns structured JSON data matching the provided schema.

Examples:

Extract with schema

client = ReductoAI::Client.new
schema = {
  invoice_number: "string",
  total_amount: "number",
  line_items: ["object"]
}

result = client.extract.sync(
  input: "https://example.com/invoice.pdf",
  instructions: schema
)
puts result["result"]

Instance Method Summary

#async(input:, instructions:, async: nil, **options) ⇒ Hash
Extracts structured data from a document asynchronously.

#initialize(client) ⇒ Extract (constructor, private)
Returns a new instance of Extract.

#sync(input:, instructions:, **options) ⇒ Hash
Extracts structured data from a document synchronously.

Constructor Details

#initialize(client) ⇒ `Extract`

This method is part of a private API. Avoid using it if possible, as it may be removed or changed in the future.

Returns a new instance of Extract.

Parameters:

client (Client) —
the Reducto API client

# File 'lib/reducto_ai/resources/extract.rb', line 29

def initialize(client)
  @client = client
end

Instance Method Details

#async(input:, instructions:, async: nil, **options) ⇒ `Hash`

Extracts structured data from a document asynchronously.

Returns immediately with a job_id. Poll with Jobs#retrieve to get results.

Examples:

Start async extraction

job = client.extract.async(
  input: "https://example.com/contract.pdf",
  instructions: { parties: ["string"], terms: "string" }
)
job_id = job["job_id"]

Parameters:

input (String, Hash) —
Document URL or hash with :url key
instructions (Hash, String) —
Extraction schema (same as #sync)
async (Boolean, nil) (defaults to: nil) —
Async mode flag
options (Hash) —
Additional extraction options

Returns:

(Hash) —
Job status with keys:
- "job_id" [String] - Job identifier for polling
- "status" [String] - Initial status ("processing")

Raises:

(ArgumentError) —
if input or instructions are nil/empty
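
The polling step described for #async can be sketched as a small helper. This is a minimal sketch, not part of the gem: `wait_for_job` is a hypothetical name, and the block is where a call to Jobs#retrieve would go. The `client.jobs.retrieve(job_id: ...)` form in the usage comment is an assumption, since this page does not document that method's arguments. Only the "processing" status value comes from the return description above.

```ruby
# Hypothetical helper (not part of ReductoAI): polls until the job leaves
# the "processing" state or the attempt budget runs out.
# The block receives the job_id and must return the current job hash.
def wait_for_job(job_id, attempts: 10, interval: 2.0)
  attempts.times do
    job = yield(job_id)
    return job unless job["status"] == "processing"

    sleep(interval)
  end
  raise "job #{job_id} still processing after #{attempts} attempts"
end

# Usage sketch; client.jobs and the argument form of Jobs#retrieve are assumptions:
#   job  = client.extract.async(input: url, instructions: schema)
#   done = wait_for_job(job["job_id"]) { |id| client.jobs.retrieve(job_id: id) }
```

Passing the retrieval call as a block keeps the helper independent of the exact Jobs#retrieve signature and makes it easy to exercise with a stub.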

#sync(input:, instructions:, **options) ⇒ `Hash`

Extracts structured data from a document synchronously.

Examples:

Extract invoice data

result = client.extract.sync(
  input: "https://example.com/invoice.pdf",
  instructions: {
    invoice_number: "string",
    total: "number"
  }
)

Parameters:

input (String, Hash) —
Document URL or hash with :url key
instructions (Hash, String) —
Extraction schema or instructions. Can be a simple hash (auto-wrapped as { schema: ... }) or a full instructions hash with a :schema key.
options (Hash) —
Additional extraction options
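
The auto-wrapping of `instructions` described above can be illustrated with a short sketch. `normalize_instructions` is a hypothetical helper written for this example, not the gem's actual implementation, and passing String instructions through unchanged is an assumption, since this page does not say how strings are handled.

```ruby
# Hypothetical helper (not the gem's code): normalizes the instructions
# argument the way the parameter description suggests.
def normalize_instructions(instructions)
  if instructions.nil? || (instructions.respond_to?(:empty?) && instructions.empty?)
    raise ArgumentError, "instructions are required"
  end

  # Assumption: String instructions are passed through unchanged.
  return instructions unless instructions.is_a?(Hash)

  # A hash that already has a :schema key is treated as a full instructions hash.
  return instructions if instructions.key?(:schema)

  # A bare schema hash is wrapped as { schema: ... }.
  { schema: instructions }
end

normalize_instructions({ total: "number" })
# => { schema: { total: "number" } }
normalize_instructions({ schema: { total: "number" } })
# => { schema: { total: "number" } }
```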

Returns:

(Hash) —
Extraction results with keys:
- "job_id" [String] - Job identifier
- "status" [String] - Job status ("succeeded")
- "result" [Hash] - Extracted data matching schema
- "usage" [Hash] - Credit usage details

Raises:

(ArgumentError) —
if input or instructions are nil/empty
(ClientError) —
if schema is invalid
(ServerError) —
if extraction fails
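
The three error cases listed above can be told apart at the call site. The following is a sketch under stated assumptions: `ExtractSketch::ClientError` and `ExtractSketch::ServerError` are stand-in classes defined here so the example runs on its own, and `try_extract` is a hypothetical wrapper. With the gem loaded, rescue its real ClientError and ServerError classes instead; their namespace is not shown on this page.

```ruby
# Stand-in error classes for illustration only (assumption: the gem's real
# ClientError and ServerError are StandardError subclasses).
module ExtractSketch
  ClientError = Class.new(StandardError)
  ServerError = Class.new(StandardError)
end

# Hypothetical wrapper: runs the block and maps each documented error
# to a reason symbol instead of letting it propagate.
def try_extract
  { ok: true, data: yield }
rescue ArgumentError => e
  # input or instructions were nil/empty
  { ok: false, reason: :invalid_arguments, error: e.message }
rescue ExtractSketch::ClientError => e
  # the schema was rejected as invalid
  { ok: false, reason: :invalid_schema, error: e.message }
rescue ExtractSketch::ServerError => e
  # the extraction itself failed
  { ok: false, reason: :extraction_failed, error: e.message }
end

# Usage sketch:
#   outcome = try_extract { client.extract.sync(input: url, instructions: schema) }
#   puts outcome[:data]["result"] if outcome[:ok]
```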
