Class: ReductoAI::Resources::Parse

Inherits:

Object

Object
ReductoAI::Resources::Parse

show all

Defined in:: lib/reducto_ai/resources/parse.rb

Overview

Note:

Each parse operation consumes credits based on document complexity. See Reducto documentation for pricing details.

Parse resource for document parsing operations.

Converts documents (PDFs, images, etc.) into structured formats like Markdown, JSON, or HTML. Supports both synchronous and asynchronous modes.

Examples:

Synchronous parsing

client = ReductoAI::Client.new
result = client.parse.sync(
  input: "https://example.com/document.pdf",
  output_formats: { markdown: true }
)
puts result["result"]["markdown"]

Asynchronous parsing

job = client.parse.async(
  input: { url: "https://example.com/large-doc.pdf" },
  async: true
)
job_id = job["job_id"]

Instance Method Summary collapse

#async(input:, async: nil, **options) ⇒ Hash
Parses a document asynchronously.
#initialize(client) ⇒ Parse constructor private
A new instance of Parse.
#sync(input:, **options) ⇒ Hash
Parses a document synchronously.

Constructor Details

#initialize(client) ⇒ `Parse`

This method is part of a private API. You should avoid using this method if possible, as it may be removed or be changed in the future.

Returns a new instance of Parse.

Parameters:

client (Client) —
the Reducto API client



34
35
36

# File 'lib/reducto_ai/resources/parse.rb', line 34

def initialize(client)
  @client = client
end

Instance Method Details

#async(input:, async: nil, **options) ⇒ `Hash`

Parses a document asynchronously.

Returns immediately with a job_id. Poll with Jobs#retrieve to get results.

Examples:

Start async parse and poll

job = client.parse.async(input: "https://example.com/doc.pdf")
job_id = job["job_id"]

# Poll for completion
loop do
  status = client.jobs.retrieve(job_id: job_id)
  break if status["status"] == "succeeded"
  sleep 2
end

Parameters:

input (String, Hash) —
Document URL or hash with :url key
async (Boolean, nil) (defaults to: nil) —
Async mode flag (defaults to true if not provided)
options (Hash) —
Additional parsing options (same as #sync)

Returns:

(Hash) —
Job status with keys:
- "job_id" [String] - Job identifier for polling
- "status" [String] - Initial status ("processing")

Raises:

(ArgumentError) —
if input is nil

#sync(input:, **options) ⇒ `Hash`

Parses a document synchronously.

Blocks until parsing completes and returns the full result.

Examples:

Parse to markdown

result = client.parse.sync(
  input: "https://example.com/doc.pdf",
  output_formats: { markdown: true }
)

Parameters:

input (String, Hash) —
Document URL or hash with :url key
options (Hash) —
Additional parsing options

Options Hash (**options):

:output_formats (Hash) —
Output format configuration (e.g., { markdown: true, html: true })
:mode (String) —
Processing mode ("ocr", "auto")
:use_cache (Boolean) —
Whether to use cached results

Returns:

(Hash) —
Parsed document with keys:
- "job_id" [String] - Job identifier
- "status" [String] - Job status ("succeeded")
- "result" [Hash] - Parsed content by format (e.g., "markdown", "html")
- "usage" [Hash] - Credit usage details

Raises:

(ArgumentError) —
if input is nil
(ClientError) —
if document URL is invalid or inaccessible
(ServerError) —
if parsing fails

Class: ReductoAI::Resources::Parse

Overview

Examples:

Synchronous parsing

Asynchronous parsing

Instance Method Summary collapse

Constructor Details

#initialize(client) ⇒ Parse

Instance Method Details

#async(input:, async: nil, **options) ⇒ Hash

Examples:

Start async parse and poll

#sync(input:, **options) ⇒ Hash

Examples:

Parse to markdown

#initialize(client) ⇒ `Parse`

#async(input:, async: nil, **options) ⇒ `Hash`

#sync(input:, **options) ⇒ `Hash`