Module: PDFTDX

Defined in:
lib/pdftdx.rb,
lib/pdftdx/parser.rb,
lib/pdftdx/version.rb

Overview

PDF TDX Module

Defined Under Namespace

Modules: Parser

Constant Summary

VERSION = '0.3.1'

Version

Class Method Summary

Class Method Details

.extract_data(pdf_file) ⇒ Object

Extract Data from PDF



# File 'lib/pdftdx.rb', line 16

def self.extract_data pdf_file

  # Dump PDF Data
  page_data = Pdftohtml.convert pdf_file

  # Process Page Data
  PDFTDX::Parser.process_page_files page_data
end
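
A minimal usage sketch for the method above. The helper name `safe_extract` and its `extractor:` keyword are hypothetical and not part of pdftdx; they exist only so that the path check can be shown and the sketch can run without the gem installed. The shape of the returned data is not documented here (the signature only promises `Object`), so the sketch passes it through untouched.

```ruby
# Hypothetical helper (not part of pdftdx): verifies that the path points to
# a regular file before handing it to the extractor.
#
# The extractor is injectable; by default it calls PDFTDX.extract_data,
# the class method documented above. The default is only evaluated when
# the helper is called without an explicit extractor.
def safe_extract(pdf_file, extractor: ->(path) { PDFTDX.extract_data(path) })
  raise ArgumentError, "no such file: #{pdf_file}" unless File.file?(pdf_file)

  extractor.call(pdf_file)
end

# With the gem installed, a call might look like this
# ('report.pdf' is a placeholder path):
#
#   require 'pdftdx'
#   data = safe_extract('report.pdf')
```

Checking the path up front is a design choice of this sketch: it reports a missing file as an `ArgumentError` before the conversion step runs, instead of relying on whatever error the underlying converter would produce.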