PDF::Inspector : A tool for analyzing PDF output

This library provides a number of PDF::Reader based tools for use in testing PDF output. Presently, the primary purpose of this tool is to support the tests found in Prawn, a pure Ruby PDF generation library.

However, it may be useful to others, so we have made it available as a gem in its own right.

Installation

The recommended installation method is via Rubygems.

gem install pdf-inspector

Or put this in your Gemfile, if you use Bundler:

group :test do
  gem 'pdf-inspector', :require => "pdf/inspector"
end

Usage

Check for text in the generated PDF:

rendered_pdf = your_pdf_document.render
text_analysis = PDF::Inspector::Text.analyze(rendered_pdf)
text_analysis.strings.should include("foo")

Note that strings returns an array containing one string for each text drawing operation in the PDF. As a result, sentences and paragraphs will often be returned in fragments. To test for the presence of a complete sentence or a longer string, join the array together with an operation like full_text = text_analysis.strings.join(" ").

Check number of pages

rendered_pdf = your_pdf_document.render
page_analysis = PDF::Inspector::Page.analyze(rendered_pdf)
page_analysis.pages.size.should == 2

Licensing

Matz’s terms for Ruby, GPLv2, or GPLv3. See LICENSE for details.

Mailing List

pdf-inspector is maintained as a dependency of prawn, the ruby PDF generation library.

Any questions or feedback should be sent to the Prawn google group.

http://groups.google.com/group/prawn-ruby

Authorship

pdf-inspector was originally developed by Gregory Brown as part of the Prawn1 project. In 2010, Gregory officially handed the project off to the Prawn core team. Currently active maintainers include Brad Ediger, Daniel Nelson, James Healy, and Jonathan Greenberg.

You can find the full list of Github users who have at least one patch accepted to pdf-inspector at:

https://github.com/sandal/pdf-inspector/contributors

References