delta_attack

Description

Extract MS Office files to plain text.

Installation

Archive Installation

$ rake install

Gem Installation

$ gem source -a http://gems.github.com
$ gem install moro-delta-attack

Features/Problems

Extract MS Office files to plain text usin Apache POI and JRuby. It works with Client/Server architecture.

The extract server is works on JRuby but the client is works with both cRuby and JRuby.

This library originally aim to index Office documents to fulltext serach engine.

Synopsis

first you start DeltaAttackServer, which needs JRuby and Apache POI

$ export CLASSPATH=path/to/poi-3.1-FINAL/poi-3.1-FINAL-20080629.jar:\
                   path/to/poi-3.1-FINAL/poi-scratchpad-3.1-FINAL-20080629.jar
$ jruby bin/delta_attack_server

Then you can use DeltaAttack::Client, in both CRuby(MRI) and JRuby.

require 'delta_attack/client'
DeletaAttack::Client.cast("path/to/some.xls")
Author

moro <[email protected]>

Copyright

Copyright © 2008 moro

License

MIT