delta_attack
Description
Extract MS Office files to plain text.
Installation
Archive Installation
$ rake install
Gem Installation
$ gem source -a http://gems.github.com
$ gem install moro-delta-attack
Features/Problems
Extracts MS Office files to plain text using Apache POI and JRuby. It uses a client/server architecture.
The extraction server runs on JRuby, but the client works with both CRuby and JRuby.
This library was originally written to feed Office documents into a full-text search engine.
Synopsis
First, start DeltaAttackServer, which needs JRuby and Apache POI:
$ export CLASSPATH=path/to/poi-3.1-FINAL/poi-3.1-FINAL-20080629.jar:\
path/to/poi-3.1-FINAL/poi-scratchpad-3.1-FINAL-20080629.jar
$ jruby bin/delta_attack_server
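If the server fails to start, a mistyped jar path in CLASSPATH is a likely cause. The following is a minimal sketch of a pre-flight check, not part of delta_attack; the helper name missing_classpath_entries is hypothetical and only illustrates the idea.

```ruby
# Hypothetical helper (not part of delta_attack): returns the CLASSPATH
# entries that do not exist on disk.
def missing_classpath_entries(classpath)
  classpath.to_s
           .split(File::PATH_SEPARATOR)   # ":" on Unix, ";" on Windows
           .reject(&:empty?)
           .reject { |entry| File.exist?(entry) }
end

# Warn about missing jars before launching the server.
missing = missing_classpath_entries(ENV["CLASSPATH"])
warn "Missing CLASSPATH entries: #{missing.join(', ')}" unless missing.empty?
```

Running this with the same environment as the server prints a warning for each entry that cannot be found, and prints nothing when every entry exists.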
Then you can use DeltaAttack::Client from both CRuby (MRI) and JRuby:
require 'delta_attack/client'
DeltaAttack::Client.cast("path/to/some.xls")
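Since the library was written with full-text indexing in mind, here is a rough sketch of how extracted text might feed a simple index. It is not part of delta_attack: build_index and the stand-in extractor are hypothetical, and the sketch assumes the extractor returns the document's text as a String.

```ruby
# Hypothetical helper (not part of delta_attack): builds a tiny inverted
# index mapping each word to the list of files that contain it.
# `extractor` is any callable that takes a path and returns plain text.
def build_index(paths, extractor)
  paths.each_with_object({}) do |path, index|
    text = extractor.call(path).to_s
    # Record each distinct word once per file.
    text.downcase.scan(/\w+/).uniq.each do |word|
      (index[word] ||= []) << path
    end
  end
end

# Stand-in extractor so the sketch runs without a server.
fake_extractor = ->(path) { "Quarterly sales report for #{File.basename(path)}" }

index = build_index(["a.xls", "b.xls"], fake_extractor)
puts index["sales"].inspect
```

With a running server, the stand-in could be swapped for DeltaAttack::Client.method(:cast), assuming cast returns the extracted text as a String. Taking the extractor as an argument keeps the indexing logic testable without JRuby or Apache POI.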
Copyright
- Author: moro <[email protected]>
- Copyright: Copyright © 2008 moro
- License: MIT