README
Name
chupa-text-decomposer-libreoffice
Description
This is a ChupaText decomposer plugin for to extract text and meta-data from office files such as Microsoft Word file, Microsoft Excel file and OpenDocument Format file. It uses LibreOffice.
You can use libreoffice
decomposer.
It depends on pdf
decomposer. Because it converts a office file to
PDF file and extracts text and meta-data by pdf
decomposer.
Install
Install chupa-text-decomposer-libreoffice gem:
% gem install chupa-text-decomposer-libreoffice
Install LibreOffice from download page.
Now, you can extract text and meta-data from office files:
% chupa-text document.doc
Author
- Kouhei Sutou
<[email protected]>
License
LGPL 2.1 or later.
(Kouhei Sutou has a right to change the license including contributed patches.)