Module: Ddr::Extraction

Defined in:
lib/ddr/extraction.rb,
lib/ddr/extraction/version.rb,
lib/ddr/extraction/adapters.rb,
lib/ddr/extraction/extractor.rb,
lib/ddr/extraction/adapters/adapter.rb,
lib/ddr/extraction/metadata_extractor.rb,
lib/ddr/extraction/adapters/fits_adapter.rb,
lib/ddr/extraction/adapters/tika_adapter.rb,
lib/ddr/extraction/adapters/text_extraction_adapter.rb,
lib/ddr/extraction/adapters/metadata_extraction_adapter.rb

Overview

Ddr::Extraction - A file text and metadata extraction service.

Defined Under Namespace

Modules: Adapters Classes: Extractor, MetadataExtractor

Constant Summary collapse

VERSION =
"0.1.0"

Class Attribute Summary collapse

Class Method Summary collapse

Class Attribute Details

.metadata_adapterObject

Returns the value of attribute metadata_adapter.



13
14
15
# File 'lib/ddr/extraction.rb', line 13

def 
  @metadata_adapter
end

.text_adapterObject

Returns the value of attribute text_adapter.



13
14
15
# File 'lib/ddr/extraction.rb', line 13

def text_adapter
  @text_adapter
end

Class Method Details

.adaptersObject



22
23
24
# File 'lib/ddr/extraction.rb', line 22

def adapters
  Adapters.config
end

.configure_adapter(adapter) {|Adapters.get_adapter(adapter)| ... } ⇒ Object

Yields a configurable object for the named adapter.

Parameters:

  • adapter (Symbol)

    the name of the adapter - e.g., ‘:tika`, `:fits`

Yields:



18
19
20
# File 'lib/ddr/extraction.rb', line 18

def configure_adapter(adapter, &block)
  yield Adapters.get_adapter(adapter)
end

.set_defaultsObject



26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
# File 'lib/ddr/extraction.rb', line 26

def set_defaults
  bin_dir = File.expand_path("../../../bin", __FILE__)

  configure_adapter :tika do |tika|
    tika.version = "1.6"
    tika.path = File.join(bin_dir, "tika-app.jar")
    tika.download_url = "http://archive.apache.org/dist/tika/tika-app-#{tika.version}.jar"
    tika.checksum = "99df0d8c3f6a2be498d275053e611fb5afdf0a9d"
    tika.checksum_type = :SHA1
  end

  configure_adapter :fits do |fits|
    fits.version = "0.8.3"
    fits.path = File.join(bin_dir, "fits-#{fits.version}", "fits.sh")
    fits.download_url = "http://projects.iq.harvard.edu/files/fits/files/fits-#{fits.version}.zip"
  end

  adapters.text = :tika
  adapters. = :fits
end