Class: Raingrams::Model

Inherits:

Object

Object
Raingrams::Model

show all

Includes:: Helpers::Commonality, Helpers::Frequency, Helpers::Probability, Helpers::Random, Helpers::Similarity

Defined in:: lib/raingrams/model.rb

Direct Known Subclasses

BigramModel, HexagramModel, OpenVocabulary::Model, PentagramModel, QuadgramModel, TrigramModel

Instance Attribute Summary collapse

#ignore_case ⇒ Object readonly

Ignore case of parsed text.
#ignore_phone_numbers ⇒ Object readonly

Ignore Phone numbers.
#ignore_punctuation ⇒ Object readonly

Ignore the punctuation of parsed text.
#ignore_references ⇒ Object readonly

Ignore References.
#ignore_urls ⇒ Object readonly

Ignore URLs.
#ngram_size ⇒ Object readonly

Size of ngrams to use.
#prefixes ⇒ Object readonly

Probabilities of all (n-1) grams.
#starting_ngram ⇒ Object readonly

The sentence starting ngram.
#stoping_ngram ⇒ Object readonly

The sentence stopping ngram.

Class Method Summary collapse

.build(options = {}, &block) ⇒ Object

Creates a new model object with the given options.
.open(path) ⇒ Object

Marshals a model from the contents of the file at the specified path.
.train_with_file(path, options = {}) ⇒ Object

Creates a new model object with the given options and trains it with the contents of the specified path.
.train_with_paragraph(paragraph, options = {}) ⇒ Object

Creates a new model object with the given options and trains it with the specified paragraph.
.train_with_text(text, options = {}) ⇒ Object

Creates a new model object with the given options and trains it with the specified text.
.train_with_url(url, options = {}) ⇒ Object

Creates a new model object with the given options and trains it with the inner text of the paragraphs tags at the specified url.

Instance Method Summary collapse

#build(&block) ⇒ Object

Clears and rebuilds the model.
#clear ⇒ Object

Clears the model of any training data.
#each_ngram(&block) ⇒ Object

Iterates over the ngrams that compose the model, passing each one to the given block.
#grams ⇒ Object

Returns all grams within the model.
#grams_following(gram) ⇒ Object

Returns all grams which occur directly after the specified gram.
#grams_preceeding(gram) ⇒ Object

Returns all grams which preceed the specified gram.
#has_gram?(gram) ⇒ Boolean

Returns true if the model contain the specified gram, returns false otherwise.
#has_ngram?(ngram) ⇒ Boolean

Returns true if the model contains the specified ngram, returns false otherwise.
#initialize(options = {}, &block) ⇒ Model constructor

Creates a new NgramModel with the specified options.
#ngrams ⇒ Object

Returns the ngrams that compose the model.
#ngrams_ending_with(gram) ⇒ Object

Returns the ngrams which end with the specified gram.
#ngrams_following(gram) ⇒ Object

Returns all ngrams which occur directly after the specified gram.
#ngrams_from_fragment(fragment) ⇒ Object

Returns the ngrams extracted from the specified fragment of text.
#ngrams_from_sentence(sentence) ⇒ Object

Returns the ngrams extracted from the specified sentence.
#ngrams_from_text(text) ⇒ Object (also: #ngrams_from_paragraph)

Returns the ngrams extracted from the specified text.
#ngrams_from_words(words) ⇒ Object

Returns the ngrams extracted from the specified words.
#ngrams_including_all(*grams) ⇒ Object

Returns the ngrams including all of the specified grams.
#ngrams_including_any(*grams) ⇒ Object

Returns the ngrams including any of the specified grams.
#ngrams_postfixed_by(postfix) ⇒ Object

Returns the ngrams postfixed by the specified postfix.
#ngrams_preceeding(gram) ⇒ Object

Returns all ngrams which preceed the specified gram.
#ngrams_prefixed_by(prefix) ⇒ Object

Returns the ngrams prefixed by the specified prefix.
#ngrams_starting_with(gram) ⇒ Object

Returns the ngrams starting with the specified gram.
#ngrams_with(&block) ⇒ Object

Selects the ngrams that match the given block.
#parse_sentence(sentence) ⇒ Object

Parses the specified sentence and returns an Array of tokens.
#parse_text(text) ⇒ Object

Parses the specified text and returns an Array of sentences.
#refresh(&block) ⇒ Object

Refreshes the probability tables of the model.
#save(path) ⇒ Object

Saves the model to the file at the specified path.
#set_ngram_frequency(ngram, value) ⇒ Object

Sets the frequency of the specified ngram to the specified value.
#to_hash ⇒ Object

Returns a Hash representation of the model.
#train_with_file(path) ⇒ Object

Train the model with the contents of the specified path.
#train_with_ngram(ngram) ⇒ Object

Train the model with the specified ngram.
#train_with_ngrams(ngrams) ⇒ Object

Train the model with the specified ngrams.
#train_with_paragraph(paragraph) ⇒ Object

Train the model with the specified paragraphs.
#train_with_sentence(sentence) ⇒ Object

Train the model with the specified sentence.
#train_with_text(text) ⇒ Object

Train the model with the specified text.
#train_with_url(url) ⇒ Object

Train the model with the inner text of the paragraph tags at the specified url.

Methods included from Helpers::Random

#random_gram, #random_gram_sentence, #random_ngram, #random_paragraph, #random_sentence, #random_text

Methods included from Helpers::Commonality

#common_ngrams_from_fragment, #common_ngrams_from_sentence, #common_ngrams_from_text, #common_ngrams_from_words, #fragment_commonality, included, #sentence_commonality, #text_commonality

Methods included from Helpers::Similarity

Methods included from Helpers::Probability

#fragment_probability, #probabilities_for, #probability_of_ngram, #probability_of_ngrams, #sentence_probability, #text_probability

Methods included from Helpers::Frequency

#frequencies_for, #frequency_of_ngram, #frequency_of_ngrams

Constructor Details

#initialize(options = {}, &block) ⇒ `Model`

Creates a new NgramModel with the specified options.

options must contain the following keys:

:ngram_size: The size of each gram.

options may contain the following keys:

:ignore_case: Defaults to false.
:ignore_punctuation: Defaults to true.
:ignore_urls: Defaults to false.
:ignore_phone_numbers: Defaults to false.

# File 'lib/raingrams/model.rb', line 59

def initialize(options={},&block)
  @ngram_size = options[:ngram_size]
  @starting_ngram = Ngram.new(Tokens.start * @ngram_size)
  @stoping_ngram = Ngram.new(Tokens.stop * @ngram_size)

  @ignore_case = false
  @ignore_punctuation = true
  @ignore_urls = true
  @ignore_phone_numbers = false
  @ignore_references = false

  if options.has_key?(:ignore_case)
    @ignore_case = options[:ignore_case]
  end

  if options.has_key?(:ignore_punctuation)
    @ignore_punctuation = options[:ignore_punctuation]
  end

  if options.has_key?(:ignore_urls)
    @ignore_urls = options[:ignore_urls]
  end

  if options.has_key?(:ignore_phone_numbers)
    @ignore_phone_numbers = options[:ignore_phone_numbers]
  end

  if options.has_key?(:ignore_references)
    @ignore_references = options[:ignore_references]
  end

  @prefixes = {}

  block.call(self) if block
end

Instance Attribute Details

#ignore_case ⇒ `Object` (readonly)

Ignore case of parsed text



30
31
32

# File 'lib/raingrams/model.rb', line 30

def ignore_case
  @ignore_case
end

#ignore_phone_numbers ⇒ `Object` (readonly)

Ignore Phone numbers



39
40
41

# File 'lib/raingrams/model.rb', line 39

def ignore_phone_numbers
  @ignore_phone_numbers
end

#ignore_punctuation ⇒ `Object` (readonly)

Ignore the punctuation of parsed text



33
34
35

# File 'lib/raingrams/model.rb', line 33

def ignore_punctuation
  @ignore_punctuation
end

#ignore_references ⇒ `Object` (readonly)

Ignore References



42
43
44

# File 'lib/raingrams/model.rb', line 42

def ignore_references
  @ignore_references
end

#ignore_urls ⇒ `Object` (readonly)

Ignore URLs



36
37
38

# File 'lib/raingrams/model.rb', line 36

def ignore_urls
  @ignore_urls
end

#ngram_size ⇒ `Object` (readonly)

Size of ngrams to use



21
22
23

# File 'lib/raingrams/model.rb', line 21

def ngram_size
  @ngram_size
end

#prefixes ⇒ `Object` (readonly)

Probabilities of all (n-1) grams



45
46
47

# File 'lib/raingrams/model.rb', line 45

def prefixes
  @prefixes
end

#starting_ngram ⇒ `Object` (readonly)

The sentence starting ngram



24
25
26

# File 'lib/raingrams/model.rb', line 24

def starting_ngram
  @starting_ngram
end

#stoping_ngram ⇒ `Object` (readonly)

The sentence stopping ngram



27
28
29

# File 'lib/raingrams/model.rb', line 27

def stoping_ngram
  @stoping_ngram
end

Class Method Details

.build(options = {}, &block) ⇒ `Object`

Creates a new model object with the given options. If a block is given, it will be passed the newly created model. After the block as been called the model will be built.

# File 'lib/raingrams/model.rb', line 100

def self.build(options={},&block)
  self.new(options) do |model|
    model.build(&block)
  end
end

.open(path) ⇒ `Object`

Marshals a model from the contents of the file at the specified path.

# File 'lib/raingrams/model.rb', line 150

def self.open(path)
  model = nil

  File.open(path) do |file|
    model = Marshal.load(file)
  end

  return model
end

.train_with_file(path, options = {}) ⇒ `Object`

Creates a new model object with the given options and trains it with the contents of the specified path.

# File 'lib/raingrams/model.rb', line 130

def self.train_with_file(path,options={})
  self.build(options) do |model|
    model.train_with_file(path)
  end
end

.train_with_paragraph(paragraph, options = {}) ⇒ `Object`

Creates a new model object with the given options and trains it with the specified paragraph.

# File 'lib/raingrams/model.rb', line 110

def self.train_with_paragraph(paragraph,options={})
  self.build(options) do |model|
    model.train_with_paragraph(paragraph)
  end
end

.train_with_text(text, options = {}) ⇒ `Object`

Creates a new model object with the given options and trains it with the specified text.

# File 'lib/raingrams/model.rb', line 120

def self.train_with_text(text,options={})
  self.build(options) do |model|
    model.train_with_text(text)
  end
end

.train_with_url(url, options = {}) ⇒ `Object`

Creates a new model object with the given options and trains it with the inner text of the paragraphs tags at the specified url.

# File 'lib/raingrams/model.rb', line 140

def self.train_with_url(url,options={})
  self.build(options) do |model|
    model.train_with_url(url)
  end
end

Instance Method Details

#build(&block) ⇒ `Object`

Clears and rebuilds the model.

# File 'lib/raingrams/model.rb', line 549

def build(&block)
  refresh do
    clear

    block.call(self) if block
  end
end

#clear ⇒ `Object`

Clears the model of any training data.

# File 'lib/raingrams/model.rb', line 560

def clear
  @prefixes.clear
  return self
end

#each_ngram(&block) ⇒ `Object`

Iterates over the ngrams that compose the model, passing each one to the given block.

# File 'lib/raingrams/model.rb', line 243

def each_ngram(&block)
  @prefixes.each do |prefix,table|
    table.each_gram do |postfix_gram|
      block.call(prefix + postfix_gram) if block
    end
  end

  return self
end

#grams ⇒ `Object`

Returns all grams within the model.

# File 'lib/raingrams/model.rb', line 433

def grams
  @prefixes.keys.inject(Set.new) do |all_grams,gram|
    all_grams + gram
  end
end

#grams_following(gram) ⇒ `Object`

Returns all grams which occur directly after the specified gram.

# File 'lib/raingrams/model.rb', line 465

def grams_following(gram)
  gram_set = Set.new

  ngram_starting_with(gram).each do |ngram|
    gram_set << ngram[1]
  end

  return gram_set
end

#grams_preceeding(gram) ⇒ `Object`

Returns all grams which preceed the specified gram.

# File 'lib/raingrams/model.rb', line 452

def grams_preceeding(gram)
  gram_set = Set.new

  ngrams_ending_with(gram).each do |ngram|
    gram_set << ngram[-2]
  end

  return gram_set
end

#has_gram?(gram) ⇒ `Boolean`

Returns true if the model contain the specified gram, returns false otherwise.

Returns:

(Boolean)

# File 'lib/raingrams/model.rb', line 443

def has_gram?(gram)
  @prefixes.keys.any? do |prefix|
    prefix.include?(gram)
  end
end

#has_ngram?(ngram) ⇒ `Boolean`

Returns true if the model contains the specified ngram, returns false otherwise.

Returns:

(Boolean)

# File 'lib/raingrams/model.rb', line 231

def has_ngram?(ngram)
  if @prefixes.has_key?(ngram.prefix)
    return @prefixes[ngram.prefix].has_gram?(ngram.last)
  else
    return false
  end
end

#ngrams ⇒ `Object`

Returns the ngrams that compose the model.

# File 'lib/raingrams/model.rb', line 215

def ngrams
  ngram_set = NgramSet.new

  @prefixes.each do |prefix,table|
    table.each_gram do |postfix_gram|
      ngram_set << (prefix + postfix_gram)
    end
  end

  return ngram_set
end

#ngrams_ending_with(gram) ⇒ `Object`

Returns the ngrams which end with the specified gram.

# File 'lib/raingrams/model.rb', line 318

def ngrams_ending_with(gram)
  ngram_set = NgramSet.new

  @prefixes.each do |prefix,table|
    if table.has_gram?(gram)
      ngram_set << (prefix + gram)
    end
  end

  return ngram_set
end

#ngrams_following(gram) ⇒ `Object`

Returns all ngrams which occur directly after the specified gram.

# File 'lib/raingrams/model.rb', line 418

def ngrams_following(gram)
  ngram_set = NgramSet.new

  ngrams_starting_with(gram).each do |starts_with|
    ngrams_prefixed_by(starts_with.postfix).each do |ngram|
      ngram_set << ngram
    end
  end

  return ngram_set
end

#ngrams_from_fragment(fragment) ⇒ `Object`

Returns the ngrams extracted from the specified fragment of text.



378
379
380

# File 'lib/raingrams/model.rb', line 378

def ngrams_from_fragment(fragment)
  ngrams_from_words(parse_sentence(fragment))
end

#ngrams_from_sentence(sentence) ⇒ `Object`

Returns the ngrams extracted from the specified sentence.



385
386
387

# File 'lib/raingrams/model.rb', line 385

def ngrams_from_sentence(sentence)
  ngrams_from_words(wrap_sentence(parse_sentence(sentence)))
end

#ngrams_from_text(text) ⇒ `Object` Also known as: ngrams_from_paragraph

Returns the ngrams extracted from the specified text.

# File 'lib/raingrams/model.rb', line 392

def ngrams_from_text(text)
  parse_text(text).inject([]) do |ngrams,sentence|
    ngrams + ngrams_from_sentence(sentence)
  end
end

#ngrams_from_words(words) ⇒ `Object`

Returns the ngrams extracted from the specified words.

# File 'lib/raingrams/model.rb', line 369

def ngrams_from_words(words)
  return (0...(words.length-@ngram_size+1)).map do |index|
    Ngram.new(words[index,@ngram_size])
  end
end

#ngrams_including_all(*grams) ⇒ `Object`

Returns the ngrams including all of the specified grams.

# File 'lib/raingrams/model.rb', line 356

def ngrams_including_all(*grams)
  ngram_set = NgramSet.new

  each_ngram do |ngram|
    ngram_set << ngram if ngram.includes_all?(*grams)
  end

  return ngram_set
end

#ngrams_including_any(*grams) ⇒ `Object`

Returns the ngrams including any of the specified grams.

# File 'lib/raingrams/model.rb', line 333

def ngrams_including_any(*grams)
  ngram_set = NgramSet.new

  @prefixes.each do |prefix,table|
    if prefix.includes_any?(*grams)
      table.each_gram do |postfix_gram|
        ngram_set << (prefix + postfix_gram)
      end
    else
      table.each_gram do |postfix_gram|
        if grams.include?(postfix_gram)
          ngram_set << (prefix + postfix_gram)
        end
      end
    end
  end

  return ngram_set
end

#ngrams_postfixed_by(postfix) ⇒ `Object`

Returns the ngrams postfixed by the specified postfix.

# File 'lib/raingrams/model.rb', line 284

def ngrams_postfixed_by(postfix)
  ngram_set = NgramSet.new

  @prefixes.each do |prefix,table|
    if prefix[1..-1] == postfix[0..-2]
      if table.has_gram?(postfix.last)
        ngram_set << (prefix + postfix.last)
      end
    end
  end

  return ngram_set
end

#ngrams_preceeding(gram) ⇒ `Object`

Returns all ngrams which preceed the specified gram.

# File 'lib/raingrams/model.rb', line 403

def ngrams_preceeding(gram)
  ngram_set = NgramSet.new

  ngrams_ending_with(gram).each do |ends_with|
    ngrams_postfixed_by(ends_with.prefix).each do |ngram|
      ngram_set << ngram
    end
  end

  return ngram_set
end

#ngrams_prefixed_by(prefix) ⇒ `Object`

Returns the ngrams prefixed by the specified prefix.

# File 'lib/raingrams/model.rb', line 269

def ngrams_prefixed_by(prefix)
  ngram_set = NgramSet.new

  return ngram_set unless @prefixes.has_key?(prefix)

  ngram_set += @prefixes[prefix].grams.map do |gram|
    prefix + gram
  end

  return ngram_set
end

#ngrams_starting_with(gram) ⇒ `Object`

Returns the ngrams starting with the specified gram.

# File 'lib/raingrams/model.rb', line 301

def ngrams_starting_with(gram)
  ngram_set = NgramSet.new

  @prefixes.each do |prefix,table|
    if prefix.first == gram
      table.each_gram do |postfix_gram|
        ngram_set << (prefix + postfix_gram)
      end
    end
  end

  return ngram_set
end

#ngrams_with(&block) ⇒ `Object`

Selects the ngrams that match the given block.

# File 'lib/raingrams/model.rb', line 256

def ngrams_with(&block)
  selected_ngrams = NgramSet.new

  each_ngram do |ngram|
    selected_ngrams << ngram if block.call(ngram)
  end

  return selected_ngrams
end

#parse_sentence(sentence) ⇒ `Object`

Parses the specified sentence and returns an Array of tokens.

# File 'lib/raingrams/model.rb', line 163

def parse_sentence(sentence)
  sentence = sentence.to_s

  if @ignore_punctuation
    # eat tailing punctuation
    sentence.gsub!(/[\.\?!]*$/,'')
  end

  if @ignore_case
    # downcase the sentence
    sentence.downcase!
  end

  if @ignore_urls
    sentence.gsub!(/\s*\w+:\/\/[\w\/\+_\-,:%\d\.\-\?&=]*\s*/,' ')
  end

  if @ignore_phone_numbers
    # remove phone numbers
    sentence.gsub!(/\s*(\d-)?(\d{3}-)?\d{3}-\d{4}\s*/,' ')
  end

  if @ignore_references
    # remove RFC style references
    sentence.gsub!(/\s*[\(\{\[]\d+[\)\}\]]\s*/,' ')
  end

  if @ignore_punctuation
    # split and ignore punctuation characters
    return sentence.scan(/\w+[\-_\.:']\w+|\w+/)
  else
    # split and accept punctuation characters
    return sentence.scan(/[\w\-_,:;\.\?\!'"\\\/]+/)
  end
end

#parse_text(text) ⇒ `Object`

Parses the specified text and returns an Array of sentences.

# File 'lib/raingrams/model.rb', line 202

def parse_text(text)
  text = text.to_s

  if @ignore_urls
    text.gsub!(/\s*\w+:\/\/[\w\/\+_\-,:%\d\.\-\?&=]*\s*/,' ')
  end

  return text.scan(/[^\s\.\?!][^\.\?!]*[\.\?\!]/)
end

#refresh(&block) ⇒ `Object`

Refreshes the probability tables of the model.

# File 'lib/raingrams/model.rb', line 539

def refresh(&block)
  block.call(self) if block

  @prefixes.each_value { |table| table.build }
  return self
end

#save(path) ⇒ `Object`

Saves the model to the file at the specified path.

# File 'lib/raingrams/model.rb', line 568

def save(path)
  File.open(path,'w') do |file|
    Marshal.dump(self,file)
  end

  return self
end

#set_ngram_frequency(ngram, value) ⇒ `Object`

Sets the frequency of the specified ngram to the specified value.



478
479
480

# File 'lib/raingrams/model.rb', line 478

def set_ngram_frequency(ngram,value)
  probability_table(ngram).set_count(ngram.last,value)
end

#to_hash ⇒ `Object`

Returns a Hash representation of the model.



579
580
581

# File 'lib/raingrams/model.rb', line 579

def to_hash
  @prefixes
end

#train_with_file(path) ⇒ `Object`

Train the model with the contents of the specified path.



520
521
522

# File 'lib/raingrams/model.rb', line 520

def train_with_file(path)
  train_with_text(File.read(path))
end

#train_with_ngram(ngram) ⇒ `Object`

Train the model with the specified ngram.



485
486
487

# File 'lib/raingrams/model.rb', line 485

def train_with_ngram(ngram)
  probability_table(ngram).count(ngram.last)
end

#train_with_ngrams(ngrams) ⇒ `Object`

Train the model with the specified ngrams.



492
493
494

# File 'lib/raingrams/model.rb', line 492

def train_with_ngrams(ngrams)
  ngrams.each { |ngram| train_with_ngram(ngram) }
end

#train_with_paragraph(paragraph) ⇒ `Object`

Train the model with the specified paragraphs.



506
507
508

# File 'lib/raingrams/model.rb', line 506

def train_with_paragraph(paragraph)
  train_with_ngrams(ngrams_from_paragraph(paragraph))
end

#train_with_sentence(sentence) ⇒ `Object`

Train the model with the specified sentence.



499
500
501

# File 'lib/raingrams/model.rb', line 499

def train_with_sentence(sentence)
  train_with_ngrams(ngrams_from_sentence(sentence))
end

#train_with_text(text) ⇒ `Object`

Train the model with the specified text.



513
514
515

# File 'lib/raingrams/model.rb', line 513

def train_with_text(text)
  train_with_ngrams(ngrams_from_text(text))
end

#train_with_url(url) ⇒ `Object`

Train the model with the inner text of the paragraph tags at the specified url.

# File 'lib/raingrams/model.rb', line 528

def train_with_url(url)
  doc = Nokogiri::HTML(open(url))

  return doc.search('p').map do |p|
    train_with_paragraph(p.inner_text)
  end
end

Class: Raingrams::Model

Direct Known Subclasses

Instance Attribute Summary collapse

Class Method Summary collapse

Instance Method Summary collapse

Methods included from Helpers::Random

Methods included from Helpers::Commonality

Methods included from Helpers::Similarity

Methods included from Helpers::Probability

Methods included from Helpers::Frequency

Constructor Details

#initialize(options = {}, &block) ⇒ Model

Instance Attribute Details

#ignore_case ⇒ Object (readonly)

#ignore_phone_numbers ⇒ Object (readonly)

#ignore_punctuation ⇒ Object (readonly)

#ignore_references ⇒ Object (readonly)

#ignore_urls ⇒ Object (readonly)

#ngram_size ⇒ Object (readonly)

#prefixes ⇒ Object (readonly)

#starting_ngram ⇒ Object (readonly)

#stoping_ngram ⇒ Object (readonly)

Class Method Details

.build(options = {}, &block) ⇒ Object

.open(path) ⇒ Object

.train_with_file(path, options = {}) ⇒ Object

.train_with_paragraph(paragraph, options = {}) ⇒ Object

.train_with_text(text, options = {}) ⇒ Object

.train_with_url(url, options = {}) ⇒ Object

Instance Method Details

#build(&block) ⇒ Object

#clear ⇒ Object

#each_ngram(&block) ⇒ Object

#grams ⇒ Object

#grams_following(gram) ⇒ Object

#grams_preceeding(gram) ⇒ Object

#has_gram?(gram) ⇒ Boolean

#has_ngram?(ngram) ⇒ Boolean

#ngrams ⇒ Object

#ngrams_ending_with(gram) ⇒ Object

#ngrams_following(gram) ⇒ Object

#ngrams_from_fragment(fragment) ⇒ Object

#ngrams_from_sentence(sentence) ⇒ Object

#ngrams_from_text(text) ⇒ Object Also known as: ngrams_from_paragraph

#ngrams_from_words(words) ⇒ Object

#ngrams_including_all(*grams) ⇒ Object

#ngrams_including_any(*grams) ⇒ Object

#ngrams_postfixed_by(postfix) ⇒ Object

#ngrams_preceeding(gram) ⇒ Object

#ngrams_prefixed_by(prefix) ⇒ Object

#ngrams_starting_with(gram) ⇒ Object

#ngrams_with(&block) ⇒ Object

#parse_sentence(sentence) ⇒ Object

#parse_text(text) ⇒ Object

#refresh(&block) ⇒ Object

#save(path) ⇒ Object

#set_ngram_frequency(ngram, value) ⇒ Object

#to_hash ⇒ Object

#train_with_file(path) ⇒ Object

#train_with_ngram(ngram) ⇒ Object

#train_with_ngrams(ngrams) ⇒ Object

#train_with_paragraph(paragraph) ⇒ Object

#train_with_sentence(sentence) ⇒ Object

#train_with_text(text) ⇒ Object

#train_with_url(url) ⇒ Object