Module: StanfordParser
- Defined in: lib/stanfordparser.rb
Overview
Wrapper for the Stanford Natural Language Parser.
Defined Under Namespace
Classes: DefaultParser, DocumentPreprocessor, FeatureLabel, LexicalizedParser, StandoffDocumentPreprocessor, StandoffNode, StandoffParsedText, StandoffSentence, StandoffToken, Tree, Word
Constant Summary
- VERSION = "2.2.1"
- EN_PENN_TREEBANK_TOKENIZER = "edu.stanford.nlp.process.PTBTokenizer"
The default sentence segmenter and tokenizer. This is an English-language tokenizer with support for Penn Treebank markup.
- ENGLISH_PCFG_MODEL = "$(ROOT)/englishPCFG.ser.gz"
Path to an English PCFG model that comes with the Stanford Parser. The location is relative to the parser root directory. This is a valid value for the grammar parameter of the LexicalizedParser constructor.
- ROOT = initialize_on_load
The root directory of the Stanford parser installation.
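The ENGLISH_PCFG_MODEL value contains a "$(ROOT)" placeholder, which suggests the path is resolved against ROOT before the model is loaded. The sketch below illustrates that idea only. It does not load the library: EXAMPLE_ROOT, EXAMPLE_PCFG_MODEL, and expand_root are hypothetical names introduced here, and the installation path is an assumption. How the library itself performs the substitution is not shown on this page.

```ruby
# Hypothetical stand-ins for the module constants. The real ROOT is set by
# the library at load time; this installation path is only an assumption.
EXAMPLE_ROOT = "/opt/stanford-parser"
EXAMPLE_PCFG_MODEL = "$(ROOT)/englishPCFG.ser.gz"

# Hypothetical helper: replace the literal "$(ROOT)" placeholder with a
# concrete root directory. A String pattern in gsub is matched literally,
# and the block form avoids backslash interpretation in the replacement.
def expand_root(path, root)
  path.gsub("$(ROOT)") { root }
end

model_path = expand_root(EXAMPLE_PCFG_MODEL, EXAMPLE_ROOT)
puts model_path
```

With the real library loaded, a resolved path of this kind is what the description above calls a valid value for the grammar parameter of the LexicalizedParser constructor.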