Module: StanfordParser

Defined in:
lib/stanfordparser.rb

Overview

Defined Under Namespace

Classes: DefaultParser, DocumentPreprocessor, FeatureLabel, LexicalizedParser, StandoffDocumentPreprocessor, StandoffNode, StandoffParsedText, StandoffSentence, StandoffToken, Tree, Word

Constant Summary

VERSION =
"2.2.1"
EN_PENN_TREEBANK_TOKENIZER =

The default sentence segmenter and tokenizer. This is an English-language tokenizer with support for Penn Treebank markup.

"edu.stanford.nlp.process.PTBTokenizer"
ENGLISH_PCFG_MODEL =

Path to an English PCFG model that comes with the Stanford Parser. The location is given relative to the parser root directory (see ROOT). This is a valid value for the grammar parameter of the LexicalizedParser constructor.

"$(ROOT)/englishPCFG.ser.gz"
ROOT =

The root directory of the Stanford parser installation.

initialize_on_load
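
The value of ENGLISH_PCFG_MODEL contains the literal placeholder $(ROOT), so it has to be resolved against the parser root directory before it names a real file. The sketch below shows one way such a path could be expanded. It does not load the library: expand_root is a hypothetical helper and /opt/stanford-parser is an assumed installation directory. Neither is part of StanfordParser, and the library may resolve the placeholder differently.

```ruby
# Illustrative helper, NOT part of the StanfordParser module.
# Replaces each literal "$(ROOT)" in +path+ with +root+.
# The block form of gsub inserts +root+ verbatim, so backslashes in a
# Windows-style root are not treated as backreferences.
def expand_root(path, root)
  path.gsub('$(ROOT)') { root }
end

# Same form as the documented ENGLISH_PCFG_MODEL value.
model_template = '$(ROOT)/englishPCFG.ser.gz'

# Assumed installation directory, used here only for illustration.
example_root = '/opt/stanford-parser'

puts expand_root(model_template, example_root)
```

With the real library loaded, StanfordParser::ROOT would presumably take the place of the assumed directory.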