czech-stemmer

Czech stemmer is pure Ruby port of CzechStemmer Java class from Lucene.

Installation

gem install czech-stemmer

Usage

require 'czech-stemmer'

CzechStemmer.stem("předseda") # => "předsd"
CzechStemmer.stem("mladými") # => "mlad"

Stemmer works only with lowercased letters in suffixes. Based on Lucene CzechStemmer with all test passed. Note the difference between stemming and lemmatization.

Copyright (c) 2014 Ondrej Odchazel. See LICENSE.txt for further details.