sphinxtrain-ruby

Toolkit for training/adapting CMU Sphinx acoustic models.

The main goal is to help adapt existing acoustic models to a specific speaker or accent. Currently only the English Voxforge model is supported as a base. More precisely, the base is an adapted version of that model created by Peter Grasch in 2013 using the most up-to-date training data available at the time. I can confirm his results, at least for my accent (British English): a performance increase of a few percent over Voxforge 0.4.

Installation

Please note that Ruby >= 2.1 is required for this gem.
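
If you are unsure whether your interpreter meets this requirement, a quick check along these lines may help. This is only a sketch: the helper name ruby_version_supported? is hypothetical and not part of the gem. It relies on Gem::Version from RubyGems for the comparison.

```ruby
require 'rubygems'

# Hypothetical helper (not part of sphinxtrain-ruby): returns true when
# the given version string is at least the minimum stated in this README.
def ruby_version_supported?(version = RUBY_VERSION, minimum = '2.1')
  Gem::Version.new(version) >= Gem::Version.new(minimum)
end

if ruby_version_supported?
  puts "Ruby #{RUBY_VERSION} should be new enough for sphinxtrain-ruby."
else
  puts "Ruby #{RUBY_VERSION} is too old; version 2.1 or later is required."
end
```

Gem::Version compares version segments numerically, so a plain string comparison (which would rank "2.10" below "2.9") is avoided.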

Add this line to your application's Gemfile:

gem 'sphinxtrain-ruby'

And then execute:

$ bundle

Or install it yourself as:

$ gem install sphinxtrain-ruby

Usage

Run sphinxtrain-ruby from the command line and follow the instructions. See some example output here.

To Do

[ ] Add support for different data sets, not just the example from CMU Sphinx
[ ] Allow re-recording when mistakes are made
[ ] Refactor code and add specs
[ ] Consider using actual libraries rather than command-line tools for adapting the model
[ ] Make command-line wget downloads less verbose

Contributing

Fork it ( https://github.com/watsonbox/sphinxtrain-ruby/fork )
Create your feature branch (git checkout -b my-new-feature)
Commit your changes (git commit -am 'Add some feature')
Push to the branch (git push origin my-new-feature)
Create a new Pull Request