sphinxtrain-ruby

Toolkit for training/adapting CMU Sphinx acoustic models.

The main goal is to help adapt existing acoustic models to a specific speaker or accent. Currently only the English Voxforge model is supported as a base. More precisely, the base is an adapted version of that model, created by Peter Grasch in 2013 using the most up-to-date training data available at the time. I can confirm his result of a few percent performance increase over Voxforge 0.4, at least for my accent (British English).

Installation

Please note that Ruby >= 2.1 is required for this gem.
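
If you are unsure whether your interpreter meets this requirement, a quick check along the following lines may help. This is only a sketch: `ruby_version_supported?` is a hypothetical helper written for this example and is not part of sphinxtrain-ruby. It compares versions with `Gem::Version` rather than as plain strings, so that something like 2.10 is not mistaken for being older than 2.1.

```ruby
require 'rubygems' # normally loaded already; provides Gem::Version

# Hypothetical helper (not part of sphinxtrain-ruby): returns true when
# `current` is at least `minimum`, using version-aware comparison.
def ruby_version_supported?(current = RUBY_VERSION, minimum = '2.1')
  Gem::Version.new(current) >= Gem::Version.new(minimum)
end

supported = ruby_version_supported?
if supported
  puts "Ruby #{RUBY_VERSION} meets the >= 2.1 requirement"
else
  puts "Ruby #{RUBY_VERSION} is older than 2.1; please upgrade first"
end
```

Save it to a file and run it with `ruby`, or paste it into `irb`.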

Add this line to your application's Gemfile:

gem 'sphinxtrain-ruby'

And then execute:

$ bundle

Or install it yourself as:

$ gem install sphinxtrain-ruby

Usage

Run sphinxtrain-ruby from the command line and follow the instructions. See some example output here.

To Do

  • [ ] Add support for different data sets, not just the example from CMU Sphinx
  • [ ] Allow re-recording when mistakes are made
  • [ ] Refactor code and add specs
  • [ ] Consider using actual libraries rather than command line tools for adapting the model
  • [ ] Make command line wget downloads less verbose

Contributing

  1. Fork it ( https://github.com/watsonbox/sphinxtrain-ruby/fork )
  2. Create your feature branch (git checkout -b my-new-feature)
  3. Commit your changes (git commit -am 'Add some feature')
  4. Push to the branch (git push origin my-new-feature)
  5. Create a new Pull Request