Quick Notes: Sphinx4 custom acoustic model files notes.

The whole process of creating custom acoustic model is described here.
Read it thoroughly. If you are still not getting what are the required files and where to get them from this note is for you.

Given that structure is:

your_db.dic - Phonetic dictionary
your_db.phone - Phoneset file
your_db.lm.DMP - Language model
your_db.filler - List of fillers
your_db_train.fileids - List of files for training
your_db_train.transcription - Transcription for training
your_db_test.fileids - List of files for testing
your_db_test.transcription - Transcription for testing

speaker_1

file_1.wav - Recording of speech utterance

speaker_2

file_2.wav

Following files could be built by lmtool web service :

your_db.dic -

Phonetic dictionary

your_db.phone - Phoneset file
your_db.filler - List of fillers

after you've got those files ready, you'll need .DMP file:

your_db.lm.DMP - Language model

it is generated from .lm file with sphinx_lm_convert programm which is shipped with sphinxbase-7.0 archive. See this section on installation instructions of sphinxbase. You should use following commands to generate this file:

 sphinx_lm_convert -i model.lm -o model.dmp

sphinx_lm_convert -i model.dmp -ifmt dmp -o model.lm -ofmt arpa

After you've got that running, you should list all audio files that you want use for training and their matching phrases in remaining files:

your_db_train.fileids -

List of files for training

your_db_train.transcription - Transcription for training

Quick Notes

Tuesday, 1 May 2012

Sphinx4 custom acoustic model files notes.

No comments:

Post a Comment