The whole process of creating custom acoustic model is described here.
Read it thoroughly. If you are still not getting what are the required files and where to get them from this note is for you.
Given that structure is:
Following files could be built by lmtool web service :
Read it thoroughly. If you are still not getting what are the required files and where to get them from this note is for you.
Given that structure is:
- etc
- your_db.dic - Phonetic dictionary
- your_db.phone - Phoneset file
- your_db.lm.DMP - Language model
- your_db.filler - List of fillers
- your_db_train.fileids - List of files for training
- your_db_train.transcription - Transcription for training
- your_db_test.fileids - List of files for testing
- your_db_test.transcription - Transcription for testing
- wav
- speaker_1
- file_1.wav - Recording of speech utterance
- speaker_2
- file_2.wav
Following files could be built by lmtool web service :
- your_db.dic - Phonetic dictionary
- your_db.phone - Phoneset file
- your_db.filler - List of fillers
- your_db.lm.DMP - Language model
sphinx_lm_convert -i model.lm -o model.dmp
sphinx_lm_convert -i model.dmp -ifmt dmp -o model.lm -ofmt arpaAfter you've got that running, you should list all audio files that you want use for training and their matching phrases in remaining files:
- your_db_train.fileids - List of files for training
- your_db_train.transcription - Transcription for training
No comments:
Post a Comment