This part of the manual describes the procedure(s) for training acoustic
models using the Sphinx3 trainer. General training procedures are described
first, and followed by more detailed descriptions of the programs and scripts
used, and the analysis of their logs and other outputs.
BEFORE YOU TRAIN
THE GENERAL-PROCEDURE CHART |
Training chart for the
sphinx2 trainer
=========================
OBSOLETE
(The sphinx2 trainer is no longer used in CMU)
Training chart for the
sphinx3 trainer
=========================
type of model
|
----------------------------------
| |
CONTINUOUS SEMI-CONTINUOUS
| |
| vector-qunatization
| |
----------------------------------
|...make ci mdef
|...flat_initialize CI models
training CI models
|...make cd untied mdef
|...initialize
|
training CD untied models
|
|
|
decision tree building
|...prune trees
|...tie states
|...make cd tied mdef
training CD tied models
|
|
recursive ----------------------------------
gaussian splitting.. | |
continuous models semi-continuous models
| |
| |
----------- |
| | deleted interpolation
decode with ADAPT |
sphinx3 | |---ADAPT
decoder <------- | |
----------------
make cd tied mdef ... | .............|
with decode dict and | convert to
pruned trees | sphinx2
decode with |
sphinx3 |
decoder |
|
decode with
sphinx2
decoder
(currently opensource
and restricted to
working with sampling
rates 8khz and 16khz.
Once the s3 trainer is
released, this will have
to change to allow
people who train with
different sampling rates
to use this decoder)
back to index
Nenhum comentário:
Postar um comentário