Provided by: pocketsphinx-utils_0.8.0+real-0ubuntu6_amd64 bug

NAME

       pocketsphinx_batch - Run speech recognition in batch mode

SYNOPSIS

       pocketsphinx_batch -hmm hmmdir -dict dictfile [ options ]...

DESCRIPTION

       Run  speech  recognition  over  a  list  of  utterances in batchmode.  A list of arguments
       follows:

       -adcdev
              name for audio input (platform-specific)

       -adchdr
              Size of audio file header in bytes (headers are ignored)

       -adcin Input is raw audio data

       -agc   Automatic gain control for c0 ('max', 'emax', 'noise', or 'none')

       -agcthresh
              Initial threshold for automatic gain control

       -allphone
              Do phoneme recognition

       -alpha Preemphasis parameter

       -backtrace
              Print back trace of recognition results

       -beam  Beam width applied to every frame in Viterbi  search  (smaller  values  mean  wider
              beam)

       -bestpath
              Run bestpath (Dijkstra) search over word lattice (3rd pass)

       -bestpathlw
              Language model probability weight for bestpath search

       -cachesen
              Cache senone scores from first pass search

       -cep2spec
              Input is cepstral files, output is log spectral files

       -cepdir
              files directory (prefixed to filespecs in control file)

       -cepext
              Input files extension (prefixed to filespecs in control file)

       -ceplen
              Number of components in the input feature vector

       -cmn   Cepstral mean normalization scheme ('current', 'prior', or 'none')

       -cmninit
              Initial values (comma-separated) for cepstral mean when 'prior' is used

       -compallsen
              Compute  all  senone  scores  in  every  frame  (can  be faster when there are many
              senones)

       -ctl   file listing utterances to be processed

       -ctlcount
              No. of utterances to be processed (after skipping -ctloffset entries)

       -ctlincr
              Do every Nth line in the control file

       -ctloffset
              No. of utterances at the beginning of -ctl file to be skipped

       -dict  pronunciation dictionary (lexicon) input file

       -dither
              Add 1/2-bit noise

       -doublebw
              Use double bandwidth filters (same center freq)

       -dsratio
              Frame GMM computation downsampling ratio

       -fbtype
              FB Type of mel_scale or log_linear

       -fdict word pronunciation dictionary input file

       -feat  Feature stream type, depends on the acoustic model

       -fillpen
              Filler word transition penalty

       -frate Frame rate

       -fsg   state grammar

       -fsgbfs
              Force backtrace from FSG final state

       -fsgctlfn
              finite state grammar control file

       -fsgusealtpron
              Use alternative pronunciations for FSG

       -fsgusefiller
              (FSG Mode (Mode 2) only) Insert filler words at each state.

       -fwd3g Use trigrams in first pass search

       -fwdflat
              Run forward flat-lexicon search over word lattice (2nd pass)

       -fwdflatbeam
              Beam width applied to every frame in second-pass flat search

       -fwdflatefwid
              Minimum number of end frames for a word to be searched in fwdflat search

       -fwdflatlw
              Language model probability weight for flat lexicon (2nd pass) decoding

       -fwdflatsfwin
              Window of frames in lattice to search for successor words in fwdflat search

       -fwdflatwbeam
              Beam width applied to word exits in second-pass flat search

       -fwdtree
              Run forward lexicon-tree search (1st pass)

       -hmm   containing acoustic model files.

       -hyp   output file name

       -hypseg
              output with segmentation file name

       -input_endian
              Endianness of input data, big or little, ignored if NIST or MS Wav

       -kdmaxbbi
              Maximum number of Gaussians per leaf node in kd-Trees

       -kdmaxdepth
              Maximum depth of kd-Trees to use

       -kdtree
              file for Gaussian selection

       -latsize
              Lattice size

       -lifter
              Length of sin-curve for liftering, or 0 for no liftering.

       -live  Get input from audio hardware

       -lm    trigram language model input file

       -lmctl a set of language model

       The -hmm and -dict arguments are  always  required.   Either  -lm  or  -fsg  is  required,
       depending on whether you are using a statistical language model or a finite-state grammar.
       To do batchmode recognition, you will need to specify a control file, using -ctl This is a
       simple  text  file containing one entry per line.  Each entry is the name of an input file
       relative to the -cepdir directory, and without the filename extension (which is  given  in
       the -cepext argument).

       If  you are using acoustic feature files as input (see sphinx_fe(1) for information on how
       to generate these), you can also specify a subpart of a file, using the following format:

              FILENAME START-FRAME END-FRAME UTTERANCE-ID

AUTHOR

       Written by numerous people at CMU from 1994 onwards.  This manual page by  David  Huggins-
       Daines <dhuggins@cs.cmu.edu>

COPYRIGHT

       Copyright © 1994-2007 Carnegie Mellon University.  See the file COPYING included with this
       package for more information.

SEE ALSO

       pocketsphinx_continuous(1), sphinx_fe(1).

                                            2007-08-27                      POCKETSPHINX_BATCH(1)