Below is the output I get from pocketsphinx. I am using the following models:
Mandarin language model: zh_broadcastnews_64000_utf8.DMP, zh_broadcastnews_64000_utf8.dic
Mandarin Broadcast News acoustic models: zh_broadcastnews_16k_ptm256_8000.tar.bz2
Then I got the following log:
I:\>cd I:\3000\pocketsphinx-0.7-win32
I:\3000\pocketsphinx-0.7-win32>pocketsphinx_continuous.exe pocketsphinx.args
INFO: cmd_ln.c(559): Parsing command line:
\
-hmm I:/3000/pocketsphinx-0.7-win32/model_zh/zh_broadcastnews_ptm256_800
0 \
-lm I:/3000/pocketsphinx-0.7-win32/model_zh/zh_broadcastnews_64000_utf8.
DMP \
-dict I:/3000/pocketsphinx-0.7-win32/model_zh/zh_broadcastnews_utf8.dic
Current configuration:
[NAME] [DEFLT] [VALUE]
-adcdev
-agc none none
-agcthresh 2.0 2.000000e+000
-alpha 0.97 9.700000e-001
-argfile
-ascale 20.0 2.000000e+001
-aw 1 1
-backtrace no no
-beam 1e-48 1.000000e-048
-bestpath yes yes
-bestpathlw 9.5 9.500000e+000
-bghist no no
-ceplen 13 13
-cmn current current
-cmninit 8.0 8.0
-compallsen no no
-debug 0
-dict I:/3000/pocketsphinx-0.7-win32/model_zh/zh_broad
castnews_utf8.dic
-dictcase no no
-dither no no
-doublebw no no
-ds 1 1
-fdict
-feat 1s_c_d_dd 1s_c_d_dd
-featparams
-fillprob 1e-8 1.000000e-008
-frate 100 100
-fsg
-fsgusealtpron yes yes
-fsgusefiller yes yes
-fwdflat yes yes
-fwdflatbeam 1e-64 1.000000e-064
-fwdflatefwid 4 4
-fwdflatlw 8.5 8.500000e+000
-fwdflatsfwin 25 25
-fwdflatwbeam 7e-29 7.000000e-029
-fwdtree yes yes
-hmm I:/3000/pocketsphinx-0.7-win32/model_zh/zh_broad
castnews_ptm256_8000
-infile
-input_endian little little
-jsgf
-kdmaxbbi -1 -1
-kdmaxdepth 0 0
-kdtree
-latsize 5000 5000
-lda
-ldadim 0 0
-lextreedump 0 0
-lifter 0 0
-lm I:/3000/pocketsphinx-0.7-win32/model_zh/zh_broad
castnews_64000_utf8.DMP
-lmctl
-lmname default default
-logbase 1.0001 1.000100e+000
-logfn
-logspec no no
-lowerf 133.33334 1.333333e+002
-lpbeam 1e-40 1.000000e-040
-lponlybeam 7e-29 7.000000e-029
-lw 6.5 6.500000e+000
-maxhmmpf -1 -1
-maxnewoov 20 20
-maxwpf -1 -1
-mdef
-mean
-mfclogdir
-min_endfr 0 0
-mixw
-mixwfloor 0.0000001 1.000000e-007
-mllr
-mmap yes yes
-ncep 13 13
-nfft 512 512
-nfilt 40 40
-nwpen 1.0 1.000000e+000
-pbeam 1e-48 1.000000e-048
-pip 1.0 1.000000e+000
-pl_beam 1e-10 1.000000e-010
-pl_pbeam 1e-5 1.000000e-005
-pl_window 0 0
-rawlogdir
-remove_dc no no
-round_filters yes yes
-samprate 16000 1.600000e+004
-seed -1 -1
-sendump
-senlogdir
-senmgau
-silprob 0.005 5.000000e-003
-smoothspec no no
-svspec
-time no no
-tmat
-tmatfloor 0.0001 1.000000e-004
-topn 4 4
-topn_beam 0 0
-toprule
-transform legacy legacy
-unit_area yes yes
-upperf 6855.4976 6.855498e+003
-usewdphones no no
-uw 1.0 1.000000e+000
-var
-varfloor 0.0001 1.000000e-004
-varnorm no no
-verbose no no
-warp_params
-warp_type inverse_linear inverse_linear
-wbeam 7e-29 7.000000e-029
-wip 0.65 6.500000e-001
-wlen 0.025625 2.562500e-002
INFO: cmd_ln.c(559): Parsing command line:
\
-alpha 0.97 \
-doublebw no \
-nfilt 40 \
-ncep 13 \
-lowerf 133.33334 \
-upperf 6855.4976 \
-nfft 512 \
-wlen 0.0256 \
-transform legacy \
-feat s2_4x \
-agc none \
-cmn current \
-varnorm no
Current configuration:
[NAME] [DEFLT] [VALUE]
-agc none none
-agcthresh 2.0 2.000000e+000
-alpha 0.97 9.700000e-001
-ceplen 13 13
-cmn current current
-cmninit 8.0 8.0
-dither no no
-doublebw no no
-feat 1s_c_d_dd s2_4x
-frate 100 100
-input_endian little little
-lda
-ldadim 0 0
-lifter 0 0
-logspec no no
-lowerf 133.33334 1.333333e+002
-ncep 13 13
-nfft 512 512
-nfilt 40 40
-remove_dc no no
-round_filters yes yes
-samprate 16000 1.600000e+004
-seed -1 -1
-smoothspec no no
-svspec
-transform legacy legacy
-unit_area yes yes
-upperf 6855.4976 6.855498e+003
-varnorm no no
-verbose no no
-warp_params
-warp_type inverse_linear inverse_linear
-wlen 0.025625 2.560000e-002
INFO: acmod.c(242): Parsed model-specific feature parameters from I:/3000/pocket
sphinx-0.7-win32/model_zh/zh_broadcastnews_ptm256_8000/feat.params
INFO: feat.c(697): Initializing feature stream to type: 's2_4x', ceplen=13, CMN=
'current', VARNORM='no', AGC='none'
INFO: cmn.c(142): mean[0]= 12.00, mean[1..12]= 0.0
INFO: mdef.c(520): Reading model definition: I:/3000/pocketsphinx-0.7-win32/mode
l_zh/zh_broadcastnews_ptm256_8000/mdef
INFO: bin_mdef.c(173): Allocating 68760 * 8 bytes (537 KiB) for CD tree
INFO: tmat.c(205): Reading HMM transition probability matrices: I:/3000/pocketsp
hinx-0.7-win32/model_zh/zh_broadcastnews_ptm256_8000/transition_matrices
INFO: acmod.c(117): Attempting to use SCHMM computation module
INFO: ms_gauden.c(198): Reading mixture gaussian parameter: I:/3000/pocketsphinx
-0.7-win32/model_zh/zh_broadcastnews_ptm256_8000/means
INFO: ms_gauden.c(292): 70 codebook, 4 feature, size:
INFO: ms_gauden.c(294): 256x12
INFO: ms_gauden.c(294): 256x24
INFO: ms_gauden.c(294): 256x3
INFO: ms_gauden.c(294): 256x12
INFO: ms_gauden.c(198): Reading mixture gaussian parameter: I:/3000/pocketsphinx
-0.7-win32/model_zh/zh_broadcastnews_ptm256_8000/variances
INFO: ms_gauden.c(292): 70 codebook, 4 feature, size:
INFO: ms_gauden.c(294): 256x12
INFO: ms_gauden.c(294): 256x24
INFO: ms_gauden.c(294): 256x3
INFO: ms_gauden.c(294): 256x12
INFO: ms_gauden.c(354): 24440 variance values floored
INFO: acmod.c(119): Attempting to use PTHMM computation module
INFO: ms_gauden.c(198): Reading mixture gaussian parameter: I:/3000/pocketsphinx
-0.7-win32/model_zh/zh_broadcastnews_ptm256_8000/means
INFO: ms_gauden.c(292): 70 codebook, 4 feature, size:
INFO: ms_gauden.c(294): 256x12
INFO: ms_gauden.c(294): 256x24
INFO: ms_gauden.c(294): 256x3
INFO: ms_gauden.c(294): 256x12
INFO: ms_gauden.c(198): Reading mixture gaussian parameter: I:/3000/pocketsphinx
-0.7-win32/model_zh/zh_broadcastnews_ptm256_8000/variances
INFO: ms_gauden.c(292): 70 codebook, 4 feature, size:
INFO: ms_gauden.c(294): 256x12
INFO: ms_gauden.c(294): 256x24
INFO: ms_gauden.c(294): 256x3
INFO: ms_gauden.c(294): 256x12
INFO: ms_gauden.c(354): 24440 variance values floored
INFO: ptm_mgau.c(472): Loading senones from dump file I:/3000/pocketsphinx-0.7-w
in32/model_zh/zh_broadcastnews_ptm256_8000/sendump
INFO: ptm_mgau.c(496): BEGIN FILE FORMAT DESCRIPTION
INFO: ptm_mgau.c(559): Rows: 256, Columns: 8210
INFO: ptm_mgau.c(591): Using memory-mapped I/O for senones
INFO: ptm_mgau.c(834): Maximum top-N: 4
INFO: dict.c(306): Allocating 101599 * 20 bytes (1984 KiB) for word entries
INFO: dict.c(321): Reading main dictionary: I:/3000/pocketsphinx-0.7-win32/model
_zh/zh_broadcastnews_utf8.dic
INFO: dict.c(212): Allocated 737 KiB for strings, 977 KiB for phones
INFO: dict.c(324): 97495 words read
INFO: dict.c(330): Reading filler dictionary: I:/3000/pocketsphinx-0.7-win32/mod
el_zh/zh_broadcastnews_ptm256_8000/noisedict
INFO: dict.c(212): Allocated 0 KiB for strings, 0 KiB for phones
INFO: dict.c(333): 8 words read
INFO: dict2pid.c(396): Building PID tables for dictionary
INFO: dict2pid.c(404): Allocating 70^3 * 2 bytes (669 KiB) for word-initial trip
hones
INFO: dict2pid.c(131): Allocated 59080 bytes (57 KiB) for word-final triphones
INFO: dict2pid.c(195): Allocated 59080 bytes (57 KiB) for single-phone word trip
hones
INFO: ngram_model_arpa.c(77): No \data\ mark in LM file
INFO: ngram_model_dmp.c(142): Will use memory-mapped I/O for LM file
INFO: ngram_model_dmp.c(196): ngrams 1=63944, 2=16600781, 3=20708460
INFO: ngram_model_dmp.c(242): 63944 = LM.unigrams(+trailer) read
INFO: ngram_model_dmp.c(291): 16600781 = LM.bigrams(+trailer) read
INFO: ngram_model_dmp.c(317): 20708460 = LM.trigrams read
INFO: ngram_model_dmp.c(342): 32337 = LM.prob2 entries read
INFO: ngram_model_dmp.c(362): 24468 = LM.bo_wt2 entries read
INFO: ngram_model_dmp.c(382): 27937 = LM.prob3 entries read
INFO: ngram_model_dmp.c(410): 32424 = LM.tseg_base entries read
INFO: ngram_model_dmp.c(466): 63944 = ascii word strings read
INFO: ngram_search_fwdtree.c(99): 476 unique initial diphones
INFO: ngram_search_fwdtree.c(147): 0 root, 0 non-root channels, 122 single-phone
words
INFO: ngram_search_fwdtree.c(186): Creating search tree
INFO: ngram_search_fwdtree.c(191): before: 0 root, 0 non-root channels, 122 sing
le-phone words
INFO: ngram_search_fwdtree.c(326): after: max nonroot chan increased to 75539
INFO: ngram_search_fwdtree.c(338): after: 461 root, 75411 non-root channels, 27
single-phone words
INFO: ngram_search_fwdflat.c(156): fwdflat: min_ef_width = 4, max_sf_win = 25
INFO: continuous.c(367): pocketsphinx_continuous.exe COMPILED ON: Apr 16 2011, A
T: 02:51:40
Allocating 32 buffers of 2500 samples each
READY....
Listening...
Stopped listening, please wait...
000000000: 爱
READY....
Allocating 32 buffers of 2500 samples each
READY....
My problem is that every time I run the program I get only one result. For example, only the word 爱 is recognized, and then the program stops there.
I am not sure what is happening.
I also tried the Linux version. After compiling all the code, I got the following error:
ad_oss.c(103): Failed to open audio device(/dev/dsp): No such file or directory
FATAL_ERROR: "continuous.c", line 242: Failed top open audio device
I installed oss_compat as described in the FAQ, but some of the modules could not be found. This is the same problem as the one described at http://sourceforge.net/projects/cmusphinx/forums/forum/5471/topic/5164223?message=11341904. I hope someone can give me an answer.
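
In case it helps anyone reproduce the Linux failure, here is a minimal sketch of how the device paths could be checked before starting the recognizer. It is only a diagnostic and not part of pocketsphinx. The function name find_audio_devices is made up for illustration, /dev/dsp is taken from the error message above, and /dev/snd is an assumption about where ALSA usually exposes its devices.

```python
import os


def find_audio_devices(candidates=("/dev/dsp", "/dev/snd")):
    """Return the candidate device paths that exist on this machine.

    The default candidates are assumptions: /dev/dsp is the OSS device
    named in the error message, /dev/snd is the usual ALSA directory.
    """
    return [path for path in candidates if os.path.exists(path)]


if __name__ == "__main__":
    found = find_audio_devices()
    if "/dev/dsp" not in found:
        # Matches the "No such file or directory" part of the error.
        print("/dev/dsp is missing, so an OSS-based build cannot open it")
    print("existing device paths:", found)
```

If /dev/dsp is missing, the "No such file or directory" message is expected from a build that uses the OSS backend.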