utils/run.pl : Perl script that runs multiple jobs in parallel and writes each job's output to a log file. Usage: utils/run.pl JOB=1:5 log command
run.sh : the main script, which handles part 1 and part 2.
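As a rough picture of what a call like utils/run.pl JOB=1:5 log command does, the sketch below emulates it in plain bash instead of calling Kaldi. It starts five background jobs, puts the job number where run.pl would substitute JOB, and sends each job's output to its own log. The log directory and the echo command are made up for illustration, and the real script does more than this (for example, it reports failed jobs).

```shell
#!/usr/bin/env bash
# Emulates: utils/run.pl JOB=1:5 log/demo.JOB.log echo "this is job JOB"
# (hypothetical log path and command; this does not call run.pl itself)
logdir=$(mktemp -d)

for job in 1 2 3 4 5; do
  # Run each job in the background, sending stdout and stderr to its own log.
  echo "this is job ${job}" > "${logdir}/demo.${job}.log" 2>&1 &
done

wait  # block until all five background jobs have finished

cat "${logdir}"/demo.*.log
```

Each log file ends up holding only the output of its own job, which is why a failed job can be traced back through its numbered log.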
Basic Structure of one model:
cmd.sh : Configures how the project's jobs are run. If SGE is used, use queue.pl; if a single PC is used, use run.pl; if NFS and ssh are configured, use ssh.pl and list the machines with mkdir .queue && touch .queue/machines && for name in all_names export; do echo ${name} >> .queue/machines; done; if a self-written .pl script is used, configure as for a single PC but replace the .pl script.
path.sh : Configures the Kaldi root path and PATH. If you prefer your own paths, specify them in this script.
RESULTS : saves the model results
local : holds all the scripts
conf : holds all the configuration files
steps, utils : commonly used scripts
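The two configuration files could look roughly like the sketch below for the single-PC case. The variable names train_cmd and decode_cmd follow the usual Kaldi recipe convention but are not taken from these notes, and /path/to/kaldi is a placeholder, so treat this as an assumption-laden example rather than the exact content of any recipe.

```shell
# --- cmd.sh (sketch) ---
# Single machine: run every job locally with run.pl.
export train_cmd="run.pl"
export decode_cmd="run.pl"
# On an SGE cluster, queue.pl would be used instead, e.g.:
# export train_cmd="queue.pl"

# --- path.sh (sketch) ---
# KALDI_ROOT is a placeholder; point it at your own Kaldi checkout.
export KALDI_ROOT="${KALDI_ROOT:-/path/to/kaldi}"
# Put the recipe's utils directory and Kaldi's binaries on PATH.
export PATH="${PWD}/utils:${KALDI_ROOT}/src/bin:${PATH}"
```

A run.sh would typically source both files at the top (. ./cmd.sh and . ./path.sh) so that later steps pick up the chosen job runner and paths.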
Chapter 3
Data pre-processing
Kaldi uses tables to store and look up objects. There are list tables and save tables.
List tables have the extension .scp. Each line looks like
name /root/absolute/name
or, on UNIX, uses a pipe like this:
name gunzip -c /root/absolute/name.gz |
The trailing | means the command is run and its stdout is read as the data. To send output into a program instead, put the | first, as in | program, and the program receives the data on its stdin. This can save memory. List tables are therefore more like an index.
Save tables have the extension .ark and hold the contents themselves, for example:
This is ark content 1
This is ark content 2
Save tables can also be stored in binary. When a table is written in binary mode ('b'), the marker is \0B. For binary tables, a list-table entry can add a byte offset and a slice: for example, :17[:,0:8] means start reading at byte 17 and take the first 9 dimensions of the data.
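To make the pipe form of a list-table entry concrete, here is a small bash sketch that imitates how such an entry is resolved. It does not use Kaldi at all: the file names, the key utt1, and the payload are invented, and the reader loop is only an approximation of what a Kaldi program does internally.

```shell
#!/usr/bin/env bash
workdir=$(mktemp -d)

# A gzip-compressed payload standing in for real data.
printf 'payload for utt1\n' | gzip -c > "${workdir}/utt1.gz"

# One list-table line: a key, then a command that ends with '|'.
printf 'utt1 gzip -dc %s/utt1.gz |\n' "${workdir}" > "${workdir}/demo.scp"

# Imitate the reader: split off the key, drop the trailing '|',
# run the remaining command and capture its stdout as the object.
while read -r key rest; do
  cmd=${rest%|}
  content=$(eval "${cmd}")
  echo "${key}: ${content}"
done < "${workdir}/demo.scp"
```

The list table stays tiny because it only stores the key and the recipe for obtaining the data, while the compressed file is decompressed on demand.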
Using tables for input and output: e.g. wav-to-duration, one of Kaldi's own programs, reads a table of audio files and generates spk2dur.ark (the time duration of each utterance).
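A call to wav-to-duration might be put together as in the sketch below, with a list table as input (scp:) and a text-mode save table as output (ark,t:). The data/train paths and the output name utt2dur are assumptions for illustration. The script only runs the program if it is installed and the input exists; otherwise it prints the command it would run.

```shell
#!/usr/bin/env bash
wav_scp=data/train/wav.scp   # hypothetical input list table
out=data/train/utt2dur       # hypothetical output file

# Input is read through the list table, output is written as a text archive.
cmd="wav-to-duration scp:${wav_scp} ark,t:${out}"

if command -v wav-to-duration >/dev/null 2>&1 && [ -f "${wav_scp}" ]; then
  ${cmd}
else
  echo "would run: ${cmd}"
fi
```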
Tags for tables:
Writing features:
1. Table types: `scp` or `ark`. If both types of table are generated at the same time, they must be given in the order `ark,scp`.
2. Binary mode: `b` saves the ark tables as binary; this is the default.
3. Text mode: `t` saves the ark tables as text.
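The write options listed above combine into a single output specifier, with the options before the colon and the file names after it. The sketch below only builds and prints such strings; the directory data/demo and the file names are hypothetical.

```shell
#!/usr/bin/env bash
out=data/demo   # hypothetical output directory

# Archive plus its index in one go: options and files both in ark,scp order.
both="ark,scp:${out}/feats.ark,${out}/feats.scp"
# Archive only; binary mode is the default, so no extra option is needed.
binary="ark:${out}/feats.ark"
# Archive in text mode, requested with the t option.
text="ark,t:${out}/feats.txt"

printf '%s\n' "${both}" "${binary}" "${text}"
```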