服务器和虚拟机怎么安装Kaldi?

9 篇文章 0 订阅
4 篇文章 0 订阅

前言

安装kaldi折腾十个小时,终于安装完成,踩了各种坑,好多问题网上查不到,我都怀疑实验室的服务器有问题了,最后还是在宏林师兄的帮忙下解决,感谢!学习语音道阻且长呀,大家一起努力一起进步吧!

克隆kaldi到本地

(如果你是用的实验室服务器,最好创建一个自己的文件夹,不要放在公用的地方)
在自己文件目录下输入

git clone https://github.com/kaldi-asr/kaldi.git kaldi --origin upstream

因为文件比较大,如果出现拷贝错误,可以通过特殊手段(说了过不了审),然后自己改成你懂得ip

git config --global http.proxy 'socks5://127.0.0.1:1080'
git config --global https.proxy 'socks5://127.0.0.1:1080'
git config --list

如果还是不行记得通过

git config --edit

修改,多试几次,肯定可以

tools/INSTALL安装

进入tools目录

cd tools/

查看当前目录的INSTALL安装文件

(base) [Sxd13StuA@gpu13 tools]$ cat INSTALL
To check the prerequisites for Kaldi, first run

  extras/check_dependencies.sh

and see if there are any system-level installations you need to do. Check the
output carefully. There are some things that will make your life a lot easier
if you fix them at this stage. If your system default C++ compiler is not
supported, you can do the check with another compiler by setting the CXX
environment variable, e.g.

  CXX=g++-4.8 extras/check_dependencies.sh

Then run

  make

which by default will install ATLAS headers, OpenFst, SCTK and sph2pipe.
OpenFst requires a relatively recent C++ compiler with C++11 support, e.g.
g++ >= 4.7, Apple clang >= 5.0 or LLVM clang >= 3.3. If your system default
compiler does not have adequate support for C++11, you can specify a C++11
compliant compiler as a command argument, e.g.

  make CXX=g++-4.8

If you have multiple CPUs and want to speed things up, you can do a parallel
build by supplying the "-j" option to make, e.g. to use 4 CPUs

  make -j 4

In extras/, there are also various scripts to install extra bits and pieces that
are used by individual example scripts.  If an example script needs you to run
one of those scripts, it will tell you what to do.

检查依赖extras/check_dependencies.sh

上述文件要求我们进入extras/目录下执行check_dependencies.sh脚本查看系统依赖是否满足需要。

(base) [Sxd13StuA@gpu13 tools]$ cd extras/
(base) [Sxd13StuA@gpu13 extras]$ ./check_dependencies.sh

如果出现这样的结果就是没有问题,但是我猜你一定会不会这么顺利

./check_dependencies.sh: WARNING python 2.7 is not the default python. We fixed this by adding a correct symlink more prominently on the path.
 ... If you really want to use python 3.6.10 as default, add an empty file /home/Sxd13/Sxd13StuA/zhudaobin/kaldi/tools/extras/python/.use_default_python and run this script again.
./check_dependencies.sh: all OK.

把四个依赖包放在tools/目录下

你先把这四个文件放在你的tools/目录下

在这里插入图片描述

我会给提供链接,你们可以免费下载,点我下载!

修改Makefile文件

然后更改Makefile文件

(base) [Sxd13StuA@gpu13 extras]$ cd ..
(base) [Sxd13StuA@gpu13 tools]$ vim Makefile 

WGETOPENFST_VERSION改成这样,如果你的openfst文件是其他版本,你就改成你自己的版本,改成这样,你会少出现很多bug(血泪教训)

WGET ?= wget--no-check-certificate
OPENFST_VERSION ?= 1.6.7

检查g++是否安装

输入g++ --version查看自己有没有安装g++,没安后面也执行不了,跟管理员要一下密码或者找管理员帮你安装一下g++

sudo apt install g++

默认安装的都是比较高的版本,不用担心版本问题。

(base) [Sxd13StuA@gpu13 tools]$ g++ --version
g++ (GCC) 7.3.0
Copyright © 2017 Free Software Foundation, Inc.
本程序是自由软件;请参看源代码的版权声明。本软件没有任何担保;
包括没有适销性和某一专用目的下的适用性担保。

这样就ok了
继续按照INSTALL的要求make,不过先看看你的服务器CPU核心数

(base) [Sxd13StuA@gpu13 tools]$ nproc
32

输出的数字表示你的服务器有多少核,决定了你的并行速度,不过照我的经验,最好不要用make -j 4这个指令,直接make就好了,不过make的话会非常非常慢,但是出错概率比make -j 4要小很多
(make -j 4中的4可以理解为几倍速,它是由计算机的CPU核心数决定的,也就是上面nproc查到的结果)
在这里插入图片描述

安装irstlm(非必须)

可能还会提示你没有安装irstlm,这个不是必须的,但是还是安了省心

(base) [Sxd13StuA@gpu13 tools]$ ./extras/install_irstlm.sh 
***() Installation of IRSTLM finished successfully
***() Please source the tools/extras/env.sh in your path.sh to enable it

安装后进入env.sh文件更改

(base) [Sxd13StuA@gpu13 tools]$ vim env.sh
export PATH=/home/你的文件目录/kaldi/tools/python:${PATH}
export IRSTLM=/home/你的文件目录/kaldi/tools/irstlm
export PATH=${PATH}:${IRSTLM}/bin

记得把文件目录改成你自己的目录。

(base) [Sxd13StuA@gpu13 tools]$ source env.sh 

Src/INSTALL安装

我们先进入Src查看INSTALL,看看怎么安装:

(base) [Sxd13StuA@gpu13 tools]$ cd ../src/
(base) [Sxd13StuA@gpu13 src]$ cat INSTALL 

These instructions are valid for UNIX-like systems (these steps have
been run on various Linux distributions; Darwin; Cygwin).  For native Windows
compilation, see ../windows/INSTALL.

You must first have completed the installation steps in ../tools/INSTALL
(compiling OpenFst; getting ATLAS and CLAPACK headers).

The installation instructions are

  ./configure --shared
  make depend -j 8
  make -j 8

Note that we added the "-j 8" to run in parallel because "make" takes a long
time.  8 jobs might be too many for a laptop or small desktop machine with not
many cores.

For more information, see documentation at http://kaldi-asr.org/doc/
and click on "The build process (how Kaldi is compiled)".

可以看出有三个必要指令:

  ./configure --shared
  make depend -j 8
  make -j 8

我们依次执行:

./configure --shared

(base) [Sxd13StuA@gpu13 src]$ ./configure --shared
Configuring KALDI to use MKL.
Backing up kaldi.mk to kaldi.mk.bak ...
Checking compiler g++ ...
Checking OpenFst library in /home/Sxd13/Sxd13StuA/zhudaobin/kaldi/tools/openfst-1.6.7 ...
Checking cub library in /home/Sxd13/Sxd13StuA/zhudaobin/kaldi/tools/cub-1.8.0 ...
Performing OS specific configuration ...
On Linux: Checking for linear algebra header files ...
Configuring MKL library directory: Found /opt/intel/mkl/lib/intel64
MKL libs MKL_LDLIBS = -L/opt/intel/mkl/lib/intel64 -Wl,-rpath=/opt/intel/mkl/lib/intel64 -l:libmkl_intel_lp64.so -l:libmkl_core.so -l:libmkl_sequential.so -ldl -lpthread -lm.
MKL compile flags MKL_CXXFLAGS = -I/opt/intel/mkl/include.
*** MKL self-reported version:
Intel(R) Math Kernel Library Version 2019.0.2 Product Build 20190118 for Intel(R) 64 architecture applications
Successfully configured for Linux with MKL libraries found in /opt/intel/mkl
Using CUDA toolkit /usr/local/cuda (nvcc compiler and runtime libraries)
INFO: Configuring Kaldi not to link with Speex. Don't worry, it's only needed if
      you intend to use 'compress-uncompress-speex', which is very unlikely.
Kaldi has been successfully configured. To compile:

  make -j clean depend; make -j <NCPU>

where <NCPU> is the number of parallel builds you can afford to do. If unsure,
use the smaller of the number of CPUs or the amount of RAM in GB divided by 2,
to stay within safe limits. 'make -j' without the numeric value may not limit
the number of parallel jobs at all, and overwhelm even a powerful workstation,
since Kaldi build is highly parallelized.

如果你前面都按照我说的做了,这里应该不会出现中文警告。
如果出现找不到openfst之类的错误,一定是你前面没安那四个包。

make depend和make

一种makefile的规则,通过扫描仪个目录下的所有C\C++ 代码,从而判断出文件之间的依赖关系,如a.cc文件中调用了b.h(如以形势include<b.h>),如果之后a.cc文件被改动,那 么只需要重新编译a.cc文件,不需要编译b.h文件。否则所有的文件都需要重新编译。

(base) [Sxd13StuA@gpu13 src]$ make depend
(base) [Sxd13StuA@gpu13 src]$ make

持续时间大约一个小时。
我们检测一下看看有没有安装成功:

测试egs/yesno/s5/的案例

(base) [Sxd13StuA@gpu13 src]$ cd ../egs/yesno/s5/
(base) [Sxd13StuA@gpu13 s5]$ ./run.sh 

出现以下内容就是安装成功了!

HCLGa is not stochastic
add-self-loops --self-loop-scale=0.1 --reorder=true exp/mono0a/final.mdl exp/mono0a/graph_tgpr/HCLGa.fst 
steps/decode.sh --nj 1 --cmd utils/run.pl exp/mono0a/graph_tgpr data/test_yesno exp/mono0a/decode_test_yesno
decode.sh: feature type is delta
steps/diagnostic/analyze_lats.sh --cmd utils/run.pl exp/mono0a/graph_tgpr exp/mono0a/decode_test_yesno
steps/diagnostic/analyze_lats.sh: see stats in exp/mono0a/decode_test_yesno/log/analyze_alignments.log
Overall, lattice depth (10,50,90-percentile)=(1,1,2) and mean=1.2
steps/diagnostic/analyze_lats.sh: see stats in exp/mono0a/decode_test_yesno/log/analyze_lattice_depth_stats.log
local/score.sh --cmd utils/run.pl data/test_yesno exp/mono0a/graph_tgpr exp/mono0a/decode_test_yesno
local/score.sh: scoring with word insertion penalty=0.0,0.5,1.0
%WER 0.00 [ 0 / 232, 0 ins, 0 del, 0 sub ] exp/mono0a/decode_test_yesno/wer_10_0.0
  • 2
    点赞
  • 10
    收藏
    觉得还不错? 一键收藏
  • 打赏
    打赏
  • 1
    评论
在Windows上安装Kaldi可能会比较复杂,因为Kaldi主要是针对Linux系统开发的。但是,你可以通过使用虚拟机或者Windows Subsystem for Linux(WSL)来在Windows上安装Kaldi。 其中,使用WSL可能是最简单的方法。你可以按照以下步骤来安装Kaldi: 1. 安装WSL。在Windows 10上,你可以通过“控制面板”->“程序”->“启用或关闭Windows功能”来启用WSL。启用后,在Microsoft Store中搜索“Ubuntu”并安装。 2. 打开Ubuntu,更新软件包列表:在Ubuntu终端中运行以下命令: ``` sudo apt-get update ``` 3. 安装Kaldi依赖项。在Ubuntu终端中运行以下命令: ``` sudo apt-get install -y g++ automake autoconf libtool subversion git zlib1g-dev libatlas-base-dev libatlas3-base gfortran liblapack-dev libblas-dev libffi-dev python3 python3-pip python3-yaml python3-six python3-setuptools ``` 4. 下载Kaldi。在Ubuntu终端中运行以下命令: ``` git clone https://github.com/kaldi-asr/kaldi.git ``` 5. 编译Kaldi。在Ubuntu终端中运行以下命令: ``` cd kaldi/tools ./extras/check_dependencies.sh make -j <num-jobs> cd ../src ./configure --shared --mathlib=ATLAS make depend -j <num-jobs> make -j <num-jobs> ``` 其中,<num-jobs>是编译使用的CPU核心数。 6. 安装完成后,你可以通过运行以下命令来测试Kaldi: ``` cd ../egs/yesno/s5 ./run.sh ``` 该命令会对一个简单的Yes/No语音识别任务进行训练和测试。 注意:由于Kaldi是一个非常庞大的软件包,它的编译和安装可能需要一定的时间和计算资源。另外,在Windows上使用WSL安装Kaldi可能会有一些限制和不便之处,建议在Linux操作系统上进行安装和使用。
评论 1
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包

打赏作者

Bean冷的心

你的鼓励将是我前进的动力~

¥1 ¥2 ¥4 ¥6 ¥10 ¥20
扫码支付:¥1
获取中
扫码支付

您的余额不足,请更换扫码支付或充值

打赏作者

实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值