PaddleSpeech调研、安装、使用

最新推荐文章于 2024-12-17 20:36:18 发布

little___ant

最新推荐文章于 2024-12-17 20:36:18 发布

阅读量1.9k

点赞数

文章标签： python asr paddlespeech

本文链接：https://blog.csdn.net/little___ant/article/details/128938426

版权

PaddleSpeech概述

PaddleSpeech asr 模块目前只支持中英文的语音自动识别，建议在Linux环境下安装和使用。

配置环境要求：

gcc >= 4.8.5
paddlepaddle >= 2.4.1
python >= 3.7
PaddleSpeech文档链接：https://github.com/PaddlePaddle/PaddleSpeech

PaddleSpeech asr 模块仅能识别wav文件，且该音频文件需要采样率为16kHz

PaddleSpeech安装

PaddleSpeech安装链接：https://paddlespeech.readthedocs.io/en/latest/install.html#easy-get-the-basic-function-support-linux-mac-and-windows

PaddleSpeech安装有三种方法，本次调研使用easy方式

PaddleSpeech安装步骤

注：本次调研安装均在docker容器中完成

1.Linux环境中下载 miniconda ，方便管理资源包

wget https://repo.continuum.io/miniconda/Miniconda3-latest-Linux-x86_64.sh
bash Miniconda3-latest-Linux-x86_64.sh

2.根据安装目录，配置miniconda环境变量

export PATH=/root/miniconda3/bin:$PATH

此方法为一次性配置环境变量，再次打开terminal需要重新配置，也可使用

vim ~/.bash_profile
source ~/.bashrc

在用户级别下永久设置环境变量

3.配置gcc

apt install build-essential

4.安装PaddleSpeech，注意使用清华源

pip install pytest-runner -i https://pypi.tuna.tsinghua.edu.cn/simple
pip install paddlespeech -i https://pypi.tuna.tsinghua.edu.cn/simple

PaddleSpeech使用

以command line 使用：

中文转录：

paddlespeech asr --lang zh --input zh.wav

英文转录（分别使用两种模型）：

paddlespeech asr --lang en  --model transformer_librispeech --input en_test.wav
paddlespeech asr --lang en  --model deepspeech2offline_librispeech--input en_test.wav

以python API 使用：

from paddlespeech.cli.asr.infer import ASRExecutor
asr = ASRExecutor()
result = asr(audio_file="zh.wav")
print(result)