使用的数据集
THCHS30是Dong Wang, Xuewei Zhang, Zhiyong Zhang这几位大神发布的开放语音数据集,可用于开发中文语音识别系统。
为了感谢这几位大神,我是跪在电脑前写的本帖代码。
可以参考这个,tql: https://github.com/xxbb1234021/speech_recognition
下载中文语音数据集(5G+):
#coding: utf-8
import tensorflow as tf
import numpy as np
import os
from collections import Counter
import librosa
from joblib import Parallel, delayed
wav_path = 'data/wav/train'
label_file = 'data/doc/trans/train.word.txt'
def get_wav_files(wav_path = wav_path):
wav_files = []
for (dirpa