基于Tacotron1的中文语音合成 Text to Speech TTS

最新推荐文章于 2024-06-20 09:43:22 发布

m0_50619880

最新推荐文章于 2024-06-20 09:43:22 发布

阅读量352

点赞数

二、复现

wav_file = trn[:-4] + '.wav' 改为： wav_file = trn[:-4]

datasets 中 thchs30.py 29 行 'biaobei_48000' 替换成你训练资料的目录, 以data_thchs30可以换成data 或者train

trn_files = glob.glob(os.path.join(in_dir, 'biaobei_48000', '*.trn'))

trn_files = glob.glob(os.path.join(in_dir, 'data', '*.trn'))

注意：如果是以tarin为目录，train中的trn是指向data的文件路径，而非训练所需要的拼音，如果这部分没处理好会影响后续模型的训练。

如果是data_thchs30 可以将这一段替换原本代码，原本代码中有点小问题，没能提出到训练所需的拼音音标

def build_from_path(in_dir, out_dir, num_workers=1, tqdm=lambda x: x):
executor = ProcessPoolExecutor(max_workers=num_workers)
futures = []
index = 1
https://www.vocabulary.com/lists/6977351
trn_files = glob.glob(os.path.join(in_dir, 'data', '*.trn'))
for trn in trn_files:
with open(trn,encoding='utf-8') as f:
content = f.readline().strip('\n')
#print("content",content)
pinyin = f.readline().strip('\n')
wav_file = trn[:-4]# + '.wav'
https://www.vocabulary.com/lists/6974497
#print("pinyin",pinyin)
#print("wav_file",wav_file)
task = partial(_process_utterance, out_dir, index, wav_file, pinyin)
futures.append(executor.submit(task))
https://www.vocabulary.com/lists/6975993
index += 1
return [future.result() for future in tqdm(futures) if future.result() is not None]