【语音数据】tgt和praatio实现长短格式转换

tektsy

已于 2024-01-19 15:16:15 修改

阅读量408

点赞数 9

分类专栏： praat与python 文章标签： python 开发语言音频

于 2024-01-19 15:15:23 首次发布

本文链接：https://blog.csdn.net/ndz2020/article/details/135693475

版权

praat与python 专栏收录该内容

1 篇文章 0 订阅

订阅专栏

背景

需要处理一批textgrid文件，实现长短格式转换（同时出于实验需要删掉一层tier）。
由于textgrid文件中的text存在换行符（\n），且长短格式混淆，导致tgt和praatio都没有办法直接读取。

解决方案

实践发现：

	读取长格式换行符	读取短格式换行符
tgt	N	Y
praatio	Y	N

在不改变库中源代码的前提下，使用了except实现将一批混合长短格式的textgrid文件批量转为短格式。
代码如下：

from praatio import textgrid as otg
import tgt
import glob
import loguru
import os


def process_by_praatio(path, new_path):
    tg = otg.openTextgrid(path, True)
    tg.removeTier('待删除tier名')
    tg.save(new_path, "short_textgrid", True)


def process_by_tgt(path, new_path):
    tg = tgt.read_textgrid(path)
    tg.delete_tier('待删除tier名')
    tgt.write_to_file(tg, new_path, format='short', encoding='utf-8')
    loguru.logger.success(f'File {path} processed successfully by tgt')


path = #输入待处理目录
new_path = #输入输出目录
if not os.path.exists(new_path):
    os.makedirs(new_path)

files = glob.glob(path + '/*.TextGrid')
for file in files:
    loguru.logger.info(f'Processing file {file}')
    new_filepath = new_path + '/' + os.path.basename(file)
    try:
        process_by_tgt(file, new_filepath)
    except IndexError:
        loguru.logger.error(f'File {file} failed to process by tgt; trying praatio')
        try:
            process_by_praatio(file, new_filepath)
        except:
            loguru.logger.error(f'File {file} failed to process by praatio')

tektsy

关注

9
点赞
踩
8

收藏

觉得还不错? 一键收藏
0
评论
【语音数据】tgt和praatio实现长短格式转换

需要处理一批textgrid文件，实现长短格式转换（同时出于实验需要删掉一层tier）。由于textgrid文件中的text存在换行符（\n），且长短格式混淆，导致tgt和praatio都没有办法直接读取。
复制链接

扫一扫

专栏目录