批量修改目标检测VOC格式XML标签文件的标签名称

Lunar*

已于 2024-06-19 10:57:42 修改

阅读量267

点赞数 3

分类专栏：目标检测文章标签：目标检测 xml 人工智能

于 2024-06-18 08:42:00 首次发布

本文链接：https://blog.csdn.net/qq_45141261/article/details/139759900

版权

目标检测专栏收录该内容

2 篇文章 0 订阅

订阅专栏

在目标检测任务中，可能会遇到需要修改标签名称的情况，例如修改标签类别或将多个不同的标签合并为一个。本文介绍一个Python脚本，可以批量修改VOC数据集中XML标签文件的标签名称。

功能说明

该脚本使用 xml.etree.ElementTree 库来解析和修改XML文件，通过遍历指定文件夹中的所有XML文件，将指定的标签名称替换为新的标签名称，并将修改后的文件保存到输出文件夹中。

代码实现

以下是实现该功能的代码：

import os
import xml.etree.ElementTree as ET
from tqdm import tqdm

def change_label_name(input_path, label_dict, output_path):
    """
    批量修改VOC数据集中XML标签文件的标签名称

    参数：
    - input_path: 原始XML文件夹路径
    - label_dict: 标签修改字典，键为原始标签，值为新标签
    - output_path: 修改后XML文件夹路径
    """
    xml_files = [f for f in os.listdir(input_path) if f.endswith('xml')]
    for file in tqdm(xml_files, desc="Processing XML files"):
        file_path = os.path.join(input_path, file)
        output_file_path = os.path.join(output_path, file)
        tree = ET.parse(file_path)
        root = tree.getroot()
        for obj in root.findall('object'):
            name = obj.find('name')
            if name is not None and name.text in label_dict:
                name.text = label_dict[name.text]
        tree.write(output_file_path, encoding='utf-8')

if __name__ == '__main__':
    input_path = '/path/to/original/xmls'  # 原始XML文件夹路径
    output_path = '/path/to/modified/xmls'  # 修改后XML文件夹路径

    # 标签字典，键为修改前的类别，值为修改后的类别
    label_dict = {
                  'old_class1': 'new_class1',
                  'old_class2': 'new_class2',
                  'old_class3': 'new_class3',
                  'old_class4': 'new_class4',
                  }  

    change_label_name(input_path, label_dict, output_path)