AU-AIR数据集json文件转YOLO格式

最新推荐文章于 2024-03-02 22:03:19 发布

薄荷味的风　

最新推荐文章于 2024-03-02 22:03:19 发布

阅读量1.6k

点赞数 18

文章标签： json YOLO python

本文链接：https://blog.csdn.net/weixin_45457604/article/details/135174519

版权

文章讲述了如何将AU-AIR数据集中的COCO标注转换为YOLO所需的txt格式，以便于模型训练。

摘要由CSDN通过智能技术生成

AU-AIR数据集是第一个用于目标检测的多模态无人机数据集,可用于小目标检测。

数据集下载地址：AU-AIR:无人机视觉检测数据集_数据集-飞桨AI Studio星河社区

数据集中给出的标签文件是json文件，在YOLO系列模型上运行需要转化为txt文件，可使用下列方法进行转化，亲测可用。

#将数据集转化为 YOLO 格式的数据集
#--json_path 输入的json文件路径
#--save_path 保存的文件夹名字，默认为当前目录下的labels。

import os
import json
from tqdm import tqdm
import argparse

parser = argparse.ArgumentParser()
#这里根据自己的json文件位置，换成自己的就行
parser.add_argument('--json_path', default='annotations.json',type=str, help="input: coco format(json)")#json文件地址
#这里设置.txt文件保存位置
parser.add_argument('--save_path', default='labels', type=str, help="specify where to save the output dir of labels")#保存label文件地址
arg = parser.parse_args()

def convert(size, box):
    dw = 1. / (size[0])
    dh = 1. / (size[1])
    x = box[0] + box[2] / 2.0
    y = box[1] + box[3] / 2.0
    w = box[2]
    h = box[3]
#round函数确定(xmin, ymin, xmax, ymax)的小数位数
    x = round(x * dw, 6)
    w = round(w * dw, 6)
    y = round(y * dh, 6)
    h = round(h * dh, 6)
    return (x, y, w, h)

if __name__ == '__main__':
json_file = arg.json_path # COCO Object Instance 类型的标注
ana_txt_save_path = arg.save_path # 保存的路径

    data = json.load(open(json_file, 'r'))
    if not os.path.exists(ana_txt_save_path):
        os.makedirs(ana_txt_save_path)

    list_file = open(os.path.join('dataSet_path/', 'trainval.txt'), 'w')#保存训练txt文件
    for img in tqdm(data['annotations']):
        filename = img["image_name"]
        img_width = img["image_width:"]#此处要有这个:，json文件中就有
        img_height = img["image_height"]
        #img_id = img["id"]
        head, tail = os.path.splitext(filename)
        ana_txt_name = head + ".txt" # 对应的txt名字，与jpg一致
        f_txt = open(os.path.join(ana_txt_save_path, ana_txt_name), 'w')
        bbox = img["bbox"]
        for i in range(len(bbox)):
            x = bbox[i]
            box = [bbox[i]["left"], bbox[i]["top"], bbox[i]["width"], bbox[i]["height"]]
            box_c = convert((img_width, img_height), box)
            f_txt.write("%s %s %s %s %s\n" % (bbox[i]["class"], box_c[0], box_c[1], box_c[2], box_c[3]))