Converting COCO Dataset Annotations to the YOLO Training Format

Doctor_Wu_

Article link: https://blog.csdn.net/Boys_Wu/article/details/107015833

COCO to YOLO:

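In COCO datasets such as coco2017train or coco2017val, the position of each annotated object (category) is stored in the annotations as (x, y, width, height), where x, y is the top-left corner of the bbox and width, height are its width and height. The YOLO implementation used here (pytorch-YOLOv4, judging by the output path in the script) instead reads annotations for training and validation as (xmin, ymin, xmax, ymax), where xmin, ymin is the top-left corner of the bbox and xmax, ymax is the bottom-right corner. This differs from Darknet-style label files, which store normalized center coordinates, so the format described here applies to this implementation specifically.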
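
The relationship between the two box formats can be sketched as a small helper. This is a minimal illustration rather than the author's code: the function name `coco_to_corners` is hypothetical, and it assumes the input is the four-number `[x, y, width, height]` list found in the `bbox` field of a COCO annotation.

```python
def coco_to_corners(bbox):
    """Convert a COCO [x, y, width, height] box to (x_min, y_min, x_max, y_max)."""
    x, y, width, height = bbox
    # The top-left corner is unchanged; the bottom-right corner is the
    # top-left corner shifted by the box width and height.
    return x, y, x + width, y + height


# Made-up example box: top-left corner at (10, 20), 30 wide, 40 high.
print(coco_to_corners([10, 20, 30, 40]))  # (10, 20, 40, 60)
```

Real COCO boxes are usually floats, so a caller that needs integer pixel coordinates would still have to round or truncate the result.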

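We therefore convert the COCO annotation format (.json) to the YOLO format (.txt) and save the result. For example, the code for converting the val dataset is as follows: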


```python
import json
from collections import defaultdict

"""hyper parameters"""
json_file_path = '/home/wsy/data/coco2014/annotations/instances_val2014.json'
images_dir_path = '/home/wsy/data/coco2014/val2014/'
output_path = '/home/wsy/code/pytorch-YOLOv4/data/val.txt'

"""load json file"""
name_box_id = defaultdict(list)  # image path -> list of [bbox, class index]
with open(json_file_path, encoding='utf-8') as f:
    data = json.load(f)
    annotations = data['annotations']

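for ant in annotations:
    image_id = ant['image_id']  # renamed from `id` to avoid shadowing the builtin
    # Build the image path from images_dir_path instead of hard-coding it again
    name = images_dir_path + 'COCO_val2014_%012d.jpg' % image_id
    cat = ant['category_id']

    # COCO category ids run from 1 to 90 with gaps; shift each range so the
    # classes become contiguous indices 0-79.
    if cat >= 1 and cat <= 11: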
        cat = cat - 1
    elif cat >= 13 and cat <= 25:
        cat = cat - 2
    elif cat >= 27 and cat <= 28:
        cat = cat - 3
    elif cat >= 31 and cat <= 44:
        cat = cat - 5
    elif cat >= 46 and cat <= 65:
        cat = cat - 6
    elif cat == 67:
        cat = cat - 7
    elif cat == 70:
        cat = cat - 9
    elif cat >= 72 and cat <= 82:
        cat = cat - 10
    elif cat >= 84 and cat <= 90:
        cat = cat - 11

    name_box_id[name].append([ant['bbox'], cat])

"""write to txt"""
with open(output_path, 'w') as f:
    for key in name_box_id.keys():
        f.write(key)
        box_infos = name_box_id[key]
        for info in box_infos:
            x, y, w, h = info[0]
            x_min = int(x)
            y_min = int(y)
            # Add width/height before truncating so the corner is computed
            # from the original float values
            x_max = int(x + w)
            y_max = int(y + h)

            box_info = " %d,%d,%d,%d,%d" % (
                x_min, y_min, x_max, y_max, int(info[1]))
            f.write(box_info)
        f.write('\n')