FasterRCNN画框小程序——VOC2007格式（python）

最新推荐文章于 2024-07-19 04:06:16 发布

SueLYS

最新推荐文章于 2024-07-19 04:06:16 发布

阅读量1.4k

点赞数

分类专栏： Python 深度学习

本文链接：https://blog.csdn.net/Suii_v5/article/details/73863362

版权

本文介绍了一种利用Python编写小程序的方法，来快速标注Faster R-CNN训练所需的数据。针对二值图像，通过查找像素值为255的区域，确定(x_min, y_min, x_max, y_max)坐标，并更新VOC2007格式的XML文件。作者在不足24小时的Python学习时间内完成了这个程序的编写，期待良好的标注效果。" 126326705,9349560,DaVinci Developer工具导入AUTOSAR XML实战,"['Davinci Dev', 'AutoSAR', 'XML工具', '系统描述']

摘要由CSDN通过智能技术生成

用Faster RCNN训练数据，手动标注好辛苦，好在我的数据是二值的，找到对应的像素值为255的(x_min,y_min,x_max,y_max)然后替换xml中的对应值就好了

首先要有一个VOC2007格式的xml文件，在这个基础上进行修改

学python的日子加起来不超过24小时，编这个小程序花了一天的时间，希望有个好结果。加油

# get the gt's x_min,y_min,x_max,y_max and replace the xml's values
# first traverse the image to find the pixel==255's position
# second find the x_min,y_min,x_max,y_max
# third read the xml file and replace the values
# by LYS 6/28/2017 
from PIL import Image,ImageDraw
import xml.etree.cElementTree as ET
import os

# according to the image's path to locate the position
def get_positions(image_path):
    im = Image.open(image_path).convert('L')#open the image and convert to gray image
    draw = ImageDraw.Draw(im)
    width = im.size[0]
    height = im.size[1]
    x = []
    y = []
    for w in range(0, width):
        for h in range(0, height):
	    pixel &