【Python】计算VOC格式XML文件中目标面积和长宽比并生成直方图

最新推荐文章于 2024-04-18 00:30:00 发布

YaoYee_7

最新推荐文章于 2024-04-18 00:30:00 发布

阅读量2.8k

点赞数 17

分类专栏： Python

本文链接：https://blog.csdn.net/YaoYee_21/article/details/112490344

版权

Python 专栏收录该内容

60 篇文章

订阅专栏

博客围绕目标检测精度问题展开，指出可对anchor进行参数优化，RPN网络生成的anchor影响检测精度，需统计数据中目标区域面积和长宽比。介绍了代码思路，包括遍历xml文件、定位坐标值、计算面积长宽比等，运行代码后在生成图片标题加入相关数据并配有进度条。

摘要生成于 C知道，由 DeepSeek-R1 满血版支持，前往体验 >

在这里插入图片描述

1.Introduction

最近目标检测的精度上不去，看看别人的文章，发现可以针对anchor进行参数优化，RPN网络生成的anchor数量与种类很大程度上影响着检测精度，anchor与检测目标越接近，检测精度越高。所以我们就需要统计下自己数据中目标区域的面积和长宽比。
在这里插入图片描述

2.Materials and methods

代码思路主要有：

（1）遍历文件夹中的xml文件
（2）定位xmin，xmax，ymin，ymax四个坐标值
（3）计算面积和长宽比，生成列表
（4）统计列表直方图

emm，没啥说的，上代码吧。

# -*- coding: utf-8 -*-
"""
Created on Sun Jan 10 21:48:48 2021

@author: YaoYee
"""
 
import os
import xml.etree.cElementTree as et
import numpy as np
import matplotlib.pyplot as plt
from tqdm import tqdm

 
path="C:/Users/YaoYee/Desktop/Annotations"
files=os.listdir(path)


area_list = []
ratio_list = []


def file_extension(path): 
    return os.path.splitext(path)[1] 
 

for xmlFile in tqdm(files, desc='Processing'): 
    if not os.path.isdir(xmlFile): 
        if file_extension(xmlFile) == '.xml':
            tree=et.parse(os.path.join(path,xmlFile))
            root=tree.getroot()
            filename=root.find('filename').text
            # print("--Filename is", xmlFile)
            
            for Object in root.findall('object'):
                bndbox=Object.find('bndbox')
                xmin=bndbox.find('xmin').text
                ymin=bndbox.find('ymin').text
                xmax=bndbox.find('xmax').text
                ymax=bndbox.find('ymax').text
                
                area = ( int(ymax)-int(ymin)) * (int(xmax)-int(xmin) )
                area_list.append(area)
                # print("Area is", area)
                
                ratio = ( int(ymax)-int(ymin)) / (int(xmax)-int(xmin) )
                ratio_list.append(ratio)
                # print("Ratio is", round(ratio,2))


square_array = np.array(area_list)
square_max = np.max(square_array)
square_min = np.min(square_array)
square_mean = np.mean(square_array)
square_var = np.var(square_array)
plt.figure(1)
plt.hist(square_array,20)
plt.xlabel('Area in pixel')
plt.ylabel('Frequency of area')
plt.title('Area\n' \
          +'max='+str(square_max)+', min='+str(square_min)+'\n' \
          +'mean='+str(int(square_mean))+', var='+str(int(square_var))
          )


ratio_array = np.array(ratio_list)
ratio_max = np.max(ratio_array)
ratio_min = np.min(ratio_array)
ratio_mean = np.mean(ratio_array)
ratio_var = np.var(ratio_array)
plt.figure(2)
plt.hist(ratio_array,20)
plt.xlabel('Ratio of length / width')
plt.ylabel('Frequency of ratio')
plt.title('Ratio\n' \
          +'max='+str(round(ratio_max,2))+', min='+str(round(ratio_min,2))+'\n' \
          +'mean='+str(round(ratio_mean,2))+', var='+str(round(ratio_var,2))
          )