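1 Introduction to the DOTA dataset
DOTA stands for "Dataset for Object deTection in Aerial images".
DOTA v1.0 contains 2,806 aerial images, ranging in size from about 800 × 800 to about 4000 × 4000 pixels, with 188,282 annotated object instances in total.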
- DOTA paper: https://arxiv.org/pdf/1711.10398.pdf
- Dataset website: https://captain-whu.github.io/DOTA/dataset.html
DOTA has three versions:
- DOTA-v1.0
Number of categories: 15
Category names: plane, ship, storage tank, baseball diamond, tennis court, basketball court, ground track field, harbor, bridge, large vehicle, small vehicle, helicopter, roundabout, soccer ball field, swimming pool
- DOTA-v1.5
Number of categories: 16
Category names: plane, ship, storage tank, baseball diamond, tennis court, basketball court, ground track field, harbor, bridge, large vehicle, small vehicle, helicopter, roundabout, soccer ball field, swimming pool, container crane
- DOTA-v2.0
Number of categories: 18
Category names: plane, ship, storage tank, baseball diamond, tennis court, basketball court, ground track field, harbor, bridge, large vehicle, small vehicle, helicopter, roundabout, soccer ball field, swimming pool, container crane, airport, helipad
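When writing training or augmentation scripts, it can help to keep the category lists for the three versions in one place. The following is a minimal sketch, not an official API: the constant names and the `class_to_index` helper are hypothetical, the index order simply follows the order listed above, and the hyphenated spelling is an assumption based on how category names appear in the sample annotations in section 2 (e.g. `small-vehicle`).

```python
# Category lists per DOTA version (names and ordering are illustrative assumptions).
DOTA_V1_0_CLASSES = [
    "plane", "ship", "storage-tank", "baseball-diamond", "tennis-court",
    "basketball-court", "ground-track-field", "harbor", "bridge",
    "large-vehicle", "small-vehicle", "helicopter", "roundabout",
    "soccer-ball-field", "swimming-pool",
]
# v1.5 adds one category to v1.0; v2.0 adds two more to v1.5.
DOTA_V1_5_CLASSES = DOTA_V1_0_CLASSES + ["container-crane"]
DOTA_V2_0_CLASSES = DOTA_V1_5_CLASSES + ["airport", "helipad"]

DOTA_CLASSES = {
    "v1.0": DOTA_V1_0_CLASSES,
    "v1.5": DOTA_V1_5_CLASSES,
    "v2.0": DOTA_V2_0_CLASSES,
}


def class_to_index(version: str) -> dict:
    """Map each category name of the given version to an integer id."""
    return {name: i for i, name in enumerate(DOTA_CLASSES[version])}


for version, names in DOTA_CLASSES.items():
    print(version, len(names))
```

Because each version only appends categories to the previous one, an index map built this way stays compatible across versions for the shared categories.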
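2 Labels
Before applying data augmentation to the dataset, we need to know the format of the label files.
Each object is described by 10 fields on one line: the first 8 are the coordinates (x1 y1 x2 y2 x3 y3 x4 y4) of the four corners of the box, the 9th is the object category, and the 10th is the difficulty flag, where 0 means easy and 1 means difficult.
Below is an example of such a file: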
950.0 851.0 931.0 852.0 932.0 817.0 952.0 817.0 small-vehicle 1
475.0 982.0 456.0 982.0 461.0 841.0 481.0 842.0 large-vehicle 0
424.0 978.0 400.0 982.0 403.0 840.0 426.0 839.0 large-vehicle 0
395.0 984.0 373.0 985.0 376.0 842.0 399.0 843.0 large-vehicle 0
365.0 979.0 344.0 978.0 346.0 839.0 369.0 838.0 large-vehicle 0
337.0 977.0 317.0 977.0 321.0 836.0 339.0 835.0 large-vehicle 0
310.0 978.0 287.0 979.0 286.0 838.0 311.0 838.0 large-vehicle 0
154.0 947.0 250.0 947.0 250.0 971.0 154.0 971.0 large-vehicle 0
140.0 894.0 255.0 894.0 255.0 919.0 140.0 919.0 large-vehicle 0
116.0 862.0 236.0 862.0 236.0 888.0 116.0 888.0 large-vehicle 0
146.0 771.0 269.0 771.0 269.0 796.0 146.0 796.0 large-vehicle 0
136.0 741.0 271.0 741.0 271.0 766.0 136.0 766.0 large-vehicle 0
136.0 713.0 271.0 713.0 271.0 735.0 136.0 735.0 large-vehicle 0
As can be seen, each label is a rotated box defined by four corner points.
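The 10-field layout described above can be read with a few lines of Python. The following is only a sketch under stated assumptions: `DotaObject`, `parse_dota_labels`, and `horizontal_bbox` are hypothetical names introduced for illustration, and any line that does not have exactly 10 whitespace-separated fields (for example a metadata header, if a file has one) is simply skipped.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class DotaObject:
    corners: List[Tuple[float, float]]  # four (x, y) corner points
    category: str                       # e.g. "large-vehicle"
    difficult: int                      # 0 = easy, 1 = difficult


def parse_dota_labels(text: str) -> List[DotaObject]:
    """Parse DOTA-style label text into a list of objects."""
    objects = []
    for line in text.splitlines():
        fields = line.split()
        if len(fields) != 10:
            continue  # assumption: non-annotation lines are ignored
        try:
            coords = [float(v) for v in fields[:8]]
            difficult = int(fields[9])
        except ValueError:
            continue  # skip lines whose numeric fields do not parse
        # Pair up x and y values: (x1, y1), (x2, y2), (x3, y3), (x4, y4)
        corners = list(zip(coords[0::2], coords[1::2]))
        objects.append(DotaObject(corners, fields[8], difficult))
    return objects


def horizontal_bbox(obj: DotaObject) -> Tuple[float, float, float, float]:
    """Axis-aligned box (xmin, ymin, xmax, ymax) enclosing the four corners."""
    xs = [x for x, _ in obj.corners]
    ys = [y for _, y in obj.corners]
    return min(xs), min(ys), max(xs), max(ys)


sample = """950.0 851.0 931.0 852.0 932.0 817.0 952.0 817.0 small-vehicle 1
475.0 982.0 456.0 982.0 461.0 841.0 481.0 842.0 large-vehicle 0"""

objs = parse_dota_labels(sample)
print(objs[0].category, objs[0].difficult, horizontal_bbox(objs[0]))
# prints: small-vehicle 1 (931.0, 817.0, 952.0, 852.0)
```

Keeping the four corners as explicit points makes geometric augmentations (flips, rotations, crops) straightforward, since each transform can be applied to the points directly; the enclosing horizontal box is only derived when a detector needs it.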
3 Data augmentation and dataset adjustment
Data augmentation and dataset adjustment are covered in other articles in this column.