MMDet: Pipeline and Data Augmentation Modules Explained

Irving.Gao

Last modified: 2022-10-09 14:34:54


Category: OpenMMLab. Tags: python, numpy, programming languages

First published: 2022-10-08 23:14:43

Article link: https://blog.csdn.net/qq_45779334/article/details/127215793


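I am currently working on HD Map tasks built on mmdet, which means reworking the dataset, the pipeline, and the model structure. In the pipeline, the generic Detection pipeline has to change in how it handles the box GT, so these notes record what I did.
First, the official mmdet explanation of the pipeline: Tutorial 3: Customize Data Pipelines
Second, an illustrated walkthrough by another expert: Visualizing Data Augmentation in mmdetection (mmdetection中数据增强的可视化)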

Here we only walk through train_pipeline:

`train_pipeline`

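# img_norm_cfg is defined elsewhere in the config; the values below are the
# ImageNet statistics commonly used in MMDetection configs.
img_norm_cfg = dict(
    mean=[123.675, 116.28, 103.53], std=[58.395, 57.12, 57.375], to_rgb=True)
train_pipeline = [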
    dict(type='LoadImageFromFile'),
    dict(type='LoadAnnotations', with_bbox=True),
    dict(type='Resize', img_scale=(1333, 800), keep_ratio=True),
    dict(type='RandomFlip', flip_ratio=0.5),
    dict(type='Normalize', **img_norm_cfg),
    dict(type='Pad', size_divisor=32),
    dict(type='DefaultFormatBundle'),
    dict(type='Collect', keys=['img', 'gt_bboxes', 'gt_labels']),
]
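
To make the idea of a pipeline concrete, here is a minimal sketch of how a list of transforms is applied in order to a single `results` dict. It does not use mmdet itself: `SimpleCompose`, `load_image`, `load_annotations`, and `collect` are hypothetical stand-ins written for illustration, and the toy "image" is just a nested list.

```python
from typing import Callable, List, Optional

# A transform takes the results dict and returns it (or None to drop the sample).
Transform = Callable[[dict], Optional[dict]]


class SimpleCompose:
    """Hypothetical stand-in for a pipeline runner: applies transforms in order."""

    def __init__(self, transforms: List[Transform]):
        self.transforms = transforms

    def __call__(self, results: dict) -> Optional[dict]:
        for transform in self.transforms:
            results = transform(results)
            if results is None:
                # A transform may reject the sample by returning None.
                return None
        return results


def load_image(results: dict) -> dict:
    # Toy loader (assumption): builds a blank height x width "image".
    results['img'] = [[0] * results['width'] for _ in range(results['height'])]
    results['img_shape'] = (results['height'], results['width'])
    return results


def load_annotations(results: dict) -> dict:
    # Copies the raw boxes from ann_info into the key later steps expect.
    results['gt_bboxes'] = [list(b) for b in results['ann_info']['bboxes']]
    return results


def collect(keys: List[str]) -> Transform:
    # Keeps only the listed keys, mirroring the role of the final step.
    def _collect(results: dict) -> dict:
        return {k: results[k] for k in keys}
    return _collect


pipeline = SimpleCompose([load_image, load_annotations,
                          collect(['img', 'gt_bboxes'])])
sample = {'width': 4, 'height': 2, 'ann_info': {'bboxes': [(0, 0, 2, 1)]}}
out = pipeline(sample)
print(sorted(out.keys()))
```

Each step only reads and writes keys of the shared dict, which is why swapping the box GT for another kind of GT mostly comes down to changing which keys are loaded and collected.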

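LoadImageFromFile and LoadAnnotations
- As the names suggest, these two methods read the image and the annotation respectively.
- LoadImageFromFile usually does not need to be changed;
- If your GT is not the box used in Object Detection, then LoadAnnotations is the function you mainly need to modify.

For example, my GT is in the form of points, so I replaced the original _load_bboxes with _load_pts. Inside it, all that is needed is to store the points as a numpy.array and put them into results.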

    # Methods of the customized LoadAnnotations class; requires `import numpy as np`.
    def _load_pts(self, results):
        """Private function to load points annotations.

        Args:
            results (dict): Result dict from :obj:`mmdet.CustomDataset`.

        Returns:
            dict: The dict contains loaded points annotations.
        """
        pts_ego3d_list = []
        for pts_anno in results['ann_info']:
            pts = np.array(pts_anno['pts_ego3d'], dtype=np.float32)
            if self.pts_type == '2D':
                # Keep only the x and y coordinates.
                pts = pts[:, :2]
            pts_ego3d_list.append(pts)
        # np.stack requires every annotation to have the same number of points.
        results['gt_pts'] = np.stack(pts_ego3d_list, axis=0)
        return results

    def __call__(self, results):
        """Call function to load multiple types annotations.

        Args:
            results (dict): Result dict from :obj:`mmdet.CustomDataset`.

        Returns:
            dict: The dict contains loaded points and label annotations.
        """
        if self.with_pts:
            results = self._load_pts(results)
        if self.with_label:
            results = self._load_labels(results)
        return results
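
As a quick check of what ends up in `results['gt_pts']`, the sketch below runs the same loading logic outside of any class. `load_pts` and the sample `ann_info` values are made up for illustration; only the `pts_ego3d` key and the 2D/3D switch come from the code above.

```python
import numpy as np


def load_pts(ann_info, pts_type='2D'):
    """Standalone version of the _load_pts logic, for illustration only."""
    pts_list = []
    for pts_anno in ann_info:
        pts = np.array(pts_anno['pts_ego3d'], dtype=np.float32)
        if pts_type == '2D':
            pts = pts[:, :2]  # drop the z coordinate
        pts_list.append(pts)
    # Stacking only works when every annotation has the same number of points.
    return np.stack(pts_list, axis=0)


# Two made-up annotations, each with three 3D points.
ann_info = [
    {'pts_ego3d': [[0.0, 0.0, 0.1], [1.0, 0.5, 0.1], [2.0, 1.0, 0.1]]},
    {'pts_ego3d': [[0.0, 3.0, 0.2], [1.0, 3.5, 0.2], [2.0, 4.0, 0.2]]},
]

gt_pts_2d = load_pts(ann_info, '2D')
gt_pts_3d = load_pts(ann_info, '3D')
print(gt_pts_2d.shape)  # (2, 3, 2): instances, points per instance, xy
print(gt_pts_3d.shape)  # (2, 3, 3): instances, points per instance, xyz
```

If annotations can have different numbers of points, `np.stack` raises an error, so the points would need to be resampled or padded to a fixed length before stacking.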

Resize
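- Resize resizes everything in results, e.g. the image, gt_bboxes, and so on;
- The main parameters:
  - img_scale: used together with keep_ratio. It can also be a list, e.g. img_scale=[(736, 1333), (768, 1333), (800, 1333)], in which case one scale is chosen for each image. If keep_ratio=True, the image and its GT are scaled by a single factor, the largest one for which the image still fits inside the chosen img_scale (long edge within the larger value, short edge within the smaller value). If keep_ratio=False, width and height are resized independently to exactly the given img_scale. Either way, no side of the output image exceeds the corresponding side of img_scale.
  - Further reading: Resize in Object Detection (目标检测中的resize)
  - MMDetection Image Scaling: Resize Explained in Detail (MMDetection 图像缩放 Resize 详细说明)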

dict(type='Resize', img_scale=(1333, 800), keep_ratio=True),
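
The arithmetic behind the two keep_ratio modes can be sketched in plain Python. The helper names `rescale_size` and `resize_size` are hypothetical and this is not the library code itself; it assumes sizes are given as (width, height) and that rounding is done by adding 0.5 and truncating.

```python
def rescale_size(old_size, scale):
    """keep_ratio=True: one factor for both sides, as large as still fits."""
    w, h = old_size
    max_long_edge, max_short_edge = max(scale), min(scale)
    factor = min(max_long_edge / max(h, w), max_short_edge / min(h, w))
    new_size = (int(w * factor + 0.5), int(h * factor + 0.5))
    return new_size, factor


def resize_size(old_size, scale):
    """keep_ratio=False: width and height are set to the scale directly."""
    w, h = old_size
    new_w, new_h = scale
    return (new_w, new_h), (new_w / w, new_h / h)


# A 1920x1080 image is limited by its long edge.
print(rescale_size((1920, 1080), (1333, 800))[0])  # (1333, 750)
# A 640x480 image is limited by its short edge.
print(rescale_size((640, 480), (1333, 800))[0])    # (1067, 800)
# Without keep_ratio the aspect ratio changes.
print(resize_size((640, 480), (1333, 800))[0])     # (1333, 800)
```

Image-space annotations such as gt_bboxes are multiplied by the same factor(s), which is why a custom GT type that lives in image coordinates needs its own resize handling.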

RandomFlip
- RandomFlip flips everything in results, e.g. the image, gt_bboxes, and so on;
- The main parameters:
  - flip_ratio: the probability that each image is flipped.
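
What flipping means for the GT can be shown with a small sketch of a horizontal flip on (x1, y1, x2, y2) boxes. The functions `flip_bbox_horizontal` and `random_flip` are hypothetical helpers for illustration, not the mmdet implementation, and only the horizontal direction is covered.

```python
import random


def flip_bbox_horizontal(bbox, img_width):
    """Mirror a (x1, y1, x2, y2) box around the vertical center line."""
    x1, y1, x2, y2 = bbox
    # The old right edge becomes the new left edge and vice versa.
    return (img_width - x2, y1, img_width - x1, y2)


def random_flip(bboxes, img_width, flip_ratio=0.5, rng=None):
    """Flip all boxes of one image together with probability flip_ratio."""
    rng = rng or random
    do_flip = rng.random() < flip_ratio
    if do_flip:
        bboxes = [flip_bbox_horizontal(b, img_width) for b in bboxes]
    return bboxes, do_flip


print(flip_bbox_horizontal((10, 20, 50, 60), 100))  # (50, 20, 90, 60)

boxes, flipped = random_flip([(10, 20, 50, 60)], 100, flip_ratio=1.0)
print(boxes, flipped)  # [(50, 20, 90, 60)] True
```

The decision is drawn once per image so that the image and all of its annotations stay consistent; a point-based GT in image coordinates would need the same mirroring applied to its x values.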
