模型训练知识点记录

작은 여우

已于 2022-08-31 00:25:33 修改

阅读量269

点赞数

分类专栏： # 目标跟踪文章标签：深度学习人工智能

于 2022-08-29 14:15:46 首次发布

本文链接：https://blog.csdn.net/yhsunhfut/article/details/126583848

版权

5 篇文章 0 订阅

订阅专栏

一些知识点记录一下：

Test-Time Augmentation，测试时数据增强
- 测试时将原始数据做不同形式的增强，然后取结果的平均值作为最终结果。可以进一步提升最终结果的精度
- The input size significantly influences detection accuracy, since high resolution make the detectors " small objects" clearly to increase successful detections. The multi-scale test can make the detector trainer with limited input size (e.g. 320*320) to see those small objects that only can be ‘see’ by the large input size (1000*600) .

 dataloader = DataLoader(dataset, num_workers=2, batch_size=3)

DataLoader(dataset, batch_size=1, shuffle=False, sampler=None,num_workers=0, collate_fn=default_collate, pin_memory=False,drop_last=False)

dataset：加载的数据集(Dataset对象)
- shuffle，表示数据是否打乱
- sampler，样本抽样
- num_workers，使用多进程加载的进程数，0表示不使用多进程
- collate_fn，如何将多个样本数据拼接成一个batch，一般使用默认的拼接方式即可
- pin_memory，是否将数据保存在pin memory区，pin memory中的数据转到GPU会快一点
- drop_last，dataset中的数据个数可能不是batch_size的整数倍，drop_last为True将多出来不足一个batch的数据丢弃。
Conv2d函数详解dilation(Pytorch):
- dilation扩张，一般情况下，卷积核和输入图像对应的位置之间的计算是相同尺寸的，也就是说卷积核的大小是 $3\times3$ ，那么它在输入图像上每次作用的区域是 $3\times3$ 。这种情况下 $d i l a t i o n = 0$ 。当 $d i l a t i o n = 1$ 时，表示的是下图这种情况。
- groups：分组。指的是对输入通道进行分组，如果 $g ro u p s = 1$ ，那么输入就一组，输出也为一组。如果 $g ro u p s = 2$ ，那么就将输入分为两组，那么相应的输出也是两组。另外需要注意的是in_channels和out_channels必须能整除gropus。