02-VGGNet的学习笔记
以下仅为个人的学习笔记,仅供参考
一、VGG中3个3*3卷积相对于1个7 *7卷积,在参数上比较少了百分之多少?(假设输入和输出通道数均为C)
1.对于3个33卷积层,参数量为333C^2;
对于1个77的卷积层,参数量为7 * 7C^2,
前者相比后者的参数减少量为(49-27)/49 = 44.89%
二、VGG-16和VGG-19差别在哪?
1.在网络结构上,VGG-19在VGG-16的第3、4、5 block分别添加了一个3*3卷积层
2.在相同训练、测试条件下,VGG-19获得精度比VGG-16稍高一点
三、读完该论文,对你的启发点有哪些?
1.在论文实验方法上的启发
- 在实验过程中,想要训练出最优的模型不会一次就能得出来,需要用到对比实验
- 正如VGG中的Table3-Table6展示的,根据多个模型对比实验的结果,分析其中的关联和隐藏信息,最后再对网络进行调整
- 在实验中逐步留下有效的模型,进一步分析影响精度的因素
2.采用多模型融合来提升精度
3.利用3个33卷积核可代替一个77卷积核,并且减少了很多参数
四、从网上找一张图片,执行vgg16,观察top5输出的类别,并将输出结果截图
----------------------------------------------------------------
Layer (type) Output Shape Param #
================================================================
Conv2d-1 [-1, 64, 224, 224] 1,792
ReLU-2 [-1, 64, 224, 224] 0
Conv2d-3 [-1, 64, 224, 224] 36,928
ReLU-4 [-1, 64, 224, 224] 0
MaxPool2d-5 [-1, 64, 112, 112] 0
Conv2d-6 [-1, 128, 112, 112] 73,856
ReLU-7 [-1, 128, 112, 112] 0
Conv2d-8 [-1, 128, 112, 112] 147,584
ReLU-9 [-1, 128, 112, 112] 0
MaxPool2d-10 [-1, 128, 56, 56] 0
Conv2d-11 [-1, 256, 56, 56] 295,168
ReLU-12 [-1, 256, 56, 56] 0
Conv2d-13 [-1, 256, 56, 56] 590,080
ReLU-14 [-1, 256, 56, 56] 0
Conv2d-15 [-1, 256, 56, 56] 590,080
ReLU-16 [-1, 256, 56, 56] 0
MaxPool2d-17 [-1, 256, 28, 28] 0
Conv2d-18 [-1, 512, 28, 28] 1,180,160
ReLU-19 [-1, 512, 28, 28] 0
Conv2d-20 [-1, 512, 28, 28] 2,359,808
ReLU-21 [-1, 512, 28, 28] 0
Conv2d-22 [-1, 512, 28, 28] 2,359,808
ReLU-23 [-1, 512, 28, 28] 0
MaxPool2d-24 [-1, 512, 14, 14] 0
Conv2d-25 [-1, 512, 14, 14] 2,359,808
ReLU-26 [-1, 512, 14, 14] 0
Conv2d-27 [-1, 512, 14, 14] 2,359,808
ReLU-28 [-1, 512, 14, 14] 0
Conv2d-29 [-1, 512, 14, 14] 2,359,808
ReLU-30 [-1, 512, 14, 14] 0
MaxPool2d-31 [-1, 512, 7, 7] 0
AdaptiveAvgPool2d-32 [-1, 512, 7, 7] 0
Linear-33 [-1, 4096] 102,764,544
ReLU-34 [-1, 4096] 0
Dropout-35 [-1, 4096] 0
Linear-36 [-1, 4096] 16,781,312
ReLU-37 [-1, 4096] 0
Dropout-38 [-1, 4096] 0
Linear-39 [-1, 1000] 4,097,000
================================================================
Total params: 138,357,544
Trainable params: 138,357,544
Non-trainable params: 0
----------------------------------------------------------------
Input size (MB): 0.57
Forward/backward pass size (MB): 218.78
Params size (MB): 527.79
Estimated Total Size (MB): 747.15
----------------------------------------------------------------
img: 小柯基.jpg is: Pembroke, Pembroke Welsh corgi
263 n02113023 狗, Pembroke, Pembroke Welsh corgi
time consuming:0.90s