Introduction to Deep Learning (3) - CNN

CNN

Convolutional Layer

We slide a filter over the image spatially, computing dot products between the filter weights and each local patch


Conv layers are interspersed with activation functions as well

What does it learn?

First-layer conv filters act as local image templates (they often learn oriented edges and opposing colors)

Problems:
  1. For large images, we need many layers for information about the whole image to propagate

     Solution: downsample inside the network

  2. The feature map shrinks with each layer

     Solution: padding, i.e. adding zeros around the input
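To make the sliding-filter and padding behavior concrete, here is a minimal single-channel convolution sketch in NumPy (a naive loop for clarity, not an efficient implementation):

```python
import numpy as np

def conv2d(x, w, stride=1, pad=0):
    """Naive 2D convolution: slide filter w over image x, taking dot products.
    x: (H, W) single-channel image, w: (K, K) filter."""
    if pad:
        x = np.pad(x, pad)  # zeros around the input
    K = w.shape[0]
    H_out = (x.shape[0] - K) // stride + 1
    W_out = (x.shape[1] - K) // stride + 1
    out = np.zeros((H_out, W_out))
    for i in range(H_out):
        for j in range(W_out):
            patch = x[i*stride:i*stride+K, j*stride:j*stride+K]
            out[i, j] = np.sum(patch * w)  # dot product with the filter
    return out

img = np.arange(25, dtype=float).reshape(5, 5)
k = np.ones((3, 3)) / 9.0           # simple averaging filter
print(conv2d(img, k).shape)         # (3, 3): the map shrinks without padding
print(conv2d(img, k, pad=1).shape)  # (5, 5): zero padding preserves the size
```

The output size follows the usual formula (W - K + 2P) / S + 1, which is why pad=1 keeps a 3x3 filter "same"-sized.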

Pooling layer

-> downsampling

There are no parameters that need to be learned.

Examples:

max pooling

average pooling
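A minimal sketch of both pooling variants (NumPy; non-overlapping 2x2 windows assumed):

```python
import numpy as np

def pool2d(x, k=2, stride=2, mode="max"):
    """Max or average pooling over k x k windows. No learnable parameters."""
    H_out = (x.shape[0] - k) // stride + 1
    W_out = (x.shape[1] - k) // stride + 1
    out = np.zeros((H_out, W_out))
    reduce = np.max if mode == "max" else np.mean
    for i in range(H_out):
        for j in range(W_out):
            out[i, j] = reduce(x[i*stride:i*stride+k, j*stride:j*stride+k])
    return out

x = np.array([[1., 2., 5., 6.],
              [3., 4., 7., 8.],
              [9., 8., 3., 2.],
              [7., 6., 1., 0.]])
print(pool2d(x, mode="max"))  # [[4. 8.] [9. 3.]]
print(pool2d(x, mode="avg"))  # [[2.5 6.5] [7.5 1.5]]
```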

FC layer (Fully Connected)

The last layer should always be an FC layer.

Batch normalization

We need to force the inputs to each layer to be nicely scaled so that optimization is easier.

Usually inserted after FC layer / Convolutional layer, before non-linearity

Pros:

makes the network easier to train

more robust to initialization

Cons:

behaves differently during training and testing
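A training-mode forward pass can be sketched as follows (NumPy; the running averages used at test time are omitted):

```python
import numpy as np

def batchnorm_forward(x, gamma, beta, eps=1e-5):
    """Training-mode batch norm: normalize each feature over the batch,
    then scale and shift with learnable gamma, beta. x: (N, D).
    At test time the batch statistics are replaced by running averages,
    which is why the layer behaves differently during training and testing."""
    mu = x.mean(axis=0)
    var = x.var(axis=0)
    x_hat = (x - mu) / np.sqrt(var + eps)
    return gamma * x_hat + beta

x = np.random.randn(64, 8) * 10 + 5  # badly scaled inputs
y = batchnorm_forward(x, gamma=np.ones(8), beta=np.zeros(8))
print(y.mean(axis=0).round(3))       # ~0 per feature
print(y.std(axis=0).round(3))        # ~1 per feature
```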


Architectures (History of the ImageNet Challenge)

AlexNet

Input: 3 * 227 * 227

First conv layer: 64 filters, kernel size 11, stride 4, pad 2

We need to pay attention to the memory usage, parameter count, and FLOPs of each layer.
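The per-layer bookkeeping can be sketched with a small helper (square inputs and filters assumed, float32 activations, FLOPs counted as multiply-adds):

```python
def conv_layer_stats(c_in, h_in, c_out, k, stride, pad):
    """Output size, activation memory (KB), params, and FLOPs of one conv layer."""
    h_out = (h_in - k + 2 * pad) // stride + 1
    memory_kb = c_out * h_out * h_out * 4 / 1024  # float32 output activations
    params = c_out * (c_in * k * k + 1)           # weights + one bias per filter
    flops = c_out * h_out * h_out * c_in * k * k  # one mul-add per weight per position
    return h_out, memory_kb, params, flops

# AlexNet conv1: 64 filters, 11x11, stride 4, pad 2 on a 3x227x227 input
h, mem, params, flops = conv_layer_stats(3, 227, 64, 11, 4, 2)
print(h, round(mem), params, flops)  # 56 784 23296 72855552
```

Note how memory is dominated by the early layers (large spatial maps) while parameters are dominated by the late FC layers.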

ZFNet

a larger AlexNet

VGG

Rules:

  1. All conv layers are 3*3, stride 1, pad 1
  2. All max pools are 2*2, stride 2
  3. After each pool, double the number of channels

Stages:

conv-conv-pool

conv-conv-pool

conv-conv-pool

conv-conv-[conv]-pool

conv-conv-[conv]-pool
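One reason for the all-3*3 rule: two stacked 3*3 convs cover the same 5*5 receptive field as a single 5*5 conv but need fewer weights (and add an extra non-linearity). A quick count, with an illustrative channel number:

```python
def conv_params(c, k, layers=1):
    """Weights of `layers` stacked KxK convs with C channels in and out (biases ignored)."""
    return layers * c * c * k * k

c = 256
print(conv_params(c, 5))            # one 5x5 conv: 1638400 weights
print(conv_params(c, 3, layers=2))  # two 3x3 convs, same receptive field: 1179648
```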

GoogLeNet

Stem network: aggressively downsamples input

Inception module:

请添加图片描述

Use such local units with different kernel sizes in parallel, concatenating their outputs

Use 1*1 "bottleneck" convolutions to reduce the channel dimension

At the end, rather than flattening (which destroys the spatial information and requires a giant number of parameters), GoogLeNet uses global average pooling: 7 * 7 * 1024 -> 1024.

There is only one FC layer at the end.

Find the bottleneck positions and reduce the number of learnable parameters and the memory footprint as much as possible.
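A quick count of what a 1*1 bottleneck saves (the channel numbers here are illustrative, not GoogLeNet's exact values):

```python
def conv_weights(c_in, c_out, k):
    """Weights of a KxK conv mapping c_in -> c_out channels (biases ignored)."""
    return c_in * c_out * k * k

# Direct 5x5 conv: 256 -> 256 channels
direct = conv_weights(256, 256, 5)
# Bottleneck: 1x1 reduces 256 -> 64, then the 5x5 maps 64 -> 256
bottleneck = conv_weights(256, 64, 1) + conv_weights(64, 256, 5)
print(direct, bottleneck)  # 1638400 425984, i.e. roughly 4x fewer weights
```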

Auxiliary Classifiers:

To help the deep network converge (batch normalization had not been invented yet): auxiliary classification outputs inject additional gradient at the lower layers.

Residual Networks

We find that sometimes making the network deeper causes it to underfit.

A deeper network should strictly have the capacity to do whatever a shallower one can (the extra layers could just learn the identity), but in practice those parameters are hard to learn.

So we need the residual network!


The shortcut makes it easy to learn the identity: with all the residual branch's parameters set to 0, the block passes its input through unchanged.

ResNet still imitates VGG's stage design: 3*3 convs, with resolution halved and channels doubled between stages.
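A sketch of the identity-learning argument, with the convs simplified to plain matrix multiplies (illustrative only):

```python
import numpy as np

def relu(x):
    return np.maximum(x, 0)

def residual_block(x, w1, w2):
    """Basic residual block: relu(F(x) + x), with the shortcut adding x back."""
    out = relu(x @ w1)
    out = out @ w2
    return relu(out + x)  # add the input back before the final ReLU

d = 8
x = np.random.randn(4, d)
zeros = np.zeros((d, d))
# With the residual branch's weights at zero, the block is the identity
# (for the non-negative activations it receives from a previous ReLU):
y = residual_block(relu(x), zeros, zeros)
print(np.allclose(y, relu(x)))  # True
```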

ResNeXt

Adding groups improves performance at the same computational complexity.

MobileNets

Reduce computational cost to make CNNs affordable on mobile devices.
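The main cost reduction in MobileNets comes from replacing standard convolutions with depthwise separable ones (a depthwise KxK conv followed by a pointwise 1x1 conv; not spelled out in the notes above). A rough multiply-add comparison with illustrative layer sizes:

```python
def standard_conv_flops(c_in, c_out, k, h):
    """Mul-adds of a standard KxK conv on an h x h map."""
    return c_in * c_out * k * k * h * h

def depthwise_separable_flops(c_in, c_out, k, h):
    depthwise = c_in * k * k * h * h  # one KxK filter per input channel
    pointwise = c_in * c_out * h * h  # 1x1 conv mixes the channels
    return depthwise + pointwise

c_in, c_out, k, h = 128, 128, 3, 56
std = standard_conv_flops(c_in, c_out, k, h)
sep = depthwise_separable_flops(c_in, c_out, k, h)
print(round(std / sep, 1))  # 8.4, i.e. roughly a 1/k^2 + 1/c_out cost ratio
```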

Transfer learning

We can pretrain the model on a large dataset.

When applying it to a new dataset, just fine-tune the top layers, or train a linear classifier on top of the extracted features.

Freeze the main body of the net.

This is somewhat debated: training from scratch for about 2-3x as long can reach similar performance without pretraining.

