CMU 11-785 L10 CNN architecture

zealscott

于 2020-05-19 19:28:29 发布

阅读量244

点赞数

分类专栏： CMU 11-785 文章标签： filter 深度学习数据挖掘机器学习神经网络

版权声明：本文为博主原创文章，遵循 CC 4.0 BY-SA 版权协议，转载请附上原文出处链接和本声明。

本文链接：https://blog.csdn.net/crazy_scott/article/details/106221395

版权

CMU 11-785 专栏收录该内容

22 篇文章

订阅专栏

Architecture

在这里插入图片描述

A convolutional neural network comprises “convolutional” and “downsampling ” layers
- Convolutional layers comprise neurons that scan their input for patterns
- Downsampling layers perform max operations on groups of outputs from the convolutional layers
  - Perform on individual map
  - For reduce the number of parameters
The two may occur in any sequence, but typically they alternate
Followed by an MLP with one or more layers

A convolutional layer

Each activation map has two components
- An affine map, obtained by convolution over maps in the previous layer
  - Each affine map has, associated with it, a learnable filter
- An activation that operates on the output of the convolution
What is a convolution
- Scanning an image with a “filter”
- Equivalent to scanning with an MLP
Weights
- size of the filter $\times$ no. of maps in previous layer
Size
- Image size: $N\times N$
- Filter: $M\times M$
- Stride: $S$
- Output size = $\lfloor(N-M) / S\rfloor+1$
Jargon
- Filters are often called “Kernels”
- The outputs of individual filters are called “channels”

Notion

Each convolution layer maintains the size of the image
- With appropriate zero padding
- If performed without zero padding it will decrease the size of the input
Each convolution layer may increase the number of maps from the previous layer
- Depends on the number of filters
Each pooling layer with hop $D$ decreases the size of the maps by a factor of $D$
Filters within a layer must all be the same size, but sizes may vary with layer
- Similarly for pooling, $D$ may vary with layer
In general the number of convolutional filters increases with layers
- Because the patterns gets more complex, hence larger combinations of patterns to capture
Training is as in the case of the regular MLP
- The only difference is in the structure of the network

博客等级

码龄7年

196
原创

411
点赞

1797
收藏

326
粉丝

关注

私信

热门文章

分类专栏

最新评论

基于IMDb数据集的情感分析（Doc2Vec模型与神经网络实现）
gdisnsagu: 有预处理完的文件吗
KMP算法详解（C++实现）
2401_84256088: 又臭又长还有错，看我写的 /** * @param s 待匹配的字符串 * @param p 模式串 * @return s是否包含p * next[j]表示以p[j]结尾的子串，的最长相等先后缀的长度 */ bool kmp (const string &s, const string &p) { int n = s.size(), m = p.size(), next[m], i, j, k; next[0] = 0; for (j = 1; j < m; j++) { for (k = next[j-1]; k && p[j] != p[k]; k = next[k - 1]); next[j] = p[j] == p[k] ? k + 1 : 0; } for (i = 0, j = 0; i < n && j < m;) { if (s[i] == p[j]) i++, j++; else j = next[j]; } return j == m; }
矩阵求导法则与性质
Jerry fk: 我也在纠结这玩意儿，我刚看了定义，他那个刚好写反了
hexo下LaTeX无法显示的解决方案
风翼飞镰: 这是关键啊:CDN地址！
python plot hist 密度图概率和不为1
尚未填写: 有用，感谢！想要绘制多组数据的概率图的话，只需把不同的weights添加到一个列表即可，比如： x_value = [train_points, test_points] train_weights = np.ones_like(train_points)/float(len(train_points)) test_weights = np.ones_like(test_points)/float(len(test_points)) weights = [train_weights, test_weights] plt.hist(x_value, bins=10, histtype="bar", alpha=0.5, label=["training set", "test set"], weights=weights) plt.legend() plt.show()

大家在看

最新文章

目录

评论

被折叠的条评论为什么被折叠?

到【灌水乐园】发言

查看更多评论

添加红包

成就一亿技术人!

hope_wisdom

发出的红包

实付元

使用余额支付

点击重新获取

扫码支付

钱包余额 0

抵扣说明：

1.余额是钱包充值的虚拟货币，按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载，可以购买VIP、付费专栏及课程。