Convolutional Layer
low-level features -- edge features; high-level features -- complex patterns
Depth dimension: the output depth equals the number of filters.
Spatial dimension: output size = (N - F) / stride + 1; with zero padding P, output size = (N - F + 2P) / stride + 1.
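A minimal sketch of the output-size formula above (function and variable names are illustrative):

```python
def conv_output_size(N, F, stride, pad=0):
    """Spatial output size of a conv layer: (N - F + 2*pad) / stride + 1."""
    assert (N - F + 2 * pad) % stride == 0, "filter does not tile the input evenly"
    return (N - F + 2 * pad) // stride + 1

# e.g. a 32x32 input, 5x5 filter, stride 1, no padding -> 28x28 output
print(conv_output_size(N=32, F=5, stride=1))  # 28
```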
Pooling Layer:
Operates only on the spatial dimensions; the depth doesn't change. Defined by a filter size and stride; typically stride = filter size, so the windows don't overlap.
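A minimal NumPy sketch of max pooling showing both properties (spatial-only, non-overlapping windows when stride equals filter size):

```python
import numpy as np

def max_pool(x, filter_size=2, stride=2):
    """Max pooling over the spatial dims of x with shape (H, W, C).
    Depth C is untouched; stride == filter_size means no overlap."""
    H, W, C = x.shape
    H_out = (H - filter_size) // stride + 1
    W_out = (W - filter_size) // stride + 1
    out = np.zeros((H_out, W_out, C))
    for i in range(H_out):
        for j in range(W_out):
            window = x[i*stride:i*stride+filter_size,
                       j*stride:j*stride+filter_size, :]
            out[i, j, :] = window.max(axis=(0, 1))
    return out

x = np.random.randn(4, 4, 3)
print(max_pool(x).shape)  # (2, 2, 3) -- spatial dims halved, depth unchanged
```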
Activation Functions:
1. Sigmoid: range [0, 1]; saturated neurons kill the gradient.
2. tanh(x): range [-1, 1]; still kills the gradient when saturated.
3. ReLU: does not saturate (for x > 0), computationally efficient, and arguably more biologically plausible than sigmoid. Downside: a dead ReLU will never activate, so its weights never update.
4. Leaky ReLU: f(x) = max(0.01x, x); does not saturate, computationally efficient.
5. Exponential Linear Units (ELU): benefits of ReLU, with outputs closer to zero mean.
6. Maxout "Neuron": operates in a linear regime; does not saturate; does not die.
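Minimal NumPy sketches of these activations (the alpha defaults are common conventions, not fixed by these notes):

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))           # squashes to [0, 1]; saturates at both ends

def tanh(x):
    return np.tanh(x)                          # squashes to [-1, 1]; still saturates

def relu(x):
    return np.maximum(0.0, x)                  # no saturation for x > 0; units can die

def leaky_relu(x, alpha=0.01):
    return np.maximum(alpha * x, x)            # small negative slope keeps gradients alive

def elu(x, alpha=1.0):
    return np.where(x > 0, x, alpha * (np.exp(x) - 1.0))

def maxout(x, W1, b1, W2, b2):
    return np.maximum(W1 @ x + b1, W2 @ x + b2)  # max over two linear functions
```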
In practice:
1. Use ReLU. Be careful with your learning rates.
2. Try out Leaky ReLU / Maxout / ELU.
3. Try out tanh but don't expect much
4. Don't use Sigmoid.
Data preprocessing:
- Subtract the mean image (e.g. AlexNet)
(mean image = [32, 32, 3] array)
- Subtract the per-channel mean (e.g. VGGNet)
(mean along each channel = 3 numbers)
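Both options in NumPy (shapes are illustrative):

```python
import numpy as np

# X_train: (N, 32, 32, 3) batch of training images
X_train = np.random.rand(50, 32, 32, 3)

# AlexNet-style: subtract the mean image (a [32, 32, 3] array)
mean_image = X_train.mean(axis=0)
X_centered = X_train - mean_image

# VGGNet-style: subtract the per-channel mean (3 numbers)
channel_mean = X_train.mean(axis=(0, 1, 2))   # shape (3,)
X_centered = X_train - channel_mean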
Weight Initialization:
Batch Normalization:
1. Compute the empirical mean and variance independently for each dimension
2. Normalize
Batch Normalization is commonly used in standard convolutional neural networks.
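A minimal sketch of the two steps above in training mode (gamma and beta are the standard learnable scale/shift; eps is the usual numerical-stability constant):

```python
import numpy as np

def batchnorm_forward(x, gamma, beta, eps=1e-5):
    """Batch norm, training mode. x: (N, D); gamma, beta: (D,)."""
    mu = x.mean(axis=0)                    # 1. empirical mean per dimension
    var = x.var(axis=0)                    #    empirical variance per dimension
    x_hat = (x - mu) / np.sqrt(var + eps)  # 2. normalize to zero mean, unit variance
    return gamma * x_hat + beta            # learnable scale/shift restores expressivity
```

At test time the batch statistics are replaced by running averages accumulated during training.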
Situation 1: the loss barely changes while accuracy improves a lot. Although the score distribution is still spread out (so the loss values stay close), it is shifting in the right direction and the weights are moving the right way, so accuracy can suddenly jump.
Rough range for learning rate: [1e-3, ..., 1e-5]
Cross-validation strategy:
Cross-validation: train on the training set, evaluate on the validation set, and observe how the hyperparameters perform.
First stage: only a few epochs to get a rough idea of which params work.
Second stage: longer running time, finer search.
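A sketch of the two-stage search, sampling learning rates log-uniformly from the rough range above (train_and_validate is a hypothetical placeholder for an actual training loop):

```python
import numpy as np

def train_and_validate(lr, num_epochs):
    """Placeholder for a real training loop; returns a validation accuracy."""
    return np.random.rand()  # stand-in so the sketch runs

# First stage: few epochs, log-uniform sampling over [1e-5, 1e-3]
for _ in range(20):
    lr = 10 ** np.random.uniform(-5, -3)
    val_acc = train_and_validate(lr=lr, num_epochs=2)
    print(f"lr = {lr:.2e} -> val_acc = {val_acc:.3f}")

# Second stage: re-sample in a narrower window around the best lr, train longer.
```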
CNN Architectures:
A pooling layer has no parameters to train; it just looks at the pooling region.
VGGNet:
Why use smaller filters? Stacking them gives a deeper network with more non-linearities and fewer parameters for the same effective receptive field (see the comparison below).
Keep the output size equal to the input size: use zero padding.
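A quick check of the parameter claim: three stacked 3x3 conv layers cover the same 7x7 effective receptive field as one 7x7 layer but use fewer parameters (C is an illustrative channel count; biases ignored):

```python
C = 64
params_7x7 = 7 * 7 * C * C               # one 7x7 conv layer: 49 C^2
params_3x3_stack = 3 * (3 * 3 * C * C)   # three stacked 3x3 convs: 27 C^2
print(params_7x7, params_3x3_stack)      # 200704 vs 110592
```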
ResNet:
Residual connections: deeper plain networks do not necessarily perform better (they become harder to optimize).
F(x) is the residual; use the stacked layers to fit the residual F(x) = H(x) - x instead of fitting H(x) directly.
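A minimal PyTorch sketch of a residual block under this formulation; the layer sizes and BatchNorm placement follow the common "basic block" convention, not anything mandated by these notes:

```python
import torch
import torch.nn as nn

class ResidualBlock(nn.Module):
    """The conv stack learns F(x) = H(x) - x; the skip connection adds x back."""
    def __init__(self, channels):
        super().__init__()
        self.conv1 = nn.Conv2d(channels, channels, kernel_size=3, padding=1)
        self.bn1 = nn.BatchNorm2d(channels)
        self.conv2 = nn.Conv2d(channels, channels, kernel_size=3, padding=1)
        self.bn2 = nn.BatchNorm2d(channels)
        self.relu = nn.ReLU()

    def forward(self, x):
        residual = self.bn2(self.conv2(self.relu(self.bn1(self.conv1(x)))))
        return self.relu(residual + x)   # output is F(x) + x
```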
Recurrent Neural Networks (RNN):
Input: video, sentences, and other variable-length sequence data.
Read an input, update the hidden state, produce an output.
Note: the same function and the same set of parameters are used at every time step.
Vanilla Recurrent Neural Network:
tanh squashes the result into [-1, 1].
Re-use the same weight matrix at every time step.
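A minimal NumPy sketch of the vanilla RNN update h_t = tanh(Whh @ h_prev + Wxh @ x_t + b); dimensions and names are illustrative:

```python
import numpy as np

def rnn_step(x_t, h_prev, Wxh, Whh, b):
    """One vanilla RNN step; the same Wxh, Whh, b are reused at every time step."""
    return np.tanh(Whh @ h_prev + Wxh @ x_t + b)

D, H = 10, 4                        # input dim, hidden dim
Wxh, Whh, b = np.random.randn(H, D), np.random.randn(H, H), np.zeros(H)
h = np.zeros(H)
for x_t in np.random.randn(7, D):   # a length-7 sequence
    h = rnn_step(x_t, h, Wxh, Whh, b)   # hidden state updated with shared weights
```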
Semantic Segmentation:
sliding window -- huge computational cost (every overlapping patch is processed separately)
fully convolutional -- keeps the original spatial size; the training dataset is obtained by labeling every pixel
downsample -- pooling, strided convolution
upsample -- unpooling
Learnable Upsampling: Transposed Convolution
Convolution and transposed convolution are not inverses of each other: with the same kernel, a transposed convolution does not recover the original values; it only restores the original shape.
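A small PyTorch demonstration of this point: pairing a convolution with a transposed convolution that shares its kernel restores the shape but not the values (sizes are illustrative):

```python
import torch
import torch.nn as nn

x = torch.randn(1, 1, 8, 8)
conv = nn.Conv2d(1, 1, kernel_size=4, stride=2, padding=1, bias=False)
deconv = nn.ConvTranspose2d(1, 1, kernel_size=4, stride=2, padding=1, bias=False)
deconv.weight.data = conv.weight.data   # same kernel for both directions

y = deconv(conv(x))                     # downsample to 4x4, upsample back to 8x8
print(y.shape == x.shape)               # True  -- shape is recovered
print(torch.allclose(y, x))             # False -- values are not recovered
```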
Generative Models:
Unsupervised Learning
data : x ; just data, no labels
goal : learn some underlying hidden structure of the data
examples: clustering, dimensionality reduction, feature learning, density estimation, etc.
Supervised Learning
data : (x, y), x is data, y is label;
goal : learn a function to map x->y
examples: classification, regression, object detection, semantic segmentation, image captioning, etc.
Why Generative Models?
To generate the realistic samples we want from the data distribution.