CS231n Convolutional Neural Networks for Visual Recognition

最新推荐文章于 2020-02-25 10:42:12 发布

gdymind

最新推荐文章于 2020-02-25 10:42:12 发布

阅读量428

点赞数

分类专栏：机器学习 CNN 文章标签： cnn 机器学习图像识别

版权声明：本文为博主原创文章，遵循 CC 4.0 BY-SA 版权协议，转载请附上原文出处链接和本声明。

本文链接：https://blog.csdn.net/gdymind/article/details/78290793

版权

机器学习同时被 2 个专栏收录

14 篇文章 0 订阅

订阅专栏

3 篇文章 0 订阅

订阅专栏

@(机器学习和人工智能)[机器学习, CNN]

Lecture 1 | Introduction to Convolutional Neural Networks for Visual Recognition

History：
- 1960s：recognize & reconstruct
- object recognition is so hard $\Rightarrow$ first we do object segmetation
- feature based segmetation：
- SVM, boosting: complex；overfit(data quailty is changing)
两个最经典的data set：
- PASCAL Visual Object Challenge(object detection benchmark )
- ImageNet Large Scale Visual Recognition Challenge
CNN基本算法在1998年由LeCun等提出，2012年在ImgeNet上大显身手火了起来，再次火起来原因：电路集成规模越来越大，GPU的快速发展，data的质量和数量爆炸式增长。
学习CNN的预备知识：微积分，线性代数，CS229

Lecture 2 | Image Classification

Data Driven Approach
1. Collect a dataset of images and labels
2. Use Machine Learining to train a classifier
3. Evaluate the classifier on new images

k-Nearest Neighbors(kNN)

在最近的k个邻居中，哪一类个数最多，就归为哪一类
hyperparameters: choices about the algorithm that we set rather than learn
how to set proper hyperparameters: split dataset into train, validation and test set
- train set(most data)
- validation set: envaluate
- test set: test once
k-Nearest Neighbors on imgages never used

Linear Classification

super important and help us build CNNs
parametric approach: image(array of numbers) → f(x,W) (score function) → 10 numbers giving class scores
- $x$ : input
- $W$ : weight or parameters
- $b$ : bias
  
  假设有10类，则最终得到10行1列的列向量，其中每个数字代表了是该类的可能性，数字越大可能性越大。
举例说明，下面是对于一个给定的 $W$ ，4个像素的image，分为3类的计算过程：

训练结果的可视化：
Linear Classification可以理解为平面上的直线，各分类器将平面上的不同区域分为不同类别：

所以有一些线性不可分问题，一层线性分类器是解决不了的，因为在平面上无法用一条直线将两类分开，如异或，或下图中的例子。

Lecture 3 | Loss Functions and Optimization

loss funciton: quantify how good/bad our current classifier is given a dataset {(xi,yi)}Ni=1 , where xi is image and yi is (integer) label.
1. $L = \frac{1}{N}\sum\limits_i L_i(f(x_i, W), y_i)$
2. Multiclass SVM loss: $s_i = f(x_i, W)$
  $L_i = \sum\limits_{j \neq y_i} \max(0, s_i - s_j + 1)$

若s都很小，约等于0，则loss等于类别数量减一，可以用来debug。
4. Loss等于0的W不只一个，比如2W。
5. 不应该关注training data上的performance，而关注testing data上的。

回归项使其倾向于选择一个更简单的 W <script type="math/tex" id="MathJax-Element-393">W</script>。
6. 常见regularizaton:举例

关注

0
点赞
踩
1

收藏

觉得还不错? 一键收藏
0
评论
复制链接

分享到 QQ

分享到新浪微博

扫一扫

专栏目录

评论

被折叠的条评论为什么被折叠?

到【灌水乐园】发言

查看更多评论

添加红包

成就一亿技术人!

hope_wisdom

发出的红包

实付元

使用余额支付

点击重新获取

扫码支付

钱包余额 0

抵扣说明：

1.余额是钱包充值的虚拟货币，按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载，可以购买VIP、付费专栏及课程。