[notes] ImageNet Classification with Deep Convolutional Neual Network

最新推荐文章于 2022-12-08 23:00:14 发布

AI记忆

最新推荐文章于 2022-12-08 23:00:14 发布

阅读量3.3k

点赞数

分类专栏： deep learning 深度学习论文与相关应用

本文链接：https://blog.csdn.net/sunbaigui/article/details/28105847

版权

深度学习论文与相关应用同时被 2 个专栏收录

100 篇文章 228 订阅

订阅专栏

deep learning

31 篇文章 1 订阅

订阅专栏

Paper:
ImageNet Classification with Deep Convolutional Neual Network

Achievements:
The model addressed by Alex etl. achieved top-1 and top-5 test error rate of 37.5% and 17.0% of classifying the 1.2 million high-resolution images in the ImageNet LSVRC-2010 contest into the 1000 different classes.

Model Architecture:

model architecture plot:

contains eight learned layers five convolutional and three fully-connected .
The kernels of the second, fourth, and fifth convolutional layers are connected only to those kernel maps in the previous layer which reside on the same GPU. The kernels of the third convolutional layer are connected to all kernel maps in the second layer .

Response-normalization layers follow the first and second convolutional layers . Max-pooling layers, of the kind described in Section 3.4, follow both response-normalization layers as well as the fifth convolutional layer . The ReLU non-linearity is applied to the output of every convolutional and fully-connected layer.

Interesting Points:
ReLU Nonlinearity: speed-up, six times faster than an equivalent network with tanh neurons.
Overlapping Pooling: enhance accuracy and prevent overfitting , reduces the top-1 and top-5 error rates by 0.4% and 0.3%; training model with overlapping pooling find it slightly more difficult to overfit.

Dropout：prevent overfitting, reduces complex co-adaptations of neurons, since a neuron cannot rely on the presence of particular other neurons. It is, therefore, forced to learn more robust features that are useful in conjunction with many different random subsets of the other neurons.