卷积神经网络 理解
This article will help others understand common terminologies involved in Convolution Neural Network in the form of ‘what is what’ in plain English and without much interference of mathematics or statistics.
本文将帮助其他人以简单的英语“什么是什么”的形式理解卷积神经网络中涉及的常用术语,而不会过多地干扰数学或统计学。
You can refer to my other blog for ‘What is what’ understanding of neural networks.
您可以参考我的其他博客,以了解神经网络的“什么是什么”。
Let's start…
开始吧…
Convolutional Neural Network:or CNN or ConvNets is a variant of Deep Learning algorithm for learning and analyzing visual tasks.
卷积神经网络: CNN或ConvNets是深度学习算法的一种变体,用于学习和分析视觉任务。
Key applications: Object recognition and detectionFace recognition
关键应用:物体识别和检测面部识别
Image Representation: Images can be represented in a matrix form and each cell is a pixel. The hand-written number 1 can be represented in a matrix form as below (the image is taken from MNIST dataset). Depending upon the intensity of the darkness in each pixel the value ranges between 0 and 1. Rest all cells are 0.
图像表示:图像可以矩阵形式表示,每个像元都是一个像素。 手写数字1可以如下所示以矩阵形式表示(图像取自MNIST数据集)。 根据每个像素中暗度的强度,该值的范围为0到1。其余所有像元均为0。
![Image for post](https://i-blog.csdnimg.cn/blog_migrate/0243a200b5e986c31b76cd0474dc4922.png)
And, all colored images are represented using Red Green and Blue channels as we know already. Depending upon the concentration of each RGB color at a pixel, the actual color is defined. We can represent the colored images as well in a Mat