DeepLearning tutorial（4）CNN卷积神经网络原理简介+代码详解

CNN卷积神经网络原理与Python+Theano实现

最新推荐文章于 2025-10-14 21:01:54 发布

原创

最新推荐文章于 2025-10-14 21:01:54 发布 · 8.8w 阅读

317 ·

CC 4.0 BY-SA版权

文章标签：

#deeplearning #卷积神经网络 #cnn #python #theano

本文深入浅出地介绍了CNN卷积神经网络的基本原理，并基于Python和Theano详细解读了LeNet5的实现过程，包括卷积、子采样、全连接层的构建和训练方法。同时，文章指出了代码实现与经典LeNet5结构的差异。

DeepLearning tutorial（4）CNN卷积神经网络原理简介+代码详解

@author：wepon

@blog：http://blog.csdn.net/u012162613/article/details/43225445

本文介绍多层感知机算法，特别是详细解读其代码实现，基于python theano，代码来自：Convolutional Neural Networks (LeNet)。经详细注释的代码和原始代码：放在我的github地址上，可下载。

一、CNN卷积神经网络原理简介

要讲明白卷积神经网络，估计得长篇大论，网上有很多博文已经写得很好了，所以本文就不重复了，如果你了解CNN，那可以往下看，本文主要是详细地解读CNN的实现代码。如果你没学习过CNN，在此推荐周晓艺师兄的博文：Deep Learning（深度学习）学习笔记整理系列之（七），以及UFLDL上的卷积特征提取、池化

CNN的最大特点就是稀疏连接（局部感受）和权值共享，如下面两图所示，左为稀疏连接，右为权值共享。稀疏连接和权值共享可以减少所要训练的参数，减少计算复杂度。

至于CNN的结构，以经典的LeNet5来说明：

这个图真是无处不在，一谈CNN，必说LeNet5，这图来自于这篇论文：Gradient-Based Learning Applied to Document Recognition，论文很长，第7页那里开始讲LeNet5这个结构，建议看看那部分。

我这里简单说一下，LeNet5这张图从左到右，先是input，这是输入层，即输入的图片。input-layer到C1这部分就是一个卷积层（convolution运算），C1到S2是一个子采样层（pooling运算），关于卷积和子采样的具体过程可以参考下图：

最低0.47元/天解锁文章

6 条评论

KimLee1895 2017.11.24
请问在python3.6中如何解决UnicodeDecodeError: 'ascii' codec can't decode byte 0x90 in position 614: ordinal not in range(128)

一棵树YKS 2017.10.15
请问github登陆不了，怎么办

jacksonjack001 2017.04.26
代码中这句 pooled_out = downsample.max_pool_2d( input=conv_out, ds=poolsize, ignore_border=True ) 在theano中已经过时了，用 from theano.tensor.signal.pool import pool_2d 和 #maxpooling，最大子采样过程 pooled_out = pool_2d( input=conv_out, ws=poolsize, ignore_border=True )
- jacksonjack001回复jacksonjack001 2017.09.06
  [reply]u013422403[/reply] 还有开头一处 from theano.tensor.signal import pool
- jacksonjack001回复jacksonjack001 2017.04.26
  [reply]u013422403[/reply] 或者看下官网http://deeplearning.net/software/theano/library/tensor/signal/pool.html?highlight=max_pool_2d#theano.tensor.signal.pool.max_pool_2d_same_size

Tron1994 2017.03.06
小白求问，怎么进行预测

qazasdwsx 2016.11.22
#lower layer上每个神经元获得的梯度来自于："num output feature maps * filter height * filter width" /pooling size 实在理解不能

eroszip 2016.09.01
楼主你好，自定义的网络层的时候，bp就直接采用theano.function然后加update的形式就可以了么，不同层的的敏感度以及梯度计算的细节是不是就无需考虑

元气少女缘结神 2016.08.22
你好，我的在提取CNN的Flatten层的特征时候出现问题：http://blog.csdn.net/wd1603926823/article/details/52223373 可以帮忙看下吗

AA5514113569 2016.04.20
想问下为什么图片中说入的单张图片是(32,32)，可代码中是（28,28）

sinat_31135199 2016.04.16
博主您好：您的前几个程序我都已经成功运行，但是运行这个程序时，会出现错误，如下： RuntimeError: Cuda error: k_copy_4d: invalid device function. Apply node that caused the error: GpuContiguous(GpuDimShuffle{1,0,2,3}.0) Toposort index: 22 Inputs types: [CudaNdarrayType(float32, 4D)] Inputs shapes: [(1, 20, 5, 5)] Inputs strides: [(0, 25, -5, -1)] Inputs values: ['not shown'] Outputs clients: [[GpuCorrMM_gradWeights{valid, (1, 1)}(GpuContiguous.0, GpuContiguous.0)]] HINT: Re-running with most Theano optimization disabled could give you a back-trace of when this node was created. This can be done with by setting the Theano flag 'optimizer=fast_compile'. If that does not work, Theano optimizations can be disabled with 'optimizer=None'. HINT: Use the Theano flag 'exception_verbosity=high' for a debugprint and storage map footprint of this apply node. 我在网上查了一些方法，并没有解决。请问您有什么好的解决方案吗？非常感谢您分享的经验！
- hello_csdn1111回复sinat_31135199 2017.03.14
  [reply]sinat_31135199[/reply] 这个问题你解决了吗？我以为出现了这个问题好像是版本不对应导致的

wstonea 2016.03.28
问问题　训练完后，把model保存起来 with open('%s/best_model_cnn_l3.pkl'%mstone.theano_path, 'wb') as f: pickle.dump(layer3, f) 之后再加载，报错 [code=python] predict_model = theano.function( [index], layer3.y_pred, givens={ x: test_set_x[index * batch_size: (index + 1) * batch_size] } ) [/code] －－－－－－－－－－－－ ... loading data parameter on_unused_input='warn' to theano.function. To disable it completely, use on_unused_input='ignore'.