线性代数|机器学习-P33卷积神经网络ImageNet和卷积规则

取个名字真难呐

已于 2024-09-05 19:57:44 修改

阅读量1k

点赞数 14

文章标签：算法机器学习矩阵人工智能线性代数

于 2024-09-04 21:07:35 首次发布

本文链接：https://blog.csdn.net/scar2016/article/details/141904136

版权

文章目录

1. ImageNet
2. 卷积计算
3. 周期循环矩阵和非周期循环矩阵
4. 循环卷积特征值
5. Kronecker Product

1. ImageNet

ImageNet 的论文paper链接如下：详细请直接阅读相关论文即可
通过网盘分享的文件：imagenet_cvpr09.pdf
链接: https://pan.baidu.com/s/1Rkb6S5RbCHZUBrgUCIv0FA?pwd=6ffn 提取码: 6ffn

涉及到的知识点：
– drop-out–防止神经网络过拟合
– 正则化-- 方便数据训练

2. 卷积计算

2.1 两个多项式卷积

教授讲得通用公式听起来糊里糊涂的，就以简单的实际案例来解释吧！
假设我们有两个多项式表示如下：
$\begin{equation} P(x)=1+2x+3x^2;Q(x)=4+5x;H(x)=P(x)Q(x) \end{equation}$

两个多项式相乘后展开可得结果如下：
$\begin{equation} P(x)=1+2x+3x^2;Q(x)=4+5x;H(x)=P(x)Q(x) \end{equation}$
$\begin{equation} H(x)=4+13x+22x^2+15x^3 \end{equation}$
那么我们是否可以根据卷积的形式直接算出来了？
$\begin{equation} P(x):p=[1,2,3],Q(x):q=[4,5,0] \end{equation}$
那么两个序列卷积如下 ,可得，
多项式的乘积等同于其系数的卷积,多项式乘法可以看作是序列卷积的一个具体应用

2.2 函数卷积

函数卷积定义：若 $f (x), g (x)$ 有界且可积，以为函数卷积连续形式如下：
$\begin{equation} K(x)=f(x)*g(x)=\int_{-\infty}^{+\infty}f(t)g(x-t)\mathrm{dt} \end{equation}$

2.3 循环卷积

具体参考上节笔记
线性代数|机器学习-P32循环矩阵的特征向量-傅里叶矩阵

3. 周期循环矩阵和非周期循环矩阵

Toeplitz Matrix :
对于非周期循环矩阵来说，我们用托普利兹矩阵Toeplitz Matrix 表示，主要特点为斜对角值相等，但不循环
Circulant Matrix;
对于周期循环矩阵来说，我们用循环矩阵Circulant Matrix 表示，主要特点为斜对角值相等，并且元素循环，也是循环卷积矩阵，根据上节课学习可得，任意一个循环卷积矩阵C都可以是位移矩阵P的线性组合，并且矩阵P的特征向量为傅里叶矩阵。

在这里插入图片描述

周期循环矩阵C的特征向量为傅里叶矩阵，以4阶举例可得：
$\begin{equation} C=c_0+c_1P+c_2P^2+c_3P^3, P=\begin{bmatrix} 0&1&0&0\\\\ 0&0&1&0\\\\ 0&0&0&1\\\\ 1&0&0&0\end{bmatrix};F_4=\begin{bmatrix} 1&1&1&1\\\\ 1&i&i^2&i^3\\\\ 1&i^2&i^4&i^6\\\\ 1&i^3&i^6&i^9\end{bmatrix} \end{equation}$
循环矩阵C的特征值可以用傅里叶F表示：

4. 循环卷积特征值

4.1 卷积计算的分解

在这里插入图片描述

我们定义矩阵如下：
$\begin{equation} C=\begin{bmatrix} c_0&c_1\\\\ c_1&c_0 \end{bmatrix};D=\begin{bmatrix} d_0&d_1\\\\ d_1&d_0 \end{bmatrix};F=\begin{bmatrix} 1&1\\\\ 1&-1 \end{bmatrix} \end{equation}$
$\begin{equation} Fc=\begin{bmatrix} c_0+c_1\\\\ c_0-c_1 \end{bmatrix};Fd=\begin{bmatrix} d_0+d_1\\\\ d_0-d_1 \end{bmatrix};\end{equation}$
$\begin{equation} (Fc).* (Fd)=\begin{bmatrix} (c_0+c_1)(d_0+d_1)\\\\ (c_0-c_1)(d_0-d_1) \end{bmatrix}=\begin{bmatrix} c_0d_0+c_0d_1+c_1d_0+c_1d_1\\\\ c_0d_0-c_0d_1-c_1d_0+c_1d_1 \end{bmatrix};\end{equation}$
$\begin{equation} c\otimes d=\begin{bmatrix} c_0&c_1\\\\c_1&c_0 \end{bmatrix}\begin{bmatrix} d_0\\\\d_1 \end{bmatrix}=\begin{bmatrix} c_0d_0+c_1d_1\\\\c_1d_0+c_0d_1 \end{bmatrix};\end{equation}$
$\begin{equation} F(c\otimes d)=\begin{bmatrix} 1&1\\\\1&-1 \end{bmatrix}\begin{bmatrix} c_0d_0+c_1d_1\\\\c_1d_0+c_0d_1 \end{bmatrix}=\begin{bmatrix} c_0d_0+c_1d_1+c_1d_0+c_0d_1\\\\c_0d_0+c_1d_1-c_1d_0-c_0d_1 \end{bmatrix};\end{equation}$
小结卷积规则如下：

在这里插入图片描述

4.2 运算量

因为我们知道傅里叶变换中有一个大名鼎鼎的快速傅里叶变换的算法FFT，其运算复杂度为 $N\log N$

方式一：对于先卷积后傅里叶变换的计算量如下：
$\begin{equation} F(c\otimes d)=N^2+N\log N\end{equation}$
方式二：先进行傅里叶变换后在点积的计算量如下：
$\begin{equation} (Fc).* (Fd)=2N\log N+N\end{equation}$
当N=1024时，可得：
$\begin{equation} \frac{F(c\otimes d)}{(Fc).* (Fd)}=\frac{1024+10}{2*10+1}=49.238\end{equation}$
简单来说，对于同样的卷积计算来说，我们选择方式二，如果把数列先进行傅里叶变换，再将序列点乘，得到的计算量在N=1024情况下，方式一的计算量居然是方式二的接近50倍。简直令人发指！！！所以我们需要拥抱FFT快速傅里叶变换，将数据的处理换一种方式进行，这样可以大大提高程序运行的速度！！！真是伟大的傅里叶！！！

4.3 二维卷积公式

假设我们有两个函数 $f (x, y), g (x, y)$ ，它们的二维卷积公式如下：
$\begin{equation} h(x,y)=f(x,y)*g(x,y)=\int^{\infty}\int^{\infty}f(u,v)g(x-u,y-v)\mathrm{du}\mathrm{dv}\end{equation}$

5. Kronecker Product

Kronecker Product 介绍：
举例介绍：
$\begin{equation} A=\begin{bmatrix}1&2\\\\3&4\end{bmatrix};B=\begin{bmatrix}0&5\\\\6&7\end{bmatrix}\end{equation}$
$\begin{equation} A\otimes B=\begin{bmatrix}1\cdot B&2\cdot B\\\\3\cdot B&4\cdot B\end{bmatrix}= \begin{bmatrix} 0&5&0&10\\\\ 6&7&12&14\\\\ 0&15&0&20\\\\ 18&21&24&28 \end{bmatrix}\end{equation}$