激活函数公式、导数、图像笔记

秋山丶雪绪

已于 2023-04-10 15:22:52 修改

阅读量922

点赞数 1

文章标签： python pytorch 深度学习

于 2020-07-16 23:51:25 首次发布

本文链接：https://blog.csdn.net/weixin_43605641/article/details/107394116

版权

绘图代码

import numpy as np
import matplotlib
import matplotlib.pyplot as plt
import torch
import torch.nn as nn
import torch.nn.functional as F


def draw_ax(ax, x, y, title):
    ax.plot(x, y)
    ax.spines['right'].set_color('none')  # 去掉右边的边框线
    ax.spines['top'].set_color('none')  # 去掉上边的边框线
    # 移动下边边框线，相当于移动 X 轴
    ax.xaxis.set_ticks_position('bottom')
    ax.spines['bottom'].set_position(('data', 0))
    # 移动左边边框线，相当于移动 y 轴
    ax.yaxis.set_ticks_position('left')
    ax.spines['left'].set_position(('data', 0))
    plt.title(title, fontsize=15)
    plt.grid(True)


X = np.arange(-20, 20, 0.05)
x1 = torch.from_numpy(X).requires_grad_(True)

# title = 'Sigmoid'
# y1 = torch.sigmoid(x1)
# title = 'tanh'
# y1 = torch.tanh(x1)
# title = 'ReLU'
# y1 = torch.relu(x1)
# title = 'Leaky ReLU'
# y1 = F.leaky_relu(x1, 0.1)
# title = 'Swish'
# y1 = x1 * torch.sigmoid(0.1*x1)
# title = 'Mish'
# y1 = x1 * F.softplus(x1).tanh()
title = 'SiLU'
y1 = F.silu(x1)

y2 = torch.autograd.grad(outputs=y1, inputs=x1, grad_outputs=torch.ones_like(x1))

Y1 = y1.detach().numpy()
Y2 = y2[0].detach().numpy()

matplotlib.rc('figure', figsize=(24, 7))
ax1 = plt.subplot(1, 2, 1)
draw_ax(ax1, X, Y1, title)
ax2 = plt.subplot(1, 2, 2)
draw_ax(ax2, X, Y2, f'Derivative of {title}')

# ax1.plot(X, Y1, label=r'$\beta = 0.1$')
# plt.show()
plt.savefig('SiLU.jpg')

1. Sigmoid

公式： ${\rm{Sigmoid}}(x)=\frac{1}{1+e^{-x}}$
导数： ${\rm{Sigmoid}}^{\prime}(x)={\rm{Sigmoid}}(x)(1-{\rm{Sigmoid}}(x))$

在这里插入图片描述

2. tanh

公式： ${\rm{tanh}}(x)=\frac{e^x-e^{-x}}{e^x+e^{-x}}$
导数： ${\rm{tanh}}^{\prime}(x)=1-{\rm{tanh}}^2(x)$

在这里插入图片描述

3. ReLU

公式： ${\rm{ReLU}}(x)={\rm{max}}(0,x)$
导数：
${\rm{ReLU}}^{\prime}(x)= \begin{cases} 1, & \text{if $x>0$} \\ 0, & \text{if $x<0$} \end{cases}$

在这里插入图片描述

4. Leaky ReLU

公式： ${\rm{LeakyReLU}}(x)={\rm{max}}(0,x)+ {\rm{negative\_slope}}*{\rm{min}}(0,x)$
导数：
${\rm{LeakyReLU}}^{\prime}(x)= \begin{cases} 1, & \text{if $x>0$} \\ {\rm{negative\_slope}}, & \text{if $x<0$} \end{cases}$

在这里插入图片描述

5. Swish

公式： ${\rm{Swish}}(x)=x*{\rm{Sigmoid}}(\beta x)$

class Swish(nn.Module):
    def forward(self, x):
        return x * torch.sigmoid(x)

在这里插入图片描述

6. Mish

公式： ${\rm{Mish}}(x)=x*{\rm{tanh}}(ln(1+e^x))$

class Mish(nn.Module):
    def forward(self, x):
        return x * F.softplus(x).tanh()

在这里插入图片描述

7. SiLU

公式： ${\rm{SiLU}}(x)=x*{\rm{Sigmoid}}(x)$
在这里插入图片描述

8. FReLU

FReLU 是专门为视觉任务设计的激活函数，将ReLU和PReLU扩展为2D激活函数，论文地址。

公式：
$\begin{array}{c} f\left(x_{c, i, j}\right)=\max \left(x_{c, i, j}, \mathbb{T}\left(x_{c, i, j}\right)\right) \\ \\ \mathbb{T}\left(x_{c, i, j}\right)=x_{c, i, j}^{\omega} \cdot p_{c}^{\omega} \\ \\ x_{c, i, j}^{\omega} \cdot p_{c}^{\omega}=\sum\limits_{i-1 \leq h \leq i+1, j-1 \leq w \leq j+1} x_{c, h, w} \cdot p_{c, h, w} \end{array}$ 其中， $x_{c, i, j}$ 代表需要激活的像素， $c, i, j$ 对应 channel 和 2D 位置。

公式较为晦涩抽象，看论文中的图比较直观：在这里插入图片描述
实际上就是将与 $x$ 比大小的数 $(0, p x)$ 替换为以 $x$ 为中心的一个 $3\times 3$ 卷积值，边角部分 padding 1。
为了防止混淆个人将其叫做激活核。
激活核的参数用高斯初始化，参与网络训练，对于 $C$ 个通道的卷积输出，若用 $\times 3$ 大小的激活核进行激活，则额外需要训练 $\times 3 \times 3$ 个参数。文中尝试过平均池化和最大池化，都没有带参数的卷积效果好。

秋山丶雪绪

关注

1
点赞
踩
12

收藏

觉得还不错? 一键收藏
0
评论
激活函数公式、导数、图像笔记

目录1. Sigmoid2. tanh3. ReLU4. Leaky ReLU5. Swish6. Mish1. Sigmoid公式：Sigmoid(x)=11+ex {\rm{Sigmoid}}(x)=\frac{1}{1+e^x} Sigmoid(x)=1+ex1导数：Sigmoid′(x)=Sigmoid(x)(1−Sigmoid(x)) {\rm{Sigmoid}}^{\prime}(x)={\rm{Sigmoid}}(x)(1-{\rm{Sigmoid}}(x)) Sigmoid′(x
复制链接

扫一扫