怎样克服神经网络训练中argmax的不可导性？

最新推荐文章于 2023-08-23 04:53:50 发布

酷暑冷冰

最新推荐文章于 2023-08-23 04:53:50 发布

阅读量1.2k

点赞数 4

分类专栏：机器学习文章标签：机器学习深度学习 python

本文链接：https://blog.csdn.net/weixin_43913077/article/details/121370864

版权

机器学习专栏收录该内容

6 篇文章 0 订阅

订阅专栏

文章目录

1. strainght through Gumbel (estimator)
2. stop gradient operation
3. 可以对argmax/argmin 这种不可导的操作直接忽视，也就是锁定

1. strainght through Gumbel (estimator)

令： $a r g m a x (v) = s o f t m a x (v) + c; c = a r g m a x (v) - s o f t m a x (v), 且为常数$
在这里插入图片描述

2. stop gradient operation

在这里插入图片描述
方法：正向传播就和往常一样，反向传播时，将梯度从不可导那个点copy到不可导点的前面的最近一个可导点。
$q u a n t i z e = i n p u t + (q u a n t i z e - i n p u t) . d e t a c h ()$

3. 可以对argmax/argmin 这种不可导的操作直接忽视，也就是锁定

就是抛弃不可传导的位置

class ArgMax(torch.autograd.Function):
	@staticmethod
	def forward(ctx, input):
        idx = torch.argmax(input, 1)
        output = torch.zeros_like(input)
        output.scatter_(1, idx, 1) # 此处直接用1来替换argmax的位置，抛弃了此处的梯度
        return output
	
	@staticmethod
	def backward(ctx, grad_output):
        return grad_output

酷暑冷冰

关注

4
点赞
踩
9

收藏

觉得还不错? 一键收藏
0
评论
怎样克服神经网络训练中argmax的不可导性？

文章目录1. strainght through Gumbel (estimator)2. stop gradient operation3. 可以对argmax/argmin 这种不可导的操作直接忽视，也就是锁定1. strainght through Gumbel (estimator)令：argmax(v)=softmax(v)+c;c=argmax(v)−softmax(v),且为常数argmax(v)=softmax(v) + c ; c=argmax(v) -softmax(v),且为常数
复制链接

扫一扫