tensorflow.nn.softmax实现方式

最新推荐文章于 2024-03-27 16:27:08 发布

VIP文章沉心修炼

最新推荐文章于 2024-03-27 16:27:08 发布

阅读量1.9k

点赞数

分类专栏： tensorflow

本文链接：https://blog.csdn.net/keep_vitality/article/details/82110326

版权

跑模型的时候遇到loss为nan的情况，图里面有对softmax归一化后的值取对数的操作，担心是这里算出来0。一般softmax的计算会减去序列的最大值。即

 tf.exp(logits - tf.reduce_max(logits))  / tf.reduce_sum(tf.exp(logits - tf.reduce_max(logits)))

但是看tf源码没看懂哪里在做这个运算，但是注释文档里写的这样算的：

def softmax(logits, dim=-1, name=None):
 '''
    Computes softmax activations.
    This function performs the equivalent of

    softmax = tf.exp(logits) / tf.reduce_sum(tf.exp(logits), dim)
  '''

不敢相信，毕竟减序列最大值是常规操作，所以测了一下看到底是怎么算的。

(Pdb) x = [1.0,1.0,

最低0.47元/天解锁文章

优惠劵

沉心修炼

关注关注

0
点赞
踩
1

收藏

觉得还不错? 一键收藏
0
评论
tensorflow.nn.softmax实现方式

跑模型的时候遇到loss为nan的情况，图里面有对softmax归一化后的值取对数的操作，担心是这里算出来0。一般softmax的计算会减去序列的最大值。即 tf.exp(logits - tf.reduce_max(logits)) / tf.reduce_sum(tf.exp(logits - tf.reduce_max(logits)))但是看tf源码没看懂哪里在做这个运算，但是...
复制链接

扫一扫