sampled_softmax_loss can be seen as a way to speed up the softmax computation during training.
import tensorflow as tf
help(tf.nn.sampled_softmax_loss)
Help on function sampled_softmax_loss_v2 in module tensorflow.python.ops.nn_impl:
sampled_softmax_loss_v2(weights,
                        biases,
                        labels,
                        inputs,
                        num_sampled,
                        num_classes,
                        num_true=1,
                        sampled_values=None,
                        remove_accidental_hits=True,
                        seed=None,
                        name='sampled_softmax_loss')
Computes and returns the sampled softmax training loss.
This is a faster way to train a softmax classifier over a huge number of
classes.
This operation is for training only. It is generally an underestimate of
the full softmax loss.
A common use case is to use this method for training, and calculate the full
softmax loss for evaluation or inference, as in the following example:
```python
if mode == "train":
  loss = tf.nn.sampled_softmax_loss(
      weights=weights,
      biases=biases,
      labels=labels,
      inputs=inputs,
      ...)
elif mode == "eval":
  logits = tf.matmul(inputs, tf.transpose(weights))
  logits = tf.nn.bias_add(logits, biases)
  labels_one_hot = tf.one_hot(labels, n_classes)
  loss = tf.nn.softmax_cross_entropy_with_logits(
      labels=labels_one_hot,
      logits=logits)
```
See our [Candidate Sampling Algorithms Reference]
(https://www.tensorflow.org/extras/candidate_sampling.pdf)
Also see Section 3 of [Jean et al., 2014](http://arxiv.org/abs/1412.2007)
([pdf](http://arxiv.org/pdf/1412.2007.pdf)) for the math.
Note: when doing embedding lookup on `weights` and `bias`, "div" partition
strategy will be used. Support for other partition strategy will be added
later.
Args:
weights: A `Tensor` of shape `[num_classes, dim]`, or a list of `Tensor`
objects whose concatenation along dimension 0 has shape [num_classes,
dim]. The (possibly-sharded) class embeddings.
biases: A `Tensor` of shape `[num_classes]`. The class biases.
labels: A `Tensor` of type `int64` and shape `[batch_size, num_true]`. The
target classes. Note that this format differs from the `labels` argument
of `nn.softmax_cross_entropy_with_logits`.
inputs: A `Tensor` of shape `[batch_size, dim]`. The forward activations of
the input network.
num_sampled: An `int`. The number of classes to randomly sample per batch.
num_classes: An `int`. The number of possible classes.
num_true: An `int`. The number of target classes per training example.
sampled_values: a tuple of (`sampled_candidates`, `true_expected_count`,
`sampled_expected_count`) returned by a `*_candidate_sampler` function.
(if None, we default to `log_uniform_candidate_sampler`)
remove_accidental_hits: A `bool`. whether to remove "accidental hits"
where a sampled class equals one of the target classes. Default is True.
seed: random seed for candidate sampling. Default to None, which doesn't set
the op-level random seed for candidate sampling.
name: A name for the operation (optional).
Returns:
A `batch_size` 1-D tensor of per-example sampled softmax losses.
What the arguments mean in practice:
weights: shape `[num_classes, dim]`. num_classes is the number of classes, i.e. the vocabulary size; dim is the embedding dimension of the input. (In YouTubeNet, this is the matrix of item embedding vectors.)
biases: shape `[num_classes]`, one bias per class.
labels: shape `[batch_size, num_true]`. batch_size is the number of examples per batch; with num_true = 1, each row holds the index of that example's positive class.
inputs: shape `[batch_size, dim]`. batch_size is the number of examples per batch; dim is the embedding dimension.
num_sampled: the number of negative classes to sample.
num_classes: the number of classes, i.e. the vocabulary size.
num_true: the actual number of positive classes per example.
sampled_values: the sampled negatives. If None, the default `log_uniform_candidate_sampler` is used, which preferentially samples frequent words (low indices) as negatives.
remove_accidental_hits: whether to drop sampled negatives that happen to equal a positive class.
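The glossary above can be made concrete with a small NumPy sketch of what the op computes. This is an illustrative approximation, not TensorFlow's implementation: it samples negatives uniformly (TensorFlow defaults to a log-uniform sampler, in which case a per-class log-expected-count correction is subtracted from the logits; for a uniform sampler that correction is a constant and cancels inside the softmax), and all toy sizes and names are made up.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical toy sizes standing in for the real arguments.
batch_size, dim = 4, 8
num_classes, num_sampled = 1000, 16

weights = rng.normal(size=(num_classes, dim))           # class embeddings
biases = rng.normal(size=num_classes)                   # class biases
inputs = rng.normal(size=(batch_size, dim))             # forward activations
labels = rng.integers(0, num_classes, size=batch_size)  # positive class per row (num_true=1)

# Sample negatives uniformly without replacement (TF defaults to log-uniform).
sampled = rng.choice(num_classes, size=num_sampled, replace=False)

# Logits only over [true class | sampled classes], not over all num_classes.
cand = np.concatenate(
    [labels[:, None], np.broadcast_to(sampled, (batch_size, num_sampled))], axis=1)
logits = np.einsum('bd,bcd->bc', inputs, weights[cand]) + biases[cand]

# remove_accidental_hits: mask sampled negatives that collide with the label.
logits[:, 1:][cand[:, 1:] == labels[:, None]] = -1e9

# Softmax cross-entropy with the true class in column 0.
logits -= logits.max(axis=1, keepdims=True)             # numerical stability
log_probs = logits - np.log(np.exp(logits).sum(axis=1, keepdims=True))
loss = -log_probs[:, 0]                                 # shape [batch_size]
```

The expensive part of a full softmax, a matmul against all `num_classes` embeddings, is replaced by a gather of only `num_true + num_sampled` rows per example.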
If anything is still unclear, work through this example to build intuition:
https://github.com/shenweichen/DeepMatch/blob/master/examples/run_youtubednn.py
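To see why the docstring calls the sampled loss "generally an underestimate of the full softmax loss", compare the two on toy data. In this sketch the sampler is uniform (so the log-expected-count correction is a constant and cancels in the softmax) and the negatives are kept disjoint from the labels, i.e. accidental hits are already removed; all sizes and names are illustrative.

```python
import numpy as np

def softmax_ce(logits, true_col):
    """Per-example softmax cross-entropy; true_col indexes the true class column."""
    logits = logits - logits.max(axis=1, keepdims=True)
    logp = logits - np.log(np.exp(logits).sum(axis=1, keepdims=True))
    return -logp[np.arange(len(logits)), true_col]

rng = np.random.default_rng(1)
batch, dim, num_classes, num_sampled = 4, 8, 1000, 16
weights = rng.normal(size=(num_classes, dim))
biases = rng.normal(size=num_classes)
inputs = rng.normal(size=(batch, dim))
labels = rng.integers(0, num_classes, size=batch)

# Full softmax loss (the "eval" branch of the docstring example).
full_logits = inputs @ weights.T + biases
full_loss = softmax_ce(full_logits, labels)

# Sampled loss over [true class | negatives], with negatives disjoint from labels.
sampled = np.array([c for c in rng.permutation(num_classes) if c not in labels][:num_sampled])
cand = np.concatenate(
    [labels[:, None], np.broadcast_to(sampled, (batch, num_sampled))], axis=1)
sampled_logits = np.einsum('bd,bcd->bc', inputs, weights[cand]) + biases[cand]
sampled_loss = softmax_ce(sampled_logits, np.zeros(batch, dtype=int))

# The sampled denominator sums over a strict subset of the classes, so the
# sampled loss comes out below the full loss for every example here.
underestimates = (sampled_loss < full_loss).all()
```

This is why the sampled loss is recommended for training only: for evaluation, the eval branch in the docstring example computes the exact full-softmax loss.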