transformers.generator_utils函数源码解析之RepetitionPenaltyLogitsProcessor

最新推荐文章于 2023-11-27 17:50:42 发布

will-wil

最新推荐文章于 2023-11-27 17:50:42 发布

阅读量2.2k

点赞数 1

分类专栏： nlp学习笔记文章标签： python 自然语言处理机器翻译

本文链接：https://blog.csdn.net/yangyanbao8389/article/details/121651056

版权

nlp学习笔记专栏收录该内容

7 篇文章 0 订阅

订阅专栏

主要记录源码中解决文本生成中词组重复出现的问题，代码中有具体操作解析。

class RepetitionPenaltyLogitsProcessor(LogitsProcessor):
    r"""
    :class:`transformers.LogitsProcessor` enforcing an exponential penalty on repeated sequences.

    Args:
        repetition_penalty (:obj:`float`):
            The parameter for repetition penalty. 1.0 means no penalty. See `this paper
            <https://arxiv.org/pdf/1909.05858.pdf>`__ for more details.
    """

    def __init__(self, penalty: float):
        if not isinstance(penalty, float) or not (penalty > 0):
            raise ValueError(f"`penalty` has to be a strictly positive float, but is {penalty}")

        self.penalty = penalty

    def __call__(self, input_ids: torch.LongTensor, scores: torch.FloatTensor) -> torch.FloatTensor:
        #scores为cur-step的词表分布[batch,seq,vocab_size]，input_ids为输入decoder的文本序列[batch,seq]，则score则是获取当前已经生成文本序列的token概率
        score = torch.gather(scores, 1, input_ids) 

        # if score < 0 then repetition penalty has to be multiplied to reduce the previous token probability
        #减少已经出现的token的概率
        score = torch.where(score < 0, score * self.penalty, score / self.penalty) 
        
        #将减少后的概率重分配到原始的cur-step词表分布中
        scores.scatter_(1, input_ids, score) 
        return scores

will-wil

关注

1
点赞
踩
3

收藏

觉得还不错? 一键收藏
0
评论
transformers.generator_utils函数源码解析之RepetitionPenaltyLogitsProcessor

主要记录源码中解决文本生成中词组重复出现的问题，代码中有具体操作解析。class RepetitionPenaltyLogitsProcessor(LogitsProcessor): r""" :class:`transformers.LogitsProcessor` enforcing an exponential penalty on repeated sequences. Args: repetition_penalty (:obj:`float`):
复制链接

扫一扫