BLEU源码笔记

最新推荐文章于 2023-03-25 17:30:29 发布

luputo

最新推荐文章于 2023-03-25 17:30:29 发布

阅读量793

点赞数

本文链接：https://blog.csdn.net/luo3300612/article/details/96454408

版权

BLEU源码笔记

本文参考代码为coco-caption

回顾

详细解释参见我的这篇博客，本文仅仅是代码解释

BLEU是2002年提出的一个机器翻译的自动度量，它从n-gram准确率的角度对比机器翻译和人工翻译的结果，计算公式为
$BP\exp\big(\sum_{n=1}^Nw_n\log p_n\big)$
其中
$\begin{cases} 1& \text{if c>r}\\ e^{(1-r/c)}& \text{else} \end{cases}$
比如当我计算下例的bleu值时

candidate = "It is a guide to action which ensures that the military always obeys the commands of the party."
reference1 = "It is a guide to action that ensures that the military will forever heed Party commands."
reference2 = "It is the guiding principle which guarantees the military forces always being under the command of the Party."
reference3 = "It is the practical guide for the army always to heed the directions of the party."

得到的结果即为（对原句进行大小写和标点的处理）

BLEU1:0.9444444444444444,
BLEU2:0.7453559924999299,
BLEU3:0.6240726989348756,
BLEU4:0.5045666840058485

需要注意的是，这里的BLEUi是上面公式中 $N = i$ 的结果，而非仅仅是 $p_i$ （我一开始这么以为是因为发现BLEU1= $p 1$ ，但BLEU2不等于 $p_2$ ，以为官方代码出错了，结果和自己写的代码怎么比都不对，后来才发现这只是N=1的特殊情况）

在coco-caption中的对应代码为

bleus = []
bleu = 1.
for k in range(n):
    bleu *= float(comps['correct'][k] + tiny) \
            / (comps['guess'][k] + small)
    bleus.append(bleu ** (1./(k+1)))
ratio = (self._testlen + tiny) / (self._reflen + small) ## N.B.: avoid zero division
if ratio < 1:
    for k in range(n):
        bleus[k] *= math.exp(1 - 1/ratio)

其中tiny、small均为防止分子分母为0而添加的微小值，comps['correct'][k]就是正确的k-gram（clip之后）数量，comps['guess'][k]就是candidate中k-gram的数量，ratio就是公式中的BP

这里实际上对计算公式做了些变形
$\begin{aligned} BLEU &= BP\exp\big(\sum_{n=1}^Nw_n\log p_n\big)\\ &=BP\prod_{n=1}^Ne^{w_n\log p_n}\\ &=BP\prod_{n=1}^N(e^{\log p_n})^{w_n}\\ &=BP(\prod_{n=1}^Np_n)^{w_n}\\ \end{aligned}$
此时与代码中变量bleu的连乘和加入到列表bleus中使用的指数就相互对应了

这是计算一句candidate和多句references的方法，如果对于整个candidate和references的集合计算整体的bleu，其计算方法就是将每一对的comps中对应的元素加起来，如前文所说，comps包括comps['correct'][k]就是正确的k-gram（clip之后）数量，comps['guess'][k]就是candidate中k-gram的数量，comps['reflen']就是BP计算公式中的r，comps['testlen']就是BP计算公式中的’c’，将它们逐个加起来，然后整体返回到上面的代码中计算即可

luputo

关注

0
点赞
踩
5

收藏

觉得还不错? 一键收藏
0
评论
BLEU源码笔记

BLEU源码笔记本文参考代码为coco-caption回顾详细解释参见我的这篇博客，本文仅仅是代码解释BLEU是2002年提出的一个机器翻译的自动度量，它从n-gram准确率的角度对比机器翻译和人工翻译的结果，计算公式为BLEU=BPexp⁡(∑n=1Nwnlog⁡pn)BLEU = BP\exp\big(\sum_{n=1}^Nw_n\log p_n\big)BLEU=BPexp(...
复制链接

扫一扫