Smoothing: Add-one smoothing

最新推荐文章于 2024-06-03 12:46:54 发布

weixin_34417635

最新推荐文章于 2024-06-03 12:46:54 发布

阅读量337

点赞数

文章标签：人工智能

原文链接：http://www.cnblogs.com/chuanlong/archive/2013/04/27/3047705.html

版权

From the previous blog, I know that there are a lot of zero, which will trigger many questions, such as unpredictability in test data, unavailability of preplexity. So, now we introduce the method smoothing.

previous : P(wi | wi-1) = c(wi-1, wi) / c(wi)

using smoothing: P(wi | wi-1) = ( c(wi-1, wi) + 1 ) / (c(wi-1) + V)

Then we can ensure that the p will not be zero. Now we can estimate this mothed.

We can use the Reconsitituted formula: c(wi-1, wi) = P(wi | wi-1) * c(wi-1) = ( c(wi-1, wi) + 1 ) / (c(wi-1) + V) * c(wi-1).

By using this formula, we can gain the the difference between them as following.

So add-one smoothing makes massive changes to our accounts. In other word, add-one estimation is a very blunt instrument. So in practice we don't actually use add-one smoothing for n-grams. we have better methods. But we do use add-one smoothing for other kinds of NLP models such text classification, or it will be used in similar kinds of domain where the number of zeros isn't so enormous.

转载于:https://www.cnblogs.com/chuanlong/archive/2013/04/27/3047705.html

确定要放弃本次机会？

福利倒计时

: :

立减 ¥

普通VIP年卡可用

立即使用

weixin_34417635

关注关注

0
点赞
踩
0

收藏

觉得还不错? 一键收藏
0
评论
Smoothing: Add-one smoothing

From the previous blog, I know that there are a lot of zero, which will trigger many questions, such as unpredictability in test data, unavailability of preplexity. So, now we introduce the method smo...
复制链接

扫一扫