Week 6-6: Language Modelling 3

Evaluation of LM

  • Extrinsic
  • Intrinsic
  • Correlate the two for validation purposes

Intrinsic: Perplexity

  • Does the model fit the data?
    • A good model will give high probability to a real sentence.
  • Perplexity
    $Per = \sqrt[N]{\frac{1}{P(w_1, w_2, \ldots, w_N)}}$

    • Average branching factor in predicting the next word
    • Lower perplexity -> higher probability
  • Logarithmic version
    $Per = 2^{-\frac{1}{N}\sum_{i=1}^{N}\log_2 P(w_i)}$
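
The logarithmic form of perplexity can be computed directly from the per-word probabilities a model assigns. The sketch below is only illustrative: the function name `perplexity` and the input list `word_probs` are assumptions, and the probabilities are made up rather than taken from a trained model.

```python
import math

def perplexity(word_probs):
    """Perplexity of a word sequence from the model's per-word probabilities.

    word_probs is a hypothetical list holding P(w_i) for each of the N words.
    """
    n = len(word_probs)
    # Sum of log2 probabilities; working in log space avoids underflow
    # when many small probabilities would otherwise be multiplied.
    log_sum = sum(math.log2(p) for p in word_probs)
    return 2 ** (-log_sum / n)

# A model that always spreads its mass evenly over 10 candidate words
# has a perplexity close to 10, matching the "branching factor" reading.
print(perplexity([0.1] * 5))
```

Raising any of the per-word probabilities lowers the result, which is the sense in which lower perplexity means a better fit.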

Cross entropy

$H(p, q) = -\sum_x p(x) \log q(x)$
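
As a small illustration of the cross entropy definition, the sketch below represents the two distributions as plain dictionaries. The name `cross_entropy`, the toy distributions, and the choice of base 2 for the logarithm (so the result is in bits) are assumptions made for this example.

```python
import math

def cross_entropy(p, q):
    """Cross entropy of q relative to p, in bits.

    p and q are hypothetical dicts mapping each outcome x to its probability.
    Outcomes with p(x) == 0 contribute nothing and are skipped.
    """
    return -sum(px * math.log2(q[x]) for x, px in p.items() if px > 0)

p = {"a": 0.5, "b": 0.5}      # "true" distribution
q = {"a": 0.25, "b": 0.75}    # model distribution

print(cross_entropy(p, p))    # equals the entropy of p
print(cross_entropy(p, q))    # larger, because q differs from p
```

The cross entropy is smallest when the model distribution matches the true one, which is why it can serve as a measure of model fit.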

Word error rate

  • number of insertions, deletions and substitutions
  • normalized by sentence length
  • same as Levenshtein edit distance, but at the word level
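
A minimal sketch of word error rate as word-level edit distance follows. It assumes whitespace tokenization and normalizes by the length of the reference sentence; the function name `word_error_rate` and the example sentences are made up for illustration.

```python
def word_error_rate(reference, hypothesis):
    """Word-level Levenshtein distance divided by the reference length."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # dp[i][j] = minimum number of edits turning ref[:i] into hyp[:j]
    dp = [[0] * (len(hyp) + 1) for _ in range(len(ref) + 1)]
    for i in range(len(ref) + 1):
        dp[i][0] = i                      # delete all i reference words
    for j in range(len(hyp) + 1):
        dp[0][j] = j                      # insert all j hypothesis words

    for i in range(1, len(ref) + 1):
        for j in range(1, len(hyp) + 1):
            cost = 0 if ref[i - 1] == hyp[j - 1] else 1
            dp[i][j] = min(
                dp[i - 1][j] + 1,         # deletion
                dp[i][j - 1] + 1,         # insertion
                dp[i - 1][j - 1] + cost,  # match or substitution
            )
    return dp[-1][-1] / len(ref)

# One substitution (sat -> sit) and one deletion (the) over 6 reference words.
print(word_error_rate("the cat sat on the mat", "the cat sit on mat"))
```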

Issues

  • Out-of-vocabulary (OOV) words

    • split the training set into two parts
    • label every word in part 2 that did not appear in part 1 as UNK
    • the probability estimated for UNK is then used for unknown words in the test data
  • Clustering

    • e.g., dates, monetary amounts, organizations, years
  • Long distance dependencies

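    • This is where n-gram models fail by definition: they only condition on the previous n-1 words
    • missing syntactic information
      • The students who participated in the game are tired. ("are" agrees with "students", not with the nearer noun "game")
    • missing semantic information
      • The pizza that I had yesterday was tasty. ("tasty" depends on "pizza")
      • The class that I had yesterday was interesting. ("interesting" depends on "class")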
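
The UNK procedure listed under the out-of-vocabulary issue can be sketched as below. This is a simplified unigram version under stated assumptions: the `<UNK>` token string, the helper names `estimate_with_unk` and `lookup`, and the toy sentences are all hypothetical, and no smoothing is applied.

```python
from collections import Counter

UNK = "<UNK>"  # hypothetical token name for unknown words

def estimate_with_unk(part1, part2):
    """Unigram estimates where part-2 words unseen in part 1 become UNK."""
    vocab = set(part1)
    # Relabel every part-2 word that is missing from part 1.
    mapped = [w if w in vocab else UNK for w in part2]
    counts = Counter(part1) + Counter(mapped)
    total = sum(counts.values())
    probs = {w: c / total for w, c in counts.items()}
    return vocab, probs

def lookup(word, vocab, probs):
    """Probability of a test word; unknown words fall back to UNK."""
    return probs[word if word in vocab else UNK]

part1 = "the cat sat on the mat".split()
part2 = "the dog sat on the rug".split()
vocab, probs = estimate_with_unk(part1, part2)

print(lookup("the", vocab, probs))    # seen word
print(lookup("zebra", vocab, probs))  # unseen word, uses the UNK estimate
```

Here "dog" and "rug" are relabelled as UNK, so an unseen test word such as "zebra" receives the mass that UNK collected in part 2 instead of zero probability.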

Other ideas in LM

  • Syntactic model
    • condition words on other words that appear in a specific syntactic relation with them
  • Caching model
    • take advantage of the fact that words appear in bursts
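
One common way to realize the caching idea is to interpolate a static model with the relative frequency of a word in a window of recent words. The sketch below assumes a unigram base model and linear interpolation; the class name `CachedUnigram`, the weight `lam`, the window size, and the toy probabilities are illustrative assumptions, not something specified in these notes.

```python
from collections import deque

class CachedUnigram:
    """Unigram model interpolated with a cache of recently seen words."""

    def __init__(self, base_probs, cache_size=100, lam=0.9):
        self.base = base_probs                  # static P(w), hypothetical values
        self.cache = deque(maxlen=cache_size)   # most recent words only
        self.lam = lam                          # weight of the static model

    def prob(self, word):
        base = self.base.get(word, 0.0)
        if not self.cache:
            return base                         # nothing cached yet
        cache_p = self.cache.count(word) / len(self.cache)
        return self.lam * base + (1 - self.lam) * cache_p

    def observe(self, word):
        self.cache.append(word)                 # oldest word drops out when full

model = CachedUnigram({"the": 0.5, "pizza": 0.1, "class": 0.4},
                      cache_size=10, lam=0.8)
before = model.prob("pizza")
model.observe("pizza")
model.observe("the")
after = model.prob("pizza")
print(before, after)  # the probability of "pizza" rises once it has been seen
```

Because the cache only holds recent words, a word that has just appeared gets a temporary boost that fades as the window moves on.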
