moses-chart笔记


1. extract-rules

1.1 Span Size Limit : The limit on span sizes can be set with max-chart-span. In fact its default is 10, which is not a useful setting for syntax models.

from http://www.statmt.org/moses/?n=Moses.SyntaxTutorial


1.2 max-phrase-length 

from http://comments.gmane.org/gmane.comp.nlp.moses.user/4194


1.3 max-phrase-length 在chart抽短语时候指的是初始短语长度

在training中设置max-phrase-length为5。extract rule显示:


而extract-rules后面的参数为:


默认:--MaxSpan[10]  --MinWords[1] | --MaxSymbolsTarget[999] | --MaxSymbolsSource[5] | --MaxNonTerm[2] 


1.4 Glue rules

<s> [X] ||| <s> [S] ||| 1 ||| ||| 0
[X][S] </s> [X] ||| [X][S] </s> [S] ||| 1 ||| 0-0 ||| 0
[X][S] [X][X] [X] ||| [X][S] [X][X] [S] ||| 2.718 ||| 0-0 1-1 ||| 0

这几条规则的含义见:http://comments.gmane.org/gmane.comp.nlp.moses.user/9253


1.5 Rule format

http://www.statmt.org/moses/manual/manual.pdf

关键点:非终结符对应关系不看编号,看alignment。

  • 0
    点赞
  • 0
    收藏
    觉得还不错? 一键收藏
  • 0
    评论
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值