Python NLTK semantic analysis: how can Python NLTK parse out English phrases?

I spent some time studying NLTK and tried writing a bit of code. I have this piece of text:

>>> text = "I would't have the Scotland Yarders know it for the world"

>>> import nltk
>>> from nltk.collocations import *
>>> bigram_measures = nltk.collocations.BigramAssocMeasures()
>>> trigram_measures = nltk.collocations.TrigramAssocMeasures()
>>> tokens = nltk.wordpunct_tokenize(text)
>>> finder = BigramCollocationFinder.from_words(tokens)
>>> scored = finder.score_ngrams(bigram_measures.raw_freq)
>>> sorted(bigram for bigram, score in scored)
[("'", 't'), ('I', 'would'), ('Scotland', 'Yarders'), ('Yarders', 'know'), ('for', 'the'), ('have', 'the'), ('it', 'for'), ('know', 'it'), ('t', 'have'), ('the', 'Scotland'), ('the', 'world'), ('would', "'")]

I found this code through Google; I did not write it myself. Judging by the output, it does not seem able to pick out the phrase "for the world".
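A bigram finder only ever returns pairs of tokens, so a three-word phrase such as "for the world" cannot appear in its output. Below is a minimal sketch of the trigram variant, reusing the same text with NLTK's `TrigramCollocationFinder`. Note that within a single sentence every trigram occurs exactly once, so raw frequency can list the phrase as a candidate but cannot rank it above the others.

```python
import nltk
from nltk.collocations import TrigramAssocMeasures, TrigramCollocationFinder

text = "I would't have the Scotland Yarders know it for the world"
tokens = nltk.wordpunct_tokenize(text)

# Trigrams span three consecutive tokens, unlike the bigrams in the snippet above.
trigram_measures = TrigramAssocMeasures()
finder = TrigramCollocationFinder.from_words(tokens)
scored = finder.score_ngrams(trigram_measures.raw_freq)
trigrams = sorted(trigram for trigram, score in scored)

# The phrase is present as a candidate trigram...
print(("for", "the", "world") in trigrams)

# ...but every trigram has the same raw frequency, so nothing singles it out.
print({score for trigram, score in scored})
```

With a larger corpus, an association measure such as `trigram_measures.pmi` or `trigram_measures.likelihood_ratio` may separate real collocations from accidental neighbours; on one sentence there is too little data for that.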

My question: can the current NLTK accurately identify this phrase?
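One possible direction, offered as a sketch rather than a definitive answer, is chunking with `nltk.RegexpParser` over part-of-speech tags instead of counting collocations. The tags below are assigned by hand and the `PP` grammar rule is an illustrative assumption; normally a tagger such as `nltk.pos_tag` would supply the tags, and that requires downloading its model first.

```python
from nltk import RegexpParser

# Hand-assigned Penn Treebank tags for the tail of the sentence (an assumption,
# standing in for the output of a real part-of-speech tagger).
tagged = [
    ("have", "VB"), ("the", "DT"), ("Scotland", "NNP"), ("Yarders", "NNPS"),
    ("know", "VB"), ("it", "PRP"),
    ("for", "IN"), ("the", "DT"), ("world", "NN"),
]

# Hypothetical rule: a preposition, a determiner, then a singular noun.
grammar = "PP: {<IN><DT><NN>}"
parser = RegexpParser(grammar)
tree = parser.parse(tagged)

# Collect the words under every PP chunk found in the tree.
phrases = [
    " ".join(word for word, tag in subtree.leaves())
    for subtree in tree.subtrees(lambda t: t.label() == "PP")
]
print(phrases)
```

This matches a syntactic pattern, not an idiom: it would just as readily return "for the dog", so whether it counts as "accurately identifying" the phrase depends on what kind of phrase is wanted.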
