stanford-ner sn.tag() 过慢提速

最新推荐文章于 2024-09-10 14:42:30 发布

veeoni

最新推荐文章于 2024-09-10 14:42:30 发布

阅读量215

点赞数 2

分类专栏：命名实体识别文章标签：机器学习深度学习自然语言处理

本文链接：https://blog.csdn.net/veeoni/article/details/117535095

版权

命名实体识别专栏收录该内容

2 篇文章 0 订阅

订阅专栏

# 斯坦福的命名实体识别工具包
sn = StanfordNERTagger('F://VEN/stanford-ner-2020-11-17/classifiers/english.muc.7class.distsim.crf.ser.gz',
                       path_to_jar='F://VEN/stanford-ner-2020-11-17/stanford-ner.jar')

# 进行识别
ne_annotated_sentences = [sn.tag(sent) for sent in tokenized_sentences]

每句话调用一次斯坦福的工具包，每次都要重启JVM，速度过慢

改进：将所有的语句存成列表，作为参数调用sn.tag_sents()方法

ne_annotated_sentences = sn.tag_sents(tokenized_sentences)

参考资料 StackOverFlow

确定要放弃本次机会？

福利倒计时

: :

立减 ¥

普通VIP年卡可用

立即使用

veeoni

关注关注

2
点赞
踩
0

收藏

觉得还不错? 一键收藏
0
评论
stanford-ner sn.tag() 过慢提速

# 斯坦福的命名实体识别工具包sn = StanfordNERTagger('F://VEN/stanford-ner-2020-11-17/classifiers/english.muc.7class.distsim.crf.ser.gz', path_to_jar='F://VEN/stanford-ner-2020-11-17/stanford-ner.jar')# 进行识别ne_annotated_sentences = [sn.tag(sent
复制链接

扫一扫