Installing and using the spaCy library: a summary of spaCy usage in Python [work in progress]

Latest recommended article published on 2024-05-29 17:17:35

王润壮

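Article tags: installing and using the spacy library

Copyright notice: This is an original article by the author, licensed under CC 4.0 BY-SA. When reposting, please include a link to the original source and this notice.

Article link: https://blog.csdn.net/weixin_42509766/article/details/111907306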

spaCy library usage guide

1. Installation
2. Usage
2.1 Word tokenization (doc: token)
2.2 Sentence segmentation (doc.sents: sent)
2.3 Lemmatization (doc: token, token.lemma_, token.lemma)
2.4 Part-of-speech tagging (doc: token, token.pos_, token.pos)
2.5 Named entity recognition (doc.ents: ent, ent.label_, ent.label)
2.6 Noun phrase extraction (doc.noun_chunks)
2.7 Word similarity based on word vectors (doc[index_i].similarity(doc[index_j]))

1. Installation

See the summary at the end of my other post, "python spacy installation problems".
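A quick way to confirm that an installation worked is to check that both the spaCy package and an English model package can be found. The sketch below uses only the standard library. The model name `en_core_web_sm` and the helper name `missing_packages` are assumptions made for illustration, and the printed commands are the usual pip-based route, not necessarily the procedure described in the other post.

```python
import importlib.util


def missing_packages(names):
    """Return the names from `names` that cannot be found in this environment."""
    return [name for name in names if importlib.util.find_spec(name) is None]


# Assumed package names: the spaCy library itself and the small English model.
required = ["spacy", "en_core_web_sm"]
missing = missing_packages(required)

if "spacy" in missing:
    print("spaCy is not installed; try: pip install spacy")
if "en_core_web_sm" in missing:
    print("Model not found; try: python -m spacy download en_core_web_sm")
if not missing:
    print("spaCy and en_core_web_sm are both available")
```

Checking with `importlib.util.find_spec` avoids actually importing spaCy, so the check stays fast and does not fail with an `ImportError` when something is missing.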

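2. Usage

spaCy is a Python natural language processing toolkit that first appeared in mid-2014. It describes itself as "Industrial-Strength Natural Language Processing in Python". Much of spaCy is implemented in Cython to speed up the relevant modules, which sets it apart from the more academically oriented NLTK and makes it practical for production use.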

import spacy

# Load the small English pipeline; the model name must be passed as a string
nlp = spacy.load("en_core_web_sm")

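See the official spaCy documentation: https://spacy.io/usage/linguistic-features

At the time of writing, spaCy mainly supported English and German; newer releases provide pipelines for many more languages.

Features include word tokenization, sentence segmentation, lemmatization, part-of-speech tagging, named entity recognition, noun phrase extraction, similarity computation, and more.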
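The features listed above can be sketched in a single helper. This is a minimal illustration, assuming the `en_core_web_sm` model is installed; the function name `analyze` and the dictionary keys are hypothetical names chosen for this example. Similarity is left out because it depends on a model that ships real word vectors.

```python
def analyze(text, model_name="en_core_web_sm"):
    """Run a spaCy pipeline over `text` and collect a few common annotations."""
    # Imported inside the function so this file can be loaded
    # even in an environment where spaCy is not installed.
    import spacy

    nlp = spacy.load(model_name)
    doc = nlp(text)
    return {
        "tokens": [token.text for token in doc],
        "sentences": [sent.text for sent in doc.sents],
        "lemmas": [token.lemma_ for token in doc],
        "pos": [token.pos_ for token in doc],
        "entities": [(ent.text, ent.label_) for ent in doc.ents],
        "noun_chunks": [chunk.text for chunk in doc.noun_chunks],
    }
```

Calling `analyze("Apple is looking at buying a U.K. startup.")` returns one list per annotation type. The exact lemmas, tags, and entities depend on the model version, so they are not reproduced here.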

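2.1 Word tokenization (doc: token)

Tokenization separates English words and punctuation into individual tokens. If the text contains Chinese, the Chinese part is split only at the spaces between characters.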

In [3]: test_doc = nlp(u"it's word tokenize test for spacy")

In [4]: print(test_doc)
it's word tokenize test for spacy

In [5]: for token in test_doc:
   ...:     print(token)
   ...:
it
's
word
tokenize
test
for
spacy

test_doc is a spacy.tokens.doc.Doc object.
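Tokenization by itself does not need a trained model. The sketch below assumes `spacy.blank("en")`, which builds a pipeline containing only the English tokenizer; the helper name `tokenize` is hypothetical.

```python
def tokenize(text):
    """Split `text` into token strings using only spaCy's English tokenizer."""
    # Imported inside the function so this file can be loaded
    # even in an environment where spaCy is not installed.
    import spacy

    nlp = spacy.blank("en")  # tokenizer only, no trained components
    doc = nlp(text)          # doc is a spacy.tokens.doc.Doc
    return [token.text for token in doc]
```

For the sentence used in the session above, `tokenize("it's word tokenize test for spacy")` yields the same seven tokens, with `it's` split into `it` and `'s`.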

2.2 Sentence segmentation (doc.sents: sent)

In [6]: test_doc = nlp(u'Natura
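Sentence segmentation is exposed through `doc.sents`, which yields one `Span` per sentence. A minimal sketch, assuming spaCy v3 and its rule-based `sentencizer` component instead of the dependency parser in `en_core_web_sm`; the helper name `split_sentences` and the sample text in the usage note are illustrative only.

```python
def split_sentences(text):
    """Return the sentences of `text` as strings, using rule-based splitting."""
    # Imported inside the function so this file can be loaded
    # even in an environment where spaCy is not installed.
    import spacy

    nlp = spacy.blank("en")      # tokenizer only
    nlp.add_pipe("sentencizer")  # spaCy v3 API: punctuation-based sentence boundaries
    doc = nlp(text)
    return [sent.text for sent in doc.sents]
```

With this setup, `split_sentences("This is one sentence. This is another one.")` returns two strings, one per sentence. A pipeline loaded with `spacy.load` sets sentence boundaries from its parser instead, so its splits can differ on harder text.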
