IndexError: string index out of range when using nltk.pos_tag


微电子学与固体电子学-俞驰


Category: Python Natural Language Processing


Reproducing the problem:

# -*- encoding:utf-8 -*-
import sys
reload(sys)
sys.setdefaultencoding('utf-8')
import nltk
from nltk.corpus import stopwords
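text="I've got a very big apple "  # note the trailing space
text=text.split(" ")  # the trailing space leaves an empty string '' as the last token
# text.remove('')
text = list(set(text))
print text
text_list=nltk.pos_tag(text)  # raises IndexError: string index out of range
print text_list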

for item in text_list:
    if item[1]=='JJ':  # if the word is an adjective
        print"item[0]=",item[0]
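The error message can be traced to the trailing space in the input. The following is a minimal sketch in Python 3 syntax that does not need NLTK: it shows that split(" ") leaves an empty string as the last token, and that taking the first character of an empty string raises the same IndexError. That the tagger indexes into each token internally is an assumption consistent with the message; where exactly this happens depends on the NLTK version. The helper first_char is hypothetical and only used for illustration.

```python
sentence = "I've got a very big apple "

# Splitting on a single space keeps the empty string after the trailing space.
tokens = sentence.split(" ")
print(tokens)  # the last element is ''


def first_char(word):
    """Return the first character of word; fails on an empty string."""
    return word[0]


try:
    first_char(tokens[-1])
    error_message = None
except IndexError as exc:
    error_message = str(exc)

print(error_message)  # string index out of range
```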

Solution: remove the empty string that split(" ") leaves behind before calling nltk.pos_tag:

# -*- encoding:utf-8 -*-
import sys
reload(sys)
sys.setdefaultencoding('utf-8')
import nltk
from nltk.corpus import stopwords
text="I've got a very big apple "
text=text.split(" ")
text.remove('')  # drop the empty string produced by the trailing space
text = list(set(text))
print text
text_list=nltk.pos_tag(text)
print text_list

for item in text_list:
    if item[1]=='JJ':  # if the word is an adjective
        print"item[0]=",item[0]
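A more general variant is sketched here in Python 3 syntax as an alternative to text.remove(''). Note that remove('') deletes only the first empty string and raises ValueError when there is none, so input with several consecutive spaces, or with no trailing space at all, would need extra handling. Calling split() with no argument splits on runs of whitespace and never yields empty strings. The helper name tokenize_for_tagging is hypothetical and not part of NLTK; unlike list(set(...)), it also keeps the original word order.

```python
def tokenize_for_tagging(sentence):
    """Split on any whitespace and drop duplicates, keeping first-seen order.

    Hypothetical helper for illustration; not part of NLTK.
    """
    seen = set()
    tokens = []
    for word in sentence.split():  # split() with no argument never yields ''
        if word not in seen:
            seen.add(word)
            tokens.append(word)
    return tokens


tokens = tokenize_for_tagging("I've got  a very big apple  ")
print(tokens)  # ["I've", 'got', 'a', 'very', 'big', 'apple']
# tokens can now be passed to nltk.pos_tag(tokens)
```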