SentiWordNet计算情感倾向

最新推荐文章于 2022-11-13 14:11:43 发布

兔唧唧不秃

最新推荐文章于 2022-11-13 14:11:43 发布

阅读量1.7k

点赞数 1

文章标签：自然语言处理 python

本文链接：https://blog.csdn.net/melodyzhan/article/details/123502591

版权

自然语言处理菜鸡的学习笔记专栏收录该内容

1 篇文章 0 订阅

订阅专栏

使用NLTK提供的SentiWordNet工具计算一个句子的情感倾向性，计算方法为每个词所处词性下的每个词义情感倾向性之和。

import string

from nltk.tokenize import word_tokenize
from nltk import pos_tag
from nltk.corpus import stopwords
from nltk.corpus import sentiwordnet
from nltk.corpus import wordnet

# 停用词
stpw = stopwords.words('english')
# 标点符号
punc = list(string.punctuation)
# 不需要分析的词和标点
stop = punc + stpw

# 要分析的句子
sentence = "His performance is so great that he got the price. "
# 1.标记解析
words = word_tokenize(sentence)
for word in words:
    if word.lower() in stop:
        words.remove(word)
print(words)
# 2.词性标注
word_tag = pos_tag(words)
tag_map = {'NN': 'n', 'NNP': 'n', 'NNPS': 'n', 'NNS': 'n', 'UH': 'n',\
           'VB': 'v', 'VBD': 'v', 'VBG': 'v', 'VBN': 'v', 'VBP': 'v', 'VBZ': 'v',\
           'JJ': 'a', 'JJR': 'a', 'JJS': 'a',\
           'RB': 'r', 'RBR': 'r', 'RBS': 'r', 'RP': 'r', 'WRB': 'r'}

word_tag = [(t[0], tag_map[t[1]]) if t[1] in tag_map else (t[0], '') for t in word_tag]
print(word_tag)
# 同义词集senti_synsets()
sentiment_synsets = [list(sentiwordnet.senti_synsets(t[0], t[1])) for t in word_tag]
print(sentiment_synsets)

score = sum(sum([x.pos_score() - x.neg_score() for x in s]) / len(s) for s in sentiment_synsets if len(s) != 0)
print(score)

兔唧唧不秃

关注

1
点赞
踩
8

收藏

觉得还不错? 一键收藏
0
评论
SentiWordNet计算情感倾向

使用NLTK提供的SentiWordNet工具计算一个句子的情感倾向性，计算方法为每个词所处词性下的每个词义情感倾向性之和。import stringfrom nltk.tokenize import word_tokenizefrom nltk import pos_tagfrom nltk.corpus import stopwordsfrom nltk.corpus import sentiwordnetfrom nltk.corpus import wordnet# 停用词stp
复制链接

扫一扫