Python词频小工具，可以直接调用

最新推荐文章于 2023-09-08 17:08:24 发布

六日～

最新推荐文章于 2023-09-08 17:08:24 发布

阅读量710

点赞数

分类专栏：文本文章标签： python 自然语言处理

本文链接：https://blog.csdn.net/qq_43151062/article/details/122814236

版权

文本专栏收录该内容

4 篇文章 0 订阅

订阅专栏

1.先定义FreqWords（）函数

from collections import Counter
import jieba

#计算词频
def FreqWords(txt, n_top=None, stopwords = None):
    #分词
    words = jieba.cut(txt)
    #去掉停用词
    if stopwords:
        words = [w for w in words if w not in stopwords]
    #计算词频
    freq = Counter(words)
    
    if n_top:
        return freq.most_common(n_top)
    else:
        return freq
    
if __name__=='__main__':
    file_path = input('请输入文本文件路径：')
    with open(file_path,encoding = 'utf-8') as f:
        txt = f.read()
    stop_path = input('请输入停用词文件路径：')
    with open(stop_path,encoding = 'utf-8') as stop:
        stopwords = stop.read()
        result = FreqWords(txt, 10, stopwords)
    print(result)

2.直接调用即可

import FreqWords

r = FreqWords.FreqWords('啦啦啦，今天天气很好',10,None)
print(r)

确定要放弃本次机会？

福利倒计时

: :

立减 ¥

普通VIP年卡可用

立即使用

六日～

关注关注

0
点赞
踩
0

收藏

觉得还不错? 一键收藏
0
评论
Python词频小工具，可以直接调用

1.先定义FreqWords（）函数from collections import Counterimport jieba#计算词频def FreqWords(txt, n_top=None, stopwords = None): #分词 words = jieba.cut(txt) #去掉停用词 if stopwords: words = [w for w in words if w not in stopwords] #计算词频
复制链接

扫一扫