统计水浒传完整姓名前十位：jieba库应用，python编程

才疏学浅的莫笑天

于 2020-01-05 01:30:23 发布

阅读量4.3k

点赞数 8

本文链接：https://blog.csdn.net/qq_45804132/article/details/103839366

版权

本文介绍了如何使用Python的jieba库来统计《水浒传》中出现的完整人物姓名。通过处理文本，过滤掉单个字符和非姓名词汇，然后对姓名进行计数并按出现次数排序。

摘要生成于 C知道，由 DeepSeek-R1 满血版支持，前往体验 >

直入主题，我们需要用到jieba库的一些函数，这个python库是国内大神编写的。

我们需要用到文件的一部分内容，这里我们还需要两个txt文本

1.水浒传部分文本（也可以是全部文本）

2.水浒传内所有完整的姓名（除称号外）

文本在网上可以找得到，我直接上代码了

import jieba
txt=open("AllManAreBrothers.txt","rb").read()
txt_name=open("heros_name.txt","rb").read()
words=jieba.lcut(txt)
words_name=jieba.lcut(txt_name)
counts={}
for word in words:
    if len(word)==1:
        continue
    if word not in words_name:
        continue
    counts[word]=counts.get(word,0)+1
sorted(counts.items(), key=lambda x:x[0], reverse=True)
for i in range(10):
    word,count=items[i]
    print("{0:<10}{1:>5}".format(word,count))