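Requirements
The attachment contains the text of the Chinese edition of The Silence of the Lambs (《沉默的羔羊》). Read the text, segment it into words, and output the most frequent word whose length is greater than 2.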
Solution
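import jieba

def getText():
    # Read the whole novel and lowercase any Latin letters
    with open("沉默的羔羊.txt", "r", encoding="utf-8") as f:
        return f.read().lower()

# Segment the text into a list of words
data = jieba.lcut(getText())

# Count words that are purely alphabetic and longer than 2 characters
rs = {}
for w in data:
    if w.isalpha() and len(w) > 2:
        rs[w] = rs.get(w, 0) + 1

# Sort by count, then by word, both descending; the first item is the answer
rsSorted = sorted(rs.items(), key=lambda kv: (kv[1], kv[0]), reverse=True)
word, count = rsSorted[0]
print(word)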
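
The counting and tie-breaking step of the solution can be tried without jieba or the novel file. The following is a minimal sketch, not part of the original answer: the helper name most_frequent_word, its min_len parameter, and the sample token list are all hypothetical, and the tokens merely stand in for what jieba.lcut might return. It uses collections.Counter and max() in place of the dictionary and full sort, keeping the same rule that ties on count go to the larger string.

```python
from collections import Counter

def most_frequent_word(tokens, min_len=3):
    """Return the most frequent alphabetic token with at least min_len characters.

    Ties on count are broken by the larger string, which matches taking the
    first item of sorted(..., key=lambda kv: (kv[1], kv[0]), reverse=True).
    Returns None when no token qualifies.
    """
    counts = Counter(t for t in tokens if t.isalpha() and len(t) >= min_len)
    if not counts:
        return None
    word, _ = max(counts.items(), key=lambda kv: (kv[1], kv[0]))
    return word

# Hypothetical tokens standing in for the output of jieba.lcut(...)
sample_tokens = ["史达琳", "博士", "联邦调查局", "史达琳", ",", "莱克特", "联邦调查局", "史达琳"]
print(most_frequent_word(sample_tokens))
```

Unlike indexing rsSorted[0], this version returns None instead of raising IndexError when no word passes the filter, and max() avoids sorting the whole vocabulary when only the top entry is needed.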
Source:
Python语言程序设计 (Python Language Programming), 13th session