1.12.在序列中查找出现次数最多的元素

最新推荐文章于 2021-09-06 18:34:18 发布

shinestaryu

最新推荐文章于 2021-09-06 18:34:18 发布

阅读量670

点赞数

分类专栏： Python 文章标签： Python Cookbook第三版

Python 专栏收录该内容

12 篇文章 0 订阅

订阅专栏

解决方法

collections.Counter类就是为此设计的，它甚至提供了most_common()方法来解决你的问题。

举个例子，查找words中的出现次数最多的词：

words = [
   'look', 'into', 'my', 'eyes', 'look', 'into', 'my', 'eyes',
   'the', 'eyes', 'the', 'eyes', 'the', 'eyes', 'not', 'around', 'the',
   'eyes', "don't", 'look', 'around', 'the', 'eyes', 'look', 'into',
   'my', 'eyes', "you're", 'under'
]
from collections import Counter
word_counts = Counter(words)
top_three = word_counts.most_common(3)
print(top_three)
# Outputs [('eyes', 8), ('the', 5), ('look', 4)]

讨论

Counter对象可以“吞吃”任何哈希元素的序列。一个Counter是映射元素到出现次数的字典。e.g：

>>> word_counts['not']
1
>>> word_counts['eyes']
8
>>>

如果想手动添加计数，只需简单添加：

>>> morewords = ['why','are','you','not','looking','in','my','eyes']
>>> for word in morewords:
...     word_counts[word] += 1
...
>>> word_counts['eyes']
9
>>>

还可以使用update()函数：

>>> word_counts.update(morewords)
>>>

更有趣的是，Counter实例可以简单的做大量的数学运算，e.g：

>>> a = Counter(words)
>>> b = Counter(morewords)
>>> a
Counter({'eyes': 8, 'the': 5, 'look': 4, 'into': 3, 'my': 3, 'around': 2,
         "you're": 1, "don't": 1, 'under': 1, 'not': 1})
>>> b
Counter({'eyes': 1, 'looking': 1, 'are': 1, 'in': 1, 'not': 1, 'you': 1,
         'my': 1, 'why': 1})
>>> # Combine counts
>>> c = a + b
>>> c
Counter({'eyes': 9, 'the': 5, 'look': 4, 'my': 4, 'into': 3, 'not': 2,
         'around': 2, "you're": 1, "don't": 1, 'in': 1, 'why': 1,
         'looking': 1, 'are': 1, 'under': 1, 'you': 1})
>>> # Subtract counts>>> d = a - b
>>> d
Counter({'eyes': 7, 'the': 5, 'look': 4, 'into': 3, 'my': 2, 'around': 2,
         "you're": 1, "don't": 1, 'under': 1})
>>>