【Python】使用itertools.groupby()进行列表的归类和个数统计

最新推荐文章于 2024-08-22 11:43:59 发布

小灰狼码代码ing

最新推荐文章于 2024-08-22 11:43:59 发布

阅读量4.2k

点赞数 1

分类专栏：关于Python的一系列有趣的事情文章标签： python 开发语言后端

本文链接：https://blog.csdn.net/Thueii/article/details/121116582

版权

本文介绍了如何使用Python的itertools.groupby()函数对列表进行归类和统计连续元素的个数。从问题来源出发，详细讲解了groupby函数的使用，包括直接使用列表、列表套字典以及使用函数作为参数的场景，并给出了具体的示例代码，帮助读者理解其工作原理和应用。

摘要由CSDN通过智能技术生成

一、问题来源

刷题的时候想要快速统计一个列表当中连续出现的元素的个数，除了自己实现之外，想要更快速的方式就百度了一下，查到了itertools模块的groupby方法，挺有意思的，所以做一个记录

二、groupby概述

0、文档介绍

本段下面有总结，不想看的话这里可以跳过
官方文档指路： python中itertools库的官方文档
节选：

itertools.groupby(iterable, key=None)

Make an iterator that returns consecutive keys and groups from the iterable. The key is a function computing a key value for each element. If not specified or is None, key defaults to an identity function and returns the element unchanged. Generally, the iterable needs to already be sorted on the same key function.

The operation of groupby() is similar to the uniq filter in Unix. It generates a break or new group every time the value of the key function changes (which is why it is usually necessary to have sorted the data using the same key function). That behavior differs from SQL’s GROUP BY which aggregates common elements regardless of their input order.

The returned group is itself an iterator that shares the underlying iterable with groupby(). Because the source is shared, when the groupby() object is advanced, the previous group is no longer visible. So, if that data is needed later, it should be stored as a list:

    groups = []
    uniquekeys = []
    data = sorted(data, key=keyfunc)
    for k, g in groupby(data, keyfunc):
        groups.append(list(g))      # Store group iterator as a list
        uniquekeys.append(k)

    groupby() is roughly equivalent to:

    class groupby:
        # [k for k, g in groupby('AAAABBBCCDAABBB')] --> A B C D A B
        # [list(g) for k, g in groupby('AAAABBBCCD')] --> AAAA BBB CC D
        def __init__(self, iterable, key=None):
            if key is None:
                key = lambda x: x
            self.keyfunc = key
            self.it = iter(iterable)
            self.tgtkey = self.currkey = self.currvalue = object()
        def __iter__(self):
            return self
        def __next__(self):
            self.id = object()
            while self.currkey == self.tgtkey:
                self.currvalue = next(self.it)    # Exit on StopIteration
                self.currkey = self.keyfunc(self.currvalue)
            self.tgtkey = self.currkey
            return (self.currkey, self._grouper(self.tgtkey, self.id))
        def _grouper(self, tgtkey, id):
            while self.id is id and self.currkey