About Python: what is itertools.groupby() used for?


仙女味果铺


Article link: https://blog.csdn.net/weixin_42303627/article/details/114935438


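While reading the Python documentation I came across itertools.groupby(). It was not very straightforward, so I decided to look for information on Stack Overflow. I found some in "How do I use Python's itertools.groupby()?".

There did not seem to be much information about it here or in the documentation, so I decided to post my observations for comment.

Thanks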

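Did you check the groupby() documentation? Which part of it is not straightforward?

The first sentence of the OP's question states that they read the Python documentation.

You asked a question for which you already had a detailed answer prepared? Really? Why not put all of this in the question and leave the answer section for discussion?

@广角治疗师 Asking a question just to answer it yourself is perfectly acceptable. I have done it myself. As for "why not put all of this in the question": because the answer is not part of the question. An answer is an answer.

My actual question was, "Which part of it is not straightforward?". I mentioned the link to the documentation only to make sure the OP had checked the official Python documentation and not some tutorial.

@Moinuddinchuadri That is perfectly valid, and I was not criticizing your question. I was only pointing out that the first part of your question had already been answered.

I will reply to you, Moinuddin. I am fairly new to Python, and I often get frustrated when looking for solutions. I did read the docs. groupby() is among the most complicated parts. I have not wrapped my head around the whole class thing yet. The example in the docs is not very clear either. I did not think I should sit and wait for someone else to ask this question before answering it. I hope my being proactive does not offend you. I said clearly that I was only posting for comments. I may have missed a thing or added two.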

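First, you can read the documentation here.

I will put what I consider the most important point first. I hope the reason becomes clear after the examples.

Always sort the items using the same key you use for grouping, to avoid unexpected results.

itertools.groupby(iterable, key=None or some func) takes a list of iterables and groups them based on a specified key. The key specifies the operation to apply to each individual item, and its result is then used as the heading for each group of items; consecutive items that end up with the same "key" value end up in the same group.

The return value is an iterator of (key, group) pairs, loosely similar to a dictionary in that each entry has the form {key : value}.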
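To make the shape of the return value concrete, here is a minimal sketch. The word list and the choice of len as the key function are illustrative assumptions, not part of the original answer. It shows that groupby yields (key, group) pairs one at a time rather than building a dictionary.

```python
from itertools import groupby

words = ['ant', 'bee', 'cat', 'horse', 'zebra', 'ox']  # illustrative data

# Each step yields a key and a lazy iterator over one run of consecutive
# items that share that key; list() materializes the run.
pairs = [(k, list(g)) for k, g in groupby(words, key=len)]
print(pairs)
```

Turning the pairs into a dictionary is a separate step that the caller chooses to do, which is what the examples below do with dic[k] = list(v).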

Example 1

from itertools import groupby

# Note that the tuple counts as one item in this list. No key is
# specified, so each item in the list is its own key.
c = groupby(['goat', 'dog', 'cow', 1, 1, 2, 3, 11, 10, ('persons', 'man', 'woman')])
dic = {}
for k, v in c:
    dic[k] = list(v)
dic

Result

{1: [1, 1],
 'goat': ['goat'],
 3: [3],
 'cow': ['cow'],
 ('persons', 'man', 'woman'): [('persons', 'man', 'woman')],
 10: [10],
 11: [11],
 2: [2],
 'dog': ['dog']}

Example 2

# Notice that mulato, cow and cat don't show up. Only the last run with a given
# key survives, because assigning to dic[k] replaces the earlier result.
# The last result for 'c' actually wipes out two earlier items.
list_things = ['goat', 'dog', 'donkey', 'mulato', 'cow', 'cat', ('persons', 'man', 'woman'),
               'wombat', 'mongoose', 'malloo', 'camel']
c = groupby(list_things, key=lambda x: x[0])
dic = {}
for k, v in c:
    dic[k] = list(v)
dic

Result

{'c': ['camel'],
 'd': ['dog', 'donkey'],
 'g': ['goat'],
 'm': ['mongoose', 'malloo'],
 'persons': [('persons', 'man', 'woman')],
 'w': ['wombat']}
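The missing items are easier to understand if every (key, group) pair is kept instead of being written into a dictionary. The following sketch reuses the list from Example 2; the variable names runs and keys are my own. It shows that the keys 'm' and 'c' are each produced twice for the unsorted input, so the later assignment overwrites the earlier one.

```python
from itertools import groupby

list_things = ['goat', 'dog', 'donkey', 'mulato', 'cow', 'cat',
               ('persons', 'man', 'woman'), 'wombat', 'mongoose',
               'malloo', 'camel']

# Keep every run as a (key, items) pair so repeated keys stay visible.
runs = [(k, list(v)) for k, v in groupby(list_things, key=lambda x: x[0])]
keys = [k for k, _ in runs]
print(keys)
# ['g', 'd', 'm', 'c', 'persons', 'w', 'm', 'c']
```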

Now, for the sorted version

# Observe the sorted version, where the data is first sorted on the same key used for grouping.
list_things = ['goat', 'dog', 'donkey', 'mulato', 'cow', 'cat', ('persons', 'man', 'woman'),
               'wombat', 'mongoose', 'malloo', 'camel']
sorted_list = sorted(list_things, key=lambda x: x[0])
print(sorted_list)
print()
c = groupby(sorted_list, key=lambda x: x[0])
dic = {}
for k, v in c:
    dic[k] = list(v)
dic

Result

['cow', 'cat', 'camel', 'dog', 'donkey', 'goat', 'mulato', 'mongoose', 'malloo', ('persons', 'man', 'woman'), 'wombat']

{'c': ['cow', 'cat', 'camel'],
 'd': ['dog', 'donkey'],
 'g': ['goat'],
 'm': ['mulato', 'mongoose', 'malloo'],
 'persons': [('persons', 'man', 'woman')],
 'w': ['wombat']}

Example 3

things = [("animal", "bear"), ("animal", "duck"), ("plant", "cactus"), ("vehicle", "harley"),
          ("vehicle", "speed boat"), ("vehicle", "school bus")]
dic = {}
f = lambda x: x[0]  # group on the first element of each pair
for key, group in groupby(sorted(things, key=f), f):
    dic[key] = list(group)
dic

Result

{'animal': [('animal', 'bear'), ('animal', 'duck')],
 'plant': [('plant', 'cactus')],
 'vehicle': [('vehicle', 'harley'),
             ('vehicle', 'speed boat'),
             ('vehicle', 'school bus')]}

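Now the sorted version again, this time with input that is not already in key order. I changed the tuples to lists. Either way, the result is the same.

things = [["animal", "bear"], ["animal", "duck"], ["vehicle", "harley"], ["plant", "cactus"],
          ["vehicle", "speed boat"], ["vehicle", "school bus"]]
dic = {}
f = lambda x: x[0]  # group on the first element of each pair
for key, group in groupby(sorted(things, key=f), f):
    dic[key] = list(group)
dic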

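Result

{'animal': [['animal', 'bear'], ['animal', 'duck']],
 'plant': [['plant', 'cactus']],
 'vehicle': [['vehicle', 'harley'],
             ['vehicle', 'speed boat'],
             ['vehicle', 'school bus']]}

"itertools.groupby(iterable, key=None or some func) takes a list of iterables": does it take a list of iterables, or just one iterable? A list is itself an iterable.

The docs do not say so explicitly. But from the examples I posted you can see that I used both a list and nested lists. So it can take an "iterable" (Example 1) and a "list of iterables" (Example 2). You can even pass a string and you are still in business.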
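As a small illustration of the string case mentioned above, the sketch below passes a string directly to groupby and counts the length of each run of identical characters. The input string and variable names are assumptions chosen for the example.

```python
from itertools import groupby

text = 'aaabccdd'  # illustrative input; a string is an iterable of characters

# With no key function, each character is its own key, so every run of
# identical consecutive characters becomes one group.
runs = [(ch, len(list(g))) for ch, g in groupby(text)]
print(runs)
# [('a', 3), ('b', 1), ('c', 2), ('d', 2)]
```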

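As always, you should check the documentation of the function first. However, itertools.groupby is certainly one of the trickiest itertools, because it has some possible pitfalls:

It only groups items whose key result is the same for consecutive items: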

from itertools import groupby

for key, group in groupby([1, 1, 1, 1, 5, 1, 1, 1, 1, 4]):
    print(key, list(group))
# 1 [1, 1, 1, 1]
# 5 [5]
# 1 [1, 1, 1, 1]
# 4 [4]

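If you want a full group-by, you can sort the input with sorted beforehand.

It yields two items, and the second one is an iterator (so you need to iterate over the second item!). I explicitly had to cast these to a list in the previous example.

If you advance the groupby iterator, the second yielded element is discarded: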

it = groupby([1, 1, 1, 1, 5, 1, 1, 1, 1, 4])
key1, group1 = next(it)
key2, group2 = next(it)
print(key1, list(group1))
# 1 []

Even though group1 is not empty!
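One way to avoid this pitfall, sketched here with the same input, is to materialize each group before asking the groupby iterator for the next pair. The names first and second are my own.

```python
from itertools import groupby

it = groupby([1, 1, 1, 1, 5, 1, 1, 1, 1, 4])

# Consume each group while it is still the current one; the group
# iterators share the underlying input with the outer iterator.
key1, group1 = next(it)
first = list(group1)
key2, group2 = next(it)
second = list(group2)

print(key1, first)
print(key2, second)
```

Iterating with a for loop and calling list(group) inside the loop body, as in the earlier examples, has the same effect.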

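As already mentioned, you can use sorted to do an overall group-by operation, but that is very inefficient (and memory-inefficient if you want to use groupby on a generator). If you cannot guarantee that the input is sorted, there are better alternatives, which also avoid the O(n log(n)) sorting overhead: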

[inline code missing in the source]

[inline code missing in the source]

Probably more.
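The names of the alternatives did not survive in this copy. As one possible sketch of a sort-free approach, assuming collections.defaultdict from the standard library (the helper name group_unsorted is hypothetical and not from the original answer), all items can be grouped in a single pass over the input:

```python
from collections import defaultdict

def group_unsorted(iterable, key=None):
    """Hypothetical helper: group all items by key without sorting first."""
    groups = defaultdict(list)
    for item in iterable:
        # Mirror groupby's convention: no key function means the item is its own key.
        k = item if key is None else key(item)
        groups[k].append(item)
    return dict(groups)

result = group_unsorted([1, 1, 1, 1, 5, 1, 1, 1, 1, 4])
print(result)
# {1: [1, 1, 1, 1, 1, 1, 1, 1], 5: [5], 4: [4]}
```

Unlike groupby, this collects non-adjacent items with the same key into one group, at the cost of requiring hashable keys and holding every group in memory.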

However, it is quite good for checking local properties. The itertools recipes section has two such recipes:

from itertools import groupby

def all_equal(iterable):
    "Returns True if all the elements are equal to each other"
    g = groupby(iterable)
    # True when there is at most one group in the whole input.
    return next(g, True) and not next(g, False)
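A short usage sketch for this recipe follows; the sample inputs are my own. The recipe works because groupby yields exactly one group when every element is equal, so a second call to next falls back to its default.

```python
from itertools import groupby

def all_equal(iterable):
    "Returns True if all the elements are equal to each other"
    g = groupby(iterable)
    return next(g, True) and not next(g, False)

print(all_equal([1, 1, 1]))  # True: a single group
print(all_equal([1, 2, 1]))  # False: a second group exists
print(all_equal([]))         # True: no groups at all
```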

And:

from itertools import groupby
from operator import itemgetter

def unique_justseen(iterable, key=None):
    "List unique elements, preserving order. Remember only the element just seen."
    # unique_justseen('AAAABBBCCDAABBB') --> A B C D A B
    # unique_justseen('ABBCcAD', str.lower) --> A B C A D
    return map(next, map(itemgetter(1), groupby(iterable, key)))
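The following sketch runs this recipe on the two inputs from its own comments. It relies on map being lazy: the first element of each group is taken before groupby advances to the next group, so the pitfall described earlier does not apply.

```python
from itertools import groupby
from operator import itemgetter

def unique_justseen(iterable, key=None):
    "List unique elements, preserving order. Remember only the element just seen."
    return map(next, map(itemgetter(1), groupby(iterable, key)))

plain = list(unique_justseen('AAAABBBCCDAABBB'))
folded = list(unique_justseen('ABBCcAD', str.lower))
print(plain)   # ['A', 'B', 'C', 'D', 'A', 'B']
print(folded)  # ['A', 'B', 'C', 'A', 'D']
```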

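Thanks. I will certainly keep that in mind in case I need some alternatives. For now I am reading the documentation section by section so that I do not mix everything up. Happy new year to you.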
