Priority queue

Latest recommended article published on 2021-11-15 00:30:34

钟炳祥

Categories: Python, Data Structures. Tags: queue, python, algorithms

Article link: https://blog.csdn.net/zhongbing1234/article/details/108965926

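A queue is a data structure that follows the first-in, first-out (FIFO) principle. In Python, a list is enough for a simple implementation: create a list, append elements to it, and call pop(0) to take out the element that was added first.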

queue = []
queue.append(2)
queue.append(3)
queue.append(1)
queue.pop(0)

# output: 2
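As a side note, list.pop(0) has to shift every remaining element, so it runs in O(n) time. A minimal sketch of the same FIFO behavior, assuming the standard library's collections.deque is acceptable here (it is not used in the original code), could look like this:

```python
from collections import deque

queue = deque()
queue.append(2)
queue.append(3)
queue.append(1)

first = queue.popleft()  # removes the oldest element in O(1) time
print(first)             # 2
print(list(queue))       # [3, 1]
```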

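Queues have many applications; breadth-first search (BFS), for example, relies on one. In some situations, however, elements should be taken out not in order of arrival but in order of priority, and that calls for a priority queue.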

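Put simply, a priority queue stores elements just as a queue does, but it removes them according to their priority. There are two common variants: a max-priority queue always removes the element with the highest priority, and a min-priority queue always removes the one with the lowest. Following this description, a small change to the code above is enough to get priority-queue behavior.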

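Suppose we serve customers according to their VIP level, highest level first. We add each customer together with their VIP level to the list, sort the list in descending order, and take the first element, which is the customer with the highest VIP level.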

pq = []
pq.append((2,"Tom"))
pq.append((3,"Tina"))
pq.append((1,"Tony"))
pq.sort(reverse=True)
pq.pop(0)

# output: (3, 'Tina')
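A possible variation on the list approach, sketched here under the assumption that the standard bisect module may be used (the original code does not use it): keep the list sorted on every insertion, so no separate sort call is needed and the highest level sits at the end of the list. Insertion still costs O(n) because elements have to be shifted, so this only moves the cost around.

```python
import bisect

pq = []
for entry in [(2, "Tom"), (3, "Tina"), (1, "Tony")]:
    bisect.insort(pq, entry)   # keeps pq sorted in ascending order

top = pq.pop()                 # the last element has the highest VIP level
print(top)                     # (3, 'Tina')
```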

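A list-based priority queue is simple, but it does not extend well. For example, we may want to get back the customer object itself rather than everything that was stored in the list, and a plain list of tuples cannot do that. In addition, pop(0) shifts every remaining element to a new index, so it takes O(n) time. Is there a way to lower the time complexity and make the priority queue more extensible at the same time?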

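In the code above, removing the highest-priority element takes O(n) time. Which data structure can remove the maximum or the minimum in less than O(n) time? A binary heap can. As Heapsort shows, in a max-heap the maximum is always the first element, so we take it out and then call Max-heapify to restore the heap property. Max-heapify runs in O(log n) time, so removing the maximum takes O(log n) time overall, which is better than linear time.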
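To make the extract-max step concrete, here is a minimal sketch. It assumes a 0-based Python list that already satisfies the max-heap property; the function names max_heapify and extract_max are chosen for illustration and follow the usual textbook naming rather than any library API.

```python
def max_heapify(heap, i, size):
    """Sift heap[i] down until the subtree rooted at i is a max-heap (0-based)."""
    while True:
        left, right = 2 * i + 1, 2 * i + 2
        largest = i
        if left < size and heap[left] > heap[largest]:
            largest = left
        if right < size and heap[right] > heap[largest]:
            largest = right
        if largest == i:
            return
        heap[i], heap[largest] = heap[largest], heap[i]
        i = largest


def extract_max(heap):
    """Remove and return the largest element of a non-empty max-heap."""
    heap[0], heap[-1] = heap[-1], heap[0]   # move the maximum to the end
    largest = heap.pop()                    # remove it in O(1)
    max_heapify(heap, 0, len(heap))         # restore the heap in O(log n)
    return largest


heap = [9, 7, 8, 3, 5, 2]   # already satisfies the max-heap property
print(extract_max(heap))    # 9
print(heap[0])              # 8
```

Swapping the root with the last element before popping is what keeps the removal itself O(1); only the sift-down depends on the height of the tree.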

Below I show how to build a priority queue on top of a binary heap, using a min-priority queue as the example.

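Three methods are essential:

Insert: inserts an element.
Extract_Min: removes and returns the element with the lowest priority.
Decrease_Key: lowers the priority of an element, then swaps elements as needed to restore the heap property.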

The Python code is as follows:

class PriorityQueue:
    """Min-heap-based priority queue, using 1-based indexing. Adapted from CLRS.
    
    Augmented to include a map of keys to their indices in the heap so that
    key lookup is constant time and decrease_key(key) is O(log n) time.
    """
    
    def __init__(self):
        """Initializes the priority queue."""
        self.heap = [None] # To make the index 1-based.
        self.key_index = {} # key to index mapping.
    
    def __len__(self):
        return len(self.heap) - 1
       
    def __getitem__(self, i):
        return self.heap[i]

    def __setitem__(self, i, key):
        self.heap[i] = key

    def decrease_key(self, key):
        """Restores the heap property after the value of key has been decreased.

        Does nothing if key is not in the priority queue."""
        index = self.key_index.get(key)  # .get avoids a KeyError for unknown keys
        if index is not None:
            self._decrease_key(index, key)
    
    def insert(self, key):
        """Inserts a key into the priority queue."""
        self.heap.append(key)
        self.key_index[key] = len(self)
        self._decrease_key(len(self), key)

    def extract_min(self):
        """Removes and returns the minimum key."""
        if len(self) < 1:
            return None
        self._swap(1, len(self))
        smallest = self.heap.pop()  # renamed so the builtin min() is not shadowed
        del self.key_index[smallest]
        self._min_heapify(1)
        return smallest
    
    def _decrease_key(self, i, key):
        """Moves the key at a given index up until the heap property holds.
        
        Args:
            i: index of the key.
            key: key with decreased value.
        """
        while i > 1:
            parent = i // 2
            if self[parent] > key:
                self._swap(i, parent)
                i = parent
            else:
                break
            
    def _min_heapify(self, i):
        """Restores the heap property from index i downwards."""
        l = 2 * i
        r = 2 * i + 1
        smallest = i
        if l <= len(self) and self[l] < self[i]:
            smallest = l
        if r <= len(self) and self[r] < self[smallest]:
            smallest = r
        if smallest != i:
            self._swap(i, smallest)
            self._min_heapify(smallest)

    def _swap(self, i, j):
        # Swaps the key at indices i and j and updates the key to index map.
        self.heap[i], self.heap[j] = self.heap[j], self.heap[i]
        self.key_index[self.heap[i]], self.key_index[self.heap[j]] = i, j

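The following example shows how to use the code above. Suppose I want to order cars by value and take out the least valuable car each time. For convenience, I create a Car class. The code above builds the binary heap by comparing keys, which suggests inserting each car's value with insert, but then there would be no way to get back the corresponding car. Instead, we insert the Car objects themselves and build the heap according to their values, so the Car class has to define comparison methods, as follows: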

class Car(object):
    def __init__(self, name, value):
        self.name = name
        self.value = value

    def __lt__(self, other):
        # Compare by value; break ties by id(name) so that the order is total.
        return (self.value < other.value or
                (self.value == other.value and
                 id(self.name) < id(other.name)))

    def __le__(self, other):
        # Same rule as __lt__, but equal ids also count.
        return (self.value < other.value or
                (self.value == other.value and
                 id(self.name) <= id(other.name)))

    def __gt__(self, other):
        # Mirror image of __lt__.
        return (self.value > other.value or
                (self.value == other.value and
                 id(self.name) > id(other.name)))

    def __ge__(self, other):
        # Mirror image of __le__.
        return (self.value > other.value or
                (self.value == other.value and
                 id(self.name) >= id(other.name)))

# Module-level code: it must not be indented into the Car class body.
car1 = Car("BMW", 45)
car2 = Car("Maybach", 145)
car3 = Car("Bugatti", 85)
car4 = Car("Cadillac", 78)
car5 = Car("Maserati", 85)
pq = PriorityQueue()
pq.insert(car1)
pq.insert(car2)
pq.insert(car3)
pq.insert(car4)
pq.insert(car5)
print("Queue size: {0}".format(len(pq)))
print(pq.extract_min().name)

# output:
# Queue size: 5
# BMW

BMW has the lowest value, so extract_min returns the car1 object, whose name is BMW.

Python's standard library already provides modules with this functionality. One is heapq, which only implements a min-priority queue.

import heapq

q = []

heapq.heappush(q, (2, 'code'))
heapq.heappush(q, (1, 'eat'))
heapq.heappush(q, (3, 'sleep'))

print(heapq.heappop(q))

# output: (1, 'eat')
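Since heapq only offers a min-heap, one common workaround for getting max-priority behavior is to store negated priorities. The sketch below assumes the priorities are numbers, so that negation reverses their order; it is one possible approach, not something the original post prescribes.

```python
import heapq

q = []
for priority, task in [(2, "code"), (1, "eat"), (3, "sleep")]:
    heapq.heappush(q, (-priority, task))   # store the negated priority

neg_priority, task = heapq.heappop(q)      # smallest negated = largest original
print((-neg_priority, task))               # (3, 'sleep')
```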

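Another is PriorityQueue in the queue module. Its time complexity is the same as heapq's; the difference is that PriorityQueue is synchronized and provides locking semantics to support multiple concurrent producers and consumers.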

from queue import PriorityQueue

q = PriorityQueue()

q.put((2, 'code'))
q.put((1, 'eat'))
q.put((3, 'sleep'))

print(q.get())
# output: (1, 'eat')
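To illustrate the synchronization point, here is a small sketch with one consumer thread. The consumer function, the results list, and the (float("inf"), None) sentinel are illustrative choices made for this example, not part of the queue API; the queue is filled before the thread starts so that the output order is deterministic.

```python
import threading
from queue import PriorityQueue

q = PriorityQueue()
results = []

def consumer():
    while True:
        priority, task = q.get()      # blocks until an item is available
        if task is None:              # sentinel: no more work
            q.task_done()
            break
        results.append((priority, task))
        q.task_done()

# Fill the queue before the consumer starts so the order is deterministic.
for item in [(2, "code"), (1, "eat"), (3, "sleep")]:
    q.put(item)
q.put((float("inf"), None))           # sentinel sorts after all real items

worker = threading.Thread(target=consumer)
worker.start()
q.join()                              # wait until every item is processed
worker.join()
print(results)                        # [(1, 'eat'), (2, 'code'), (3, 'sleep')]
```

With plain heapq, the same pattern would need an explicit lock around every push and pop.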

References:

1. Introduction to Algorithms, 3rd edition

2. https://geek-docs.com/python/python-examples/python-priority-queue.html
