Python核心丨字典和集合

最新推荐文章于 2024-02-26 09:46:18 发布

So.ne

最新推荐文章于 2024-02-26 09:46:18 发布

阅读量790

点赞数 2

分类专栏： Python

本文链接：https://blog.csdn.net/m0_45198298/article/details/106491582

版权

Python 专栏收录该内容

74 篇文章 0 订阅

订阅专栏

字典和集合

字典和集合基础

字典

字典是一系列由（key）和值（value）配对组成的元素的集合

在Python3.7+，字典被确定为有序

相比于列表和元组，字典的性能更优，特别是对于查找、添加和删除操作，字典都能在常数时间复杂度内完成。

集合

集合和字典基本相同，唯一的区别，集合没有键和值的配对，是一系列无序的、唯一的元素集合

字典和集合的创建

d1 = {'name': 'jason', 'age':20, 'gender': 'male'}
d2 = dict({'name': 'jason', 'age': 20, 'gender': 'male'})
d3 = dict([('name', 'jason'), ('age', 20), ('gender', 'male')])
d4 = dict(name='jason', age=20, gender='male')
d1 == d2 == d3 == d4
True

s1 = {1， 2， 3}
s2 = set([1, 2, 3])
s1 == s2
True

字典元素访问

d = {'name': 'jason', 'age': 20}
d['name']
'jason'
d['location']
# 报错：KeyErrpr

字典可以使用get(key, default)函数来进行索引

d = {'name': 'jason', 'age': 20}
d.get('name')
'jason'
d.get('location', 'none')
'none'

注：集合并不支持索引操作，因为集合本质上是一个哈希表，和列表不一样

s = {1, 2, 3}
s[0]
# 报错：TypeError

想要判断一个元素在不在字典或集合内，可以使用value in dict/set来判断

s = {1, 2, 3}
1 in s
True
10 in s
False

d = {'name': 'jason', 'age': 20}
'name' in d
True
'location' in d
False

字典和集合支持增加、删除、更新等操作

d = {'name': 'jason', 'age': 20}
d['gender'] = 'male'  # 增加元素对'gender':'male'
d
{'name': 'jason', 'age': 20, 'gender': 'male'}
d['age'] = 30  # 更新键'age'对应的值
d.pop('gender')
d
{'name': 'jason', 'age': 30}

s = {1, 2, 3}
s.add(4)  # 增加元素4到集合
s
{1, 2, 3, 4}
s.remove(4)  # 从集合中删除元素4
s
{1, 2, 3}

注：集合的pop()操作时删除集合中最后一个元素，可是集合本身是无序的，不知道会删除哪个元素。

根据字典的键或值，升序或降序

d = {'b': 1, 'a': 2, 'c': 10}
d_sorted_by_key = sorted(d.items(), key=lambda x: x[0])  # 根据字典键的升序排序
d_sorted_by_value = sorted(d.items(), key=lambda x: x[1])  # 根据字典值的升序排序
d_sorted_by_key
[('a', 2), ('b', 1), ('c', 10)]
d_sorted_by_value
[('b', 1), ('a', 2), ('c', 10)]

元素排序，直接调用sorted(set)

s = {3, 4, 2, 1}
sorted(s)  # 对集合的元素进行升序排序
[1, 2, 3, 4]

#### 字典和集合性能

示例

电商企业的后台，存储每件产品的ID、名称和加格。需求是，给定某件商品的ID，找出其价格

# 用列表来存储
def find_product_price(products, product_id):
    for id, price in products:
        if id == product_id:
            return price
    return None

products = [
    (143121312, 100),
    (432314553, 30),
    (32421912367, 150)
]

print('The price of product 432314553 is {}'.format(find_product_price(products, 432314553)))

# 输出结果：The price of product 432314553 is 30

假设列表有n个元素，而查找的过程要遍历列表，那么时间复杂度就为O(n)

如果用字典来存储数据，只需O(1)的时间复杂度就可以完成。

products = {
    143121312: 100,
    432314553: 30,
    32421912367: 150,
}
print('The price of product 432314553 is {}'.format(products[432314553]))

# 输出结果：The price of product 432314553 is 30

字典和集合的工作原理

字典和集合的内部结构都是一张哈希表

对于字典而言，这张表存储了哈希值（hash）、键和值这3个元素
对集合来说，区别就是哈希表内没有键和值的配对，只有单一的元素

'''
现在的哈希表除了字典本身的结构，
会把索引和哈希值、键、值单独分开。
'''
Indices
----------------------------------------------------
None | index | None | None | index | None | index ...
----------------------------------------------------

Entries
--------------------
hash0   key0  value0
---------------------
hash1   key1  value1
---------------------
hash2   key2  value2
---------------------
        ...
---------------------

示例中在新的哈希表结构下的存储形式


indices = [None, 1, None, None, 0, None, 2]
entries = [
[1231236123, 'name', 'mike'],
[-230273521, 'dob', '1999-01-01'],
[9371539127, 'gender', 'male']
]