python list、dict、set 查询速度对比

needones

已于 2023-09-19 18:06:26 修改

阅读量9.3k

点赞数 6

分类专栏： python 文章标签：列表字典集合 python

于 2021-05-26 17:05:03 首次发布

本文链接：https://blog.csdn.net/qq_43656718/article/details/117297334

版权

python 专栏收录该内容

10 篇文章 0 订阅

订阅专栏

本文通过测试展示了在不同数据规模下，Python中的list、dict和set查询速度的显著差异。随着数据量增加，set查询速度最快，dict次之，list最慢，性能差距可达上万倍。字典和集合利用哈希存储提高查询效率，而列表则依赖线性搜索。此外，还探讨了空list和空方括号的创建时间差异。

摘要由CSDN通过智能技术生成

python list、dict、set 查询速度对比

平常使用的时候，可能对于这三种类型的查询无感，因为数据量小的时候查询速度都很快，所以无感觉，今天来试试。

一百万数据测试

=b list=
0.013915400000000133
=c dict=
2.100000000115898e-06
=d set=
9.999999996956888e-07

一千万数据测试

=b list=
0.12139450000000096
=c dict=
2.9000000001389026e-06
=d set=
1.1000000004202093e-06

一亿数据测试

=b list=
14.738766999999996
=c dict=
0.0010382000000106473
=d set=
0.0005590999999753876

测试代码

b = []
c = {}
d = set()

for i in range(1000000):  # 此处调整数据量
    num = random.randint(0, 10000000000)

    b.append(num)
    c[num] = 1
    d.add(num)

print('=========b list=========')
time1 = time.perf_counter()
if 9999 in b:
    print("yes")
time2 = time.perf_counter()
print(time2 - time1)
print('=========c dict=========')
time1 = time.perf_counter()
if 9999 in c:
    print("yes")
time2 = time.perf_counter()
print(time2 - time1)
print('=========d set=========')
time1 = time.perf_counter()
if 9999 in d:
    print("yes")
time2 = time.perf_counter()
print(time2 - time1)

结论

从上面数据来看，set > dict >> list
list查询速度最慢，性能上差距上万倍

字典： dict会把所有的key变成hash 表，然后将这个表进行排序，这样，你通过data[key]去查data字典中一个key的时候，python会先把这个key hash成一个数字，然后拿这个数字到hash表中看没有这个数字，如果有，拿到这个key在hash表中的索引，拿到这个索引去与此key对应的value的内存地址那取值就可以了。
集合： 集合的存储方式和字典key类似，都是采用hash存储，相同的值对应相同的地址，所以set中没有相同值，也是无序的

冷知识：你知道这两个有什么区别吗，哪个更快?

a = list()
b = []

有兴趣的小伙伴尝试一下


t1 = time.perf_counter()
a = list()
t2 = time.perf_counter()
print(f"a = list() cost :{t2 - t1}")

t1 = time.perf_counter()
b = []
t2 = time.perf_counter()
print(f"b = []     cost :{t2 - t1}")