python字典初始化_如何在Python中设置字典的初始大小？

最新推荐文章于 2022-03-25 15:12:32 发布

weixin_39976733

最新推荐文章于 2022-03-25 15:12:32 发布

阅读量392

点赞数

文章标签： python字典初始化

I'm putting around 4 million different keys into a Python dictionary.

Creating this dictionary takes about 15 minutes and consumes about 4GB of memory on my machine. After the dictionary is fully created, querying the dictionary is fast.

I suspect that dictionary creation is so resource consuming as the dictionary is very often rehashed (as it grows enormously).

Is is possible to create a dictionary in Python with some initial size or bucket number?

My dictionary points from a number to an object.

class MyObject(object):

def __init__(self):

# some fields...

d = {}

d[i] = MyObject() # 4M times on different key...

解决方案

With performance issues it's always best to measure. Here are some timings:

d = {}

for i in xrange(4000000):

d[i] = None

# 722ms

d = dict(itertools.izip(xrange(4000000), itertools.repeat(None)))

# 634ms

dict.fromkeys(xrange(4000000))

# 558ms

s = set(xrange(4000000))

dict.fromkeys(s)

# Not including set construction 353ms

The last option doesn't do any resizing, it just copies the hashes from the set and increments references. As you can see, the resizing isn't taking a lot of time. It's probably your object creation that is slow.

weixin_39976733

关注

0
点赞
踩
0

收藏

觉得还不错? 一键收藏
0
评论
python字典初始化_如何在Python中设置字典的初始大小？

I'm putting around 4 million different keys into a Python dictionary.Creating this dictionary takes about 15 minutes and consumes about 4GB of memory on my machine. After the dictionary is fully creat...
复制链接

扫一扫

评论

被折叠的条评论为什么被折叠?

到【灌水乐园】发言

查看更多评论

添加红包

成就一亿技术人!

hope_wisdom

发出的红包

实付元

使用余额支付

点击重新获取

扫码支付

钱包余额 0

抵扣说明：

1.余额是钱包充值的虚拟货币，按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载，可以购买VIP、付费专栏及课程。