python列表判断数字是否在列表中,Python，检查数字是否在列表中的多个范围内。...

最新推荐文章于 2023-06-30 11:42:17 发布

不要芝麻酱

最新推荐文章于 2023-06-30 11:42:17 发布

阅读量1.8k

点赞数

文章标签： python列表判断数字是否在列表中

If there are integer lists like these:

a_list = [2501, 2783, 3088, 3980, 465, 1001, 39392911, 39394382, 488955,489087, ......]

b_list = [474, 498, 47478821, 47479800, 3774, 8970, 484000, 486000......]

every 2 numbers indicate a range of natural numbers, for example, the ranges of a_list would be:

2501 2783 # 2501，2502，2503，2504，2505，2506,......,2783

3088 3980

465 1001

39392911 39394382

488955 489087

......

For a given number, search for the range where it belongs to, and with priority of a_list > b_list i.e. if a range is found in a_list, stop searching and move on to the next number for searching.

I had test run for searching around 50 numbers which took about 7 minutes. I have a big dataset which could be 20 million numbers need to be searched in his way.

How to code this to do it faster?

============= more conditions and information =============

could be more than 10 thousand numbers in each list.

could be up to 30 million numbers for searching.

the size of list is always n * 2

a_list: [1st < 2nd, 3rd < 4th, ......]

the numbers in the lists might occur more than once.

the priority: a_list > b_list.

I have code as following:

hasFound = 0

if hasFound == 0:

for x, y in izip(*[iter(a_list)]*2): # gives every 2 numbers

if aNumber in range(x,y):

a_list_counter +=1

hasFound = 1

break

if hasFound == 0:

for x, y in izip(*[iter(b_list)]*2):

if aNumber in range(x,y):

b_list_counter += 1

hasFound = 1

break

Many thanks in advance.

解决方案

Toss them all in one big dictionary:

a_list = [2501, 2783, 3088, 3980, 465, 1001, 39392911, 39394382, 488955,489087, ......]

b_list = [474, 498, 47478821, 47479800, 3774, 8970, 484000, 486000......]

# into

ranges = {'a': [2501, 2783, 3088, 3980, 465, 1001, 39392911, 39394382, 488955,489087, ......],

'b': [474, 498, 47478821, 47479800, 3774, 8970, 484000, 486000......]}

Then go through each list in order, mostly the way you were doing it before:

numbers = [list of your target numbers]

scores = {} # dict to store results in

for number in numbers:

for range_name in sorted(ranges):

range_list = ranges[range_name]

groups = zip(*[iter(range_list)] * 2)

if any(start <= number < end for start,end in groups):

scores.setdefault(range_name, 0) += 1

Alternatively (and I'm not sure if this is faster or not) you could do:

for number in numbers:

for range_name in sorted(ranges):

range = ranges[range_name]

if sorted(range + [number]).index(number) % 2:

scores.setdefault(range, 0) += 1

In this case you're throwing a new number into a sorted list, re-sorting it (which is fast using TimSort), and seeing if it falls between two existing numbers.

不要芝麻酱

关注

0
点赞
踩
0

收藏

觉得还不错? 一键收藏
0
评论
python列表判断数字是否在列表中,Python，检查数字是否在列表中的多个范围内。...

If there are integer lists like these:a_list = [2501, 2783, 3088, 3980, 465, 1001, 39392911, 39394382, 488955,489087, ......]b_list = [474, 498, 47478821, 47479800, 3774, 8970, 484000, 486000......]ev...
复制链接

扫一扫

评论

被折叠的条评论为什么被折叠?

到【灌水乐园】发言

查看更多评论

添加红包

成就一亿技术人!

hope_wisdom

发出的红包

实付元

使用余额支付

点击重新获取

扫码支付

钱包余额 0

抵扣说明：

1.余额是钱包充值的虚拟货币，按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载，可以购买VIP、付费专栏及课程。