python多线程加速for循环,Python 2.5 - 多线程for循环

最新推荐文章于 2024-05-29 21:29:09 发布

Ada-苏婉妤

最新推荐文章于 2024-05-29 21:29:09 发布

阅读量497

点赞数

文章标签： python多线程加速for循环

I've got a piece of code:

for url in get_lines(file):

visit(url, timeout=timeout)

It gets URLs from file and visit it (by urllib2) in for loop.

Is is possible to do this in few threads? For example, 10 visits at the same time.

I've tried:

for url in get_lines(file):

Thread(target=visit, args=(url,), kwargs={"timeout": timeout}).start()

But it does not work - no effect, URLs are visited normally.

The simplified version of function visit:

def visit(url, proxy_addr=None, timeout=30):

(...)

request = urllib2.Request(url)

response = urllib2.urlopen(request)

return response.read()

解决方案

To expand on senderle's answer, you can use the Pool class in multiprocessing to do this easily:

from multiprocessing import Pool

pool = Pool(processes=5)

pages = pool.map(visit, get_lines(file))

When the map function returns then "pages" will be a list of the contents of the URLs. You can adjust the number of processes to whatever is suitable for your system.

确定要放弃本次机会？

福利倒计时

: :

立减 ¥

普通VIP年卡可用

立即使用

Ada-苏婉妤

关注关注

0
点赞
踩
0

收藏

觉得还不错? 一键收藏
0
评论
python多线程加速for循环,Python 2.5 - 多线程for循环

I've got a piece of code:for url in get_lines(file):visit(url, timeout=timeout)It gets URLs from file and visit it (by urllib2) in for loop.Is is possible to do this in few threads? For example, 10 vi...
复制链接

扫一扫