（47）-- 用线程简单爬取网络页面

最新推荐文章于 2024-02-11 22:55:13 发布

Fredreck1919

最新推荐文章于 2024-02-11 22:55:13 发布

阅读量195

点赞数

分类专栏： Web基础文章标签：线程

本文链接：https://blog.csdn.net/Fredreck1919/article/details/79641391

版权

Web基础专栏收录该内容

15 篇文章 0 订阅

订阅专栏

#用线程简单爬取网络页面

from urllib import request
from multiprocessing import Process,Queue

def downloader(url_queue):
    p = url_queue.get()
    response = request.urlopen(p)

    html = response.read()
    content = html.decode("utf-8")
    path_list=p.split('/')
    file_name = path_list[-1]

    with open(file_name,"w",encoding = "utf-8") as f:
        f.write(content)


if __name__ == "__main__":

    url_queue = Queue()

    s1 = "http://www.langlang2017.com/index.html"
    s2 = "http://www.langlang2017.com/route.html"
    s3 = "http://www.langlang2017.com/FAQ.html"
    url_queue.put(s1)
    url_queue.put(s2)
    url_queue.put(s3)

    p1 = Process(target=downloader, args=(url_queue, ))
    p1.start()

    p2 = Process(target=downloader, args=(url_queue, ))
    p2.start()

    p3 = Process(target=downloader, args=(url_queue, ))
    p3.start()

兄弟连学python

Python学习交流、资源共享群：563626388 QQ

优惠劵

Fredreck1919

关注关注

0
点赞
踩
0

收藏

觉得还不错? 一键收藏
0
评论
（47）-- 用线程简单爬取网络页面

#用线程简单爬取网络页面from urllib import requestfrom multiprocessing import Process,Queuedef downloader(url_queue): p = url_queue.get() response = request.urlopen(p) html = response.read() c...
复制链接

扫一扫