# Python 3: first web-scraper exercise — downloads ten Baidu Tieba pages and saves each one locally.
import urllib.request
def baidu_tieba(url, begin_page, end_page):
    """Download pages ``begin_page``..``end_page`` (inclusive) and save each to disk.

    Each request URL is formed by appending the page number to *url*; the
    response body is written to a file in the current directory named with
    the zero-padded page number, e.g. ``00001.html``.

    Args:
        url: Base URL; the page number is concatenated directly onto it.
        begin_page: First page number to fetch (inclusive).
        end_page: Last page number to fetch (inclusive).
    """
    for i in range(begin_page, end_page + 1):
        sName = str(i).zfill(5) + '.html'
        print('正在下载第' + str(i) + '个网页,并将其存储为' + sName + '.....')
        # Use a context manager so the HTTP connection is closed
        # deterministically instead of relying on garbage collection.
        with urllib.request.urlopen(url + str(i)) as resp:
            m = resp.read()
        with open(sName, 'wb') as file:
            file.write(m)
# Target thread base URL; the page number is appended by baidu_tieba().
bdurl = 'http://tieba.baidu.com/p/4785143088?pn='
begin_page = 1
end_page = 10

# Guard the network call so importing this module has no side effects;
# the download only runs when the file is executed as a script.
if __name__ == '__main__':
    baidu_tieba(bdurl, begin_page, end_page)