python网络爬虫实例(一):爬取糗事百科

最新推荐文章于 2024-04-23 10:40:16 发布

cloud-2014

最新推荐文章于 2024-04-23 10:40:16 发布

阅读量856

点赞数

分类专栏： python

本文链接：https://blog.csdn.net/u012592062/article/details/51924938

版权

python 专栏收录该内容

16 篇文章 0 订阅

订阅专栏

#coding=utf8
'''
Created on 2016年7月16日

@author: root
'''

import urllib,urllib2
page=1
try:
    while page<36:
        print"开始爬取第"+str(page)+"个网页......"
        url="http://www.qiushibaike.com/8hr/page/"+str(page)+"/?s=4895521"
        user_agent="Mozilla/5.0 (Windows NT 10.0; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/50.0.2661.75 Safari/537.36"
        headers={'User-Agent':user_agent}
        req=urllib2.Request(url,headers=headers)
        rsp=urllib2.urlopen(req)
        html=rsp.read()
        f=open("E:\qiushibaike\\03\page_"+str(page)+".html",'w+')
        f.write(html)
        f.close()
        page=page+1
except urllib2.URLError,e:
    if hasattr(e,"code"):
        print e.code
    if hasattr(e,"reason"):
        print e.reason

确定要放弃本次机会？

福利倒计时

: :

立减 ¥

普通VIP年卡可用

立即使用

cloud-2014

关注关注

0
点赞
踩
0

收藏

觉得还不错? 一键收藏
0
评论
python网络爬虫实例(一):爬取糗事百科

#coding=utf8'''Created on 2016年7月16日@author: root'''import urllib,urllib2page=1try: while page<36: print"开始爬取第"+str(page)+"个网页......" url="http://www.qiushibaike.com/8hr/p
复制链接

扫一扫