Python爬虫入门案例3：使用handler处理器访问baidu

最新推荐文章于 2024-10-07 01:36:35 发布

咸蛋_dd

最新推荐文章于 2024-10-07 01:36:35 发布

阅读量111

点赞数

分类专栏： Python爬虫文章标签： python 爬虫开发语言

本文链接：https://blog.csdn.net/weixin_62848089/article/details/130557032

版权

Python爬虫专栏收录该内容

8 篇文章 1 订阅

订阅专栏

为什么要使用handler处理器？

因为我们之前使用的urlopen无法使用动态cookie和代理来访问网站

下面用一个例子来演示handler的基本使用

记住三个口诀：

handler build_opener open

import urllib.request


url='http://www.baidu.com'
headers={
     #这里填自己的ua
    "User-Agent":""
}
#handler   build_opener    open
req=urllib.request.Request(headers=headers,url=url)
#获取handler对象
handler=urllib.request.HTTPSHandler()
#获取opener对象
opener=urllib.request.build_opener(handler)
#调用open方法
resp=opener.open(req)
content=resp.read().decode("utf-8")
print(content)

就可以成功获取baidu的html代码了