Requests模块
安装:pip install requests
镜像安装:pip install -i https://pypi.tuna.tsinghua.edu.cn/simple requests
一、代码:
import requests
def pracRequests():
url = r'https://www.sogou.com/web?query=许嵩'
resp = requests.get(url)
## 获得响应
print(resp)
print(resp.text)
# 页面源代码
return
此时网页若检查到是爬虫的话,返回的页面如下:
<Response [200]>
<html>
<head>
<script>
location.replace(location.href.replace("https://","http://"));
</script>
</head>
<body>
<noscript><meta http-equiv="refresh" content="0;url=http://www.baidu.com/"></noscript>
</body>
</html>
为解决这一问题,需要获取浏览器申请的header,定义如下: