Simulating a request with urllib.request
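from urllib import request
# urlopen sends a GET request and returns an http.client.HTTPResponse
res = request.urlopen('https://www.baidu.com')
print(type(res))
# read() returns the response body as bytes
print(res.read())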
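Because read() returns bytes, the printed output above is a b'...' literal rather than readable text. A minimal sketch of decoding the body, assuming the charset comes from the Content-Type header with UTF-8 as a fallback; `fetch_text` and its parameters are hypothetical names, not part of urllib:

```python
from urllib import request


def fetch_text(url, timeout=10, default_charset="utf-8"):
    """Fetch url and return the body as str (hypothetical helper)."""
    with request.urlopen(url, timeout=timeout) as res:
        raw = res.read()  # body as bytes
        # Charset declared in the Content-Type header, if any;
        # falling back to default_charset is an assumption.
        charset = res.headers.get_content_charset() or default_charset
    # Undecodable bytes are replaced instead of raising UnicodeDecodeError
    return raw.decode(charset, errors="replace")
```

Calling fetch_text('https://www.baidu.com') should then return the page source as a str that can be printed or parsed directly.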
Setting a User-Agent request header to get past basic anti-crawler checks
from urllib import request
url = 'https://www.baidu.com/'
# A browser-like User-Agent; urllib's default one identifies itself as Python-urllib
header = {
    'User-Agent': 'Mozilla/5.0 (Windows NT 10.0; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/97.0.4692.71 '
                  'Safari/537.36'
}
rq = request.Request(url, headers=header)
resp = request.urlopen(rq)
print(resp.read())
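To confirm which headers will actually go out, the Request object can be inspected before anything is sent. A small sketch, assuming a hypothetical helper `build_request` and constant `BROWSER_UA` that are not part of urllib; it makes no network call:

```python
from urllib import request

# A browser-like User-Agent string, as in the example above
BROWSER_UA = ('Mozilla/5.0 (Windows NT 10.0; WOW64) AppleWebKit/537.36 '
              '(KHTML, like Gecko) Chrome/97.0.4692.71 Safari/537.36')


def build_request(url, user_agent=BROWSER_UA):
    """Return a Request carrying the given User-Agent (hypothetical helper)."""
    return request.Request(url, headers={'User-Agent': user_agent})


rq = build_request('https://www.baidu.com/')
# Request stores header names capitalized, so the key becomes 'User-agent'
print(rq.get_header('User-agent'))
# All headers set on the request, as a list of (name, value) pairs
print(rq.header_items())
```

Note the capitalization: looking up 'User-Agent' with get_header returns None, because the name is stored as 'User-agent'.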
Sending a request through an IP proxy with ProxyHandler
from urllib import request
url = 'http://httpbin.org/ip'
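# Route http:// requests through the proxy at 221.10.217.59:8888
handler = request.ProxyHandler({'http': '221.10.217.59:8888'})
opener = request.build_opener(handler)
resp = opener.open(url)
# httpbin.org/ip echoes the caller's IP, so this should show the proxy's address
print(resp.read())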
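Public proxies are often slow or offline, so in practice the call above may hang or raise. A hedged sketch that adds a timeout and error handling; `make_proxy_opener` and `fetch_via_proxy` are hypothetical helpers, and mapping 'https' to the same proxy assumes the proxy can tunnel HTTPS, which the notes above do not confirm:

```python
from urllib import request


def make_proxy_opener(proxy):
    """Build an opener that routes http and https traffic through proxy.

    Hypothetical helper; proxy is a 'host:port' string.
    """
    handler = request.ProxyHandler({'http': proxy, 'https': proxy})
    return request.build_opener(handler)


def fetch_via_proxy(url, proxy, timeout=5):
    """Return the response body as bytes, or None if the request fails."""
    opener = make_proxy_opener(proxy)
    try:
        with opener.open(url, timeout=timeout) as resp:
            return resp.read()
    except OSError as exc:
        # urllib.error.URLError and socket timeouts are both OSError subclasses
        print('proxy request failed:', exc)
        return None
```

For example, fetch_via_proxy('http://httpbin.org/ip', '221.10.217.59:8888') returns the JSON body on success and None if the proxy cannot be reached within five seconds.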