Browser engines :
IE : Trident
Opera : Presto
Mozilla : Firefox ( Gecko )
Linux : KHTML ( Like Gecko )
Apple : WebKit ( Like KHTML )
Google : Chrome ( Like WebKit )
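The "like" chain above is why a modern User-Agent string names several engines at once. A minimal sketch, assuming a typical desktop Chrome User-Agent string (the platform and version numbers are illustrative and not taken from these notes):

```python
# Illustrative Chrome User-Agent string; platform and versions are assumptions.
chrome_ua = (
    "Mozilla/5.0 (Windows NT 10.0; Win64; x64) "
    "AppleWebKit/537.36 (KHTML, like Gecko) "
    "Chrome/120.0.0.0 Safari/537.36"
)

# Tokens inherited from each step of the chain listed above.
engine_tokens = ["Mozilla", "Gecko", "KHTML", "AppleWebKit", "Chrome"]

# Keep only the tokens that actually appear in the string.
found_tokens = [token for token in engine_tokens if token in chrome_ua]
print(found_tokens)
```

Because every token is present, a server that only checks for a substring such as "Gecko" cannot tell these browsers apart.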
import urllib.request
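urllib.request.urlopen('http:……')  # simplest form: pass the URL directly
request = urllib.request.Request(url, data, headers)  # general form: data (bytes) and headers (dict) are optional
urllib.request.urlopen(request)  # send the prepared Request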
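The general Request(url, data, headers) form can be tried without touching the network. This is a minimal sketch: the URL http://example.com/search, the form fields, and the User-Agent value are placeholders chosen for illustration. Note that data must be bytes, and that supplying it switches the method from GET to POST.

```python
import urllib.parse
import urllib.request

# Hypothetical URL and form fields, for illustration only.
url = "http://example.com/search"
form_fields = {"q": "python", "page": "1"}

# data must be bytes: URL-encode the fields, then encode the string.
data = urllib.parse.urlencode(form_fields).encode("utf-8")
headers = {"User-Agent": "my-crawler/0.1"}  # placeholder value

request = urllib.request.Request(url, data, headers)

# Nothing has been sent yet; the Request object only describes the request.
print(request.get_method())    # POST, because data is not None
print(request.data)
print(request.header_items())
```

Passing the finished object to urllib.request.urlopen(request) is what actually sends it.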
url = "http://www.baidu.com/"
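request = urllib.request.Request(url, headers={"User-Agent": "2222"})  # set request headers
response = urllib.request.urlopen(request)
print(response.getcode())  # HTTP status code; 200 means success
print(response.geturl())  # URL the data actually came from (differs from url after a redirect)
print(response.info())  # response headers sent by the server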
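To see what getcode(), geturl(), and info() report when a redirect happens, the same calls can be run against a throwaway local server instead of a real site. This is only a sketch: the handler class, the /old and /new paths, and the response body are hypothetical, and the proxy-free opener is installed just so the local request is not routed through a system proxy.

```python
import threading
import urllib.request
from http.server import BaseHTTPRequestHandler, HTTPServer


class RedirectingHandler(BaseHTTPRequestHandler):
    """Hypothetical handler: /old redirects to /new, anything else returns a body."""

    def do_GET(self):
        if self.path == "/old":
            self.send_response(302)
            self.send_header("Location", "/new")
            self.end_headers()
        else:
            payload = b"hello"
            self.send_response(200)
            self.send_header("Content-Type", "text/plain")
            self.send_header("Content-Length", str(len(payload)))
            self.end_headers()
            self.wfile.write(payload)

    def log_message(self, format, *args):
        pass  # keep the demo output quiet


# Port 0 lets the operating system pick a free port.
server = HTTPServer(("127.0.0.1", 0), RedirectingHandler)
threading.Thread(target=server.serve_forever, daemon=True).start()
base_url = "http://127.0.0.1:%d" % server.server_port

# Ignore any proxy settings so the request goes straight to the local server.
urllib.request.install_opener(
    urllib.request.build_opener(urllib.request.ProxyHandler({}))
)

request = urllib.request.Request(base_url + "/old", headers={"User-Agent": "2222"})
response = urllib.request.urlopen(request)

status_code = response.getcode()                 # status of the final response
final_url = response.geturl()                    # URL after following the redirect
content_type = response.info()["Content-Type"]   # one of the server's headers
body = response.read()

response.close()
server.shutdown()
server.server_close()

print(status_code, final_url, content_type, body)
```

The request asked for /old, but geturl() reports /new, which is how a crawler can detect that it was redirected. Since Python 3.9 the documentation marks getcode(), geturl(), and info() as deprecated in favor of response.status, response.url, and response.headers.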
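Setting the User-Agent header (e.g. User-Agent: dddd) is the first step in the contest between crawlers and anti-crawler defenses.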
import urllib.request
import random
user_agent_list = ['adegeg','bsgege','cgege','deggw']
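user_agent = random.choice(user_agent_list)  # pick one entry from the list at random
request = urllib.request.Request("http://www.baidu.com/")
request.add_header("User-Agent", user_agent)  # add or overwrite an HTTP header
request.get_header("User-agent")  # read a header back; the stored name has only its first letter capitalized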
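The steps above can be wrapped in a small helper so that every request gets a fresh random User-Agent. A minimal sketch: build_request is a hypothetical helper name, and the list still holds the placeholder values from these notes rather than real browser strings. No request is sent.

```python
import random
import urllib.request

# Placeholder User-Agent values from the notes; a real crawler would use
# genuine browser strings.
user_agent_list = ["adegeg", "bsgege", "cgege", "deggw"]


def build_request(url, user_agents):
    """Return a Request for url with a randomly chosen User-Agent header."""
    request = urllib.request.Request(url)
    request.add_header("User-Agent", random.choice(user_agents))
    return request


request = build_request("http://www.baidu.com/", user_agent_list)

# add_header() stores the name via str.capitalize(), so the key is "User-agent".
chosen_user_agent = request.get_header("User-agent")
missing = request.get_header("User-Agent")  # None: this spelling does not match
print(chosen_user_agent, missing)
```

The second lookup returning None is the pitfall the last line of the notes warns about: get_header() matches the stored name exactly.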