爬虫
编码的三叔
坚持是一种信仰。
展开
-
python3 sipder 01
简单实现一个获取网页#coding:utf-8from urllib import requestif __name__=='__main__': response = request.urlopen('http://www.csdn.net') html=response.read() print(html)获取网页编码#coding:utf-8from urllib im...原创 2018-11-09 00:23:52 · 135 阅读 · 0 评论 -
python3 spider 02 获取html的url、 head、 status
#coding:utf-8from urllib import requestimport chardetif __name__=='__main__': req = request.Request('http://www.csdn.net') response = request.urlopen(req) #读取url信息 url = response.geturl(); pr...原创 2018-11-09 00:46:40 · 308 阅读 · 0 评论