爬虫
文章平均质量分 83
chengjintao1121
这个作者很懒,什么都没留下…
展开
-
凤凰网的抓取
import requestsimport re,json,pymysql,time#获取页码IDarticle_id_list=[“http://shankapi.ifeng.com/shanklist//getColumnInfo//default/6429514672495399578/1532918315000/20/5-35059-/getColumnInfoCallback?c...原创 2018-12-28 19:06:03 · 2096 阅读 · 0 评论 -
快科技的抓取
import requestsimport time,json,re,pymysqlfrom lxml import etreearticle_id_list=[608862]def ID_last(article_id_list):time_now = int((time.time()) * 1000)headers = {“User-Agent”: ‘Mozilla/5.0 (W...原创 2018-12-28 19:06:39 · 197 阅读 · 0 评论 -
新浪数据抓取
import requestsimport re,json,pymysql,timeheaders = {“Accept”: “application/json, text/javascript, /; q=0.01”,“Accept-Encoding”: “gzip, deflate, br”,“Accept-Language”: “zh-CN,zh;q=0.9,en;q=0.8”,...原创 2018-12-28 19:07:49 · 271 阅读 · 0 评论 -
滚动资讯的爬取
import requestsimport time,json,re,pymysqlfrom lxml import etreeheaders = {“User-Agent”: ‘Mozilla/5.0 (Windows NT 6.1; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/70.0.3538.110 Safar...原创 2018-12-28 19:09:01 · 6444 阅读 · 0 评论