- 博客(7)
- 资源 (9)
- 收藏
- 关注
转载 eletron学习
安装环境 node.js 安装npm githttp://blog.csdn.net/kavensu/article/details/17733639 eletron安装卡在 node install.jsnpm 安装 node-sass 网速慢的 可以 运行 npm config set registry https://registry.npm.taobao.org然后 编辑 ~
2017-02-16 14:42:25 1125
原创 python(四)
批量获取链接 定义函数from bs4 import BeautifulSoupimport requestsurl = 'http://bj.xiaozhu.com/fangzi/1508951935.html'wb_data = requests.get(url)soup = BeautifulSoup(wb_data.text,'lxml')title = soup.select('
2017-02-12 18:05:13 250
原创 python爬虫(三)
爬虫手机端 headersfrom bs4 import BeautifulSoupimport requestsheaders = { 'User-Agent':'Mozilla/5.0 (iPhone; CPU iPhone OS 9_1 like Mac OS X) AppleWebKit/601.1.46 (KHTML, like Gecko) Version/9.0 Mobi
2017-02-12 16:56:21 329
原创 python爬虫(二)
爬虫连续抓取数据 time.sleep(4)from bs4 import BeautifulSoupimport requestsimport timeurl_saves = 'http://www.tripadvisor.com/Saves#37685322'url = 'https://cn.tripadvisor.com/Attractions-g60763-Activities
2017-02-12 15:46:07 333
原创 python爬虫(一)
抓取 标题,图片,链接,多个文字 listfrom bs4 import BeautifulSoupimport requestsurl = 'https://cn.tripadvisor.com/Attractions-g60763-Activities-New_York_City_New_York.html'wb_data = requests.get(url)soup =
2017-02-12 14:26:14 328
原创 python爬虫(2-2)
抓取58链接 存储到mongofrom bs4 import BeautifulSoupimport requestsimport timeimport pymongoclient = pymongo.MongoClient('localhost',27017)ceshi = client['ceshi']url_list = ceshi['url_list3']#spider
2017-02-10 16:43:24 314
空空如也
TA创建的收藏夹 TA关注的收藏夹
TA关注的人