爬虫篇
爬虫
禅悟刂
你不掌握别人的技术,命运就会被别人掌握。
展开
-
java爬取链家网数据
int num = 1; String path = "D:\\房源信息.txt"; BufferedWriter bf = new BufferedWriter(new FileWriter(path)); while(num<=100){ String link = "https://bj.lianjia.com/ershoufang/pg"+num; Doc...原创 2019-05-22 11:27:56 · 793 阅读 · 0 评论 -
java爬取豆瓣网电影数据
String path = "D:\\data.txt"; String url = "https://movie.douban.com/top250?start="; String link = "&filter="; BufferedWriter bf = new BufferedWriter(new FileWriter(path)); int num = 0; ...原创 2019-05-21 16:15:27 · 1199 阅读 · 0 评论 -
python 妹子图抓取
import requestsimport reheaders = { "user-agent": "Mozilla/5.0 (Windows NT 10.0; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/72.0.3626.81 Safari/537.36", "referer": "https...原创 2019-06-20 14:18:05 · 2916 阅读 · 0 评论 -
python 基于百度图片搜索
#! /user/bin/python# -*- coding: utf-8 -*-# Author: chen# Date 06/12import requestsimport re #导入正则表达式 提取所需要的内容import randomdef spiderPic(html,keyword): print('正在查找:'+keyword+'对应的文件,正在...原创 2019-06-20 14:19:07 · 705 阅读 · 0 评论 -
刺激战场 枪支性能雷达图分析
1、效果图2、python源码import requestsimport jsonpathimport pygal# 1. 请求刺激战场整体数据 jsonresponse = requests.get("http://pg.qq.com/zlkdatasys/data_zlk_zlzx.json")# 2. json数据转化为Python数据py_data = ev...原创 2019-06-20 14:22:45 · 1046 阅读 · 0 评论 -
包图网视屏爬取(请勿商用)
1.效果图2源码import requestsfrom lxml import etreeimport threadingclass Spider(object): def __init__(self): self.headers = {"User-Agent":"Mozilla/5.0 (Windows NT 10.0; WOW64) AppleW...原创 2019-07-01 15:36:14 · 2887 阅读 · 0 评论