- 博客(15)
- 资源 (19)
- 收藏
- 关注
原创 决策树和随机森林对经典鸢尾花数据集分类
1.决策树import pandas as pdimport numpy as npfrom sklearn.datasets import load_irisfrom sklearn.tree import DecisionTreeClassifierfrom sklearn.tree import export_graphvizfrom sklearn.tree import De...
2020-04-23 16:26:54 5679 1
原创 基于selenium的斗鱼直播房间详细信息自动化爬虫
from selenium import webdriverimport time#导入 ActionChains 类from selenium.webdriver import ActionChainsimport jsonimport reimport requestsclass DouYuSpider(): def __init__(self): #...
2020-04-18 21:05:08 296
原创 sklearn.neural_network.MLPClassifier用法
from sklearn.neural_network import MLPClassifier#初始化输入矩阵X = [[0., 0.], [1., 1.]]#初始化目标值y = [0, 1]#实例化一个人工神经网络分类器并传入数据训练clf = MLPClassifier(solver='sgd', alpha=1e-5, activation='logistic',hidde...
2020-04-18 16:18:25 3285
原创 python爬取腾讯招聘信息
import requestsimport jsonimport queueimport threadingimport timeclass TencentSpider(): def __init__(self): self.headers = { "user-agent": "Mozilla/5.0 (Windows NT 10.0; Win...
2020-04-17 17:55:13 802
原创 pickle模块保存python对象到文件的使用方法
1.保存python对象到文件import pickles = "测试串"f = open("./data/run.pkl", "wb")pickle.dump(s, f)f.close()2.从文件读取python对象import picklef = open("./data/run.pkl", "rb")s = pickle.load(f)f.close()pri...
2020-04-17 15:39:36 277
原创 逻辑回归实现音乐类型分类
1训练模块import numpy as npfrom sklearn import linear_model, datasetsimport matplotlib.pyplot as pltfrom scipy.stats import normfrom scipy import fftfrom scipy.io import wavfile# 准备音乐数据,进行傅里叶变换取前...
2020-04-17 15:30:58 519
原创 python多线程爬取海报图片
1. 单线程版import requestsfrom lxml import etreeimport timeimport reclass HaiBaoSpider(): def __init__(self): self.headers = { "User-Agent": "Mozilla/5.0 (Windows NT 10.0; W...
2020-04-16 16:33:41 512
原创 贴吧帖子标题 + 回复内容 + 回复图片爬虫
import requestsfrom lxml import etreeimport reimport jsonimport osimport timeclass TieBaSpider(): def __init__(self): self.headers = { "user-agent": "Mozilla/5.0 (Windo...
2020-04-14 11:09:38 718
原创 逻辑回归对经典鸢尾花数据集进行三分类预测
import numpy as npfrom sklearn import datasetsfrom sklearn.linear_model import LogisticRegressionimport matplotlib.pyplot as pltiris = datasets.load_iris() #获取鸢尾花数据集#获取数据集的全部行,但只取其第四列即花瓣长度X = ...
2020-04-13 15:34:32 2243
原创 np.linspace()用法
np.linspace(a,b,c)用于创建一个等差序列的向量,向量值是[a,b]之间均匀分布的c个实数import numpy as nparithmetic_sequence = np.linspace(0,10,9).reshape(-1,1)print(arithmetic_sequence)输出结果如下:[[ 0. ][ 1.25][ 2.5 ][ 3.75][...
2020-04-13 15:17:22 16990
原创 有趣段子 + 图片爬虫
import requestsimport reimport jsonimport osclass NeiHanSpider(): def __init__(self): self.start_url = "http://www.budejie.com/" self.headers = { "User-Agent": "M...
2020-04-13 11:09:49 152
原创 通过session保存即时cookies请求拉勾网职位信息
import requestsstart_url = "https://www.lagou.com/"next_url = "https://www.lagou.com/jobs/positionAjax.json?needAddtionalResult=false"headers = { "User-Agent": "Mozilla/5.0 (Windows NT 10.0;...
2020-04-12 17:56:46 326
原创 通过requests获取网络上图片的大小
from io import BytesIO,StringIOimport requestsfrom PIL import Imageimg_url = "https://profile.csdnimg.cn/F/6/F/3_cyj5201314"response = requests.get(img_url)f = BytesIO(response.content)img = Ima...
2020-04-12 17:16:06 1869
原创 pandas读取分析保险数据
import pandas as pdimport matplotlib.pyplot as pltfrom sklearn.preprocessing import PolynomialFeaturesfrom sklearn.linear_model import LinearRegression#读入数据data = pd.read_csv('./data/insurance.c...
2020-04-12 16:35:14 293
原创 scrapy的setting.py的常用设置
SettingsScrapy设置(settings)提供了定制Scrapy组件的方法。可以控制包括核心(core),插件(extension),pipeline及spider组件。比如 设置Json Pipeliine、LOG_LEVEL等。内置设置参考手册BOT_NAME默认: ‘scrapybot’当使用 startproject 命令创建项目时其也被自动赋值。CONCURRENT...
2020-04-01 11:42:00 321
object_detection.rar
2021-04-27
opencv_haar特征的人脸检测xml文件.rar
2021-04-09
Redis可视化工具.rar
2020-06-15
Redis程序包.rar
2020-06-15
phantomjs-2.1.1-windows.rar
2020-03-10
robo3t.rar
2020-03-07
MNIST_data手写数字图片.rar
2020-02-27
空空如也
TA创建的收藏夹 TA关注的收藏夹
TA关注的人