python
Rivertao
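Text mining: similarity comparison
Compares the similarity of 盗墓笔记 (Daomu Biji), 鬼吹灯 (Gui Chui Deng), and 金九门 (Jin Jiu Men). import jieba from gensim import corpora, models, similarities import urllib.request from collections import defaultdict # Below, a phpstudy server is used to open the txt documents doc1=urllib.request.urlopen("http://127... Original · 2018-03-05 11:40:49 · 969 views · 0 comments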
Bar charts with Series and DataFrame
import pandas as pd import numpy as np import matplotlib.pyplot as plt from pandas import Series from pandas import DataFrame # Bar chart from a Series fig,axes=plt.subplots(2,1) # two rows, one column data=Series(np.random.rand(16)... Original · 2018-04-05 20:42:54 · 3594 views · 0 comments
Connecting to MySQL from Python
import pymysql connection=None cursor=None try: connection=pymysql.connect('localhost','root','root','test1') cursor = connection.cursor() sql = 'insert into dept values (%s,%s,%s)' tr... Original · 2018-05-07 16:35:29 · 123 views · 0 comments
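BP (backpropagation) artificial neural network example
# BP artificial neural network implementation # 1. Read the data # 2. Import keras.models Sequential / keras.layers.core Dense, Activation # 3. Sequential: build the model # 4. Dense: build the layers # 5. Activation: activation function # 6. compile: compile the model # 7. fit: train (learn) # 8. Validate (test, classification prediction) import numpy as np im... Original · 2018-08-14 10:45:21 · 984 views · 0 comments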
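Implementing the naive Bayes classification algorithm
import numpy as np def bayes(train_data,labels,test_data): train_data_num=len(train_data) # number of training samples (equals the number of labels) not_r_label=set(labels) # distinct classes label_rate={} for item in not_r_label: label... Original · 2018-08-12 08:21:51 · 213 views · 0 comments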
Logistic regression example
import pandas as pd root="G:/python/源码/源码/luqu.csv" dataframe=pd.read_csv(root) #print(dataframe) x=dataframe.ix[:,1:4] y=dataframe.ix[:,0] from sklearn.linear_model import LogisticRegression model2=... Original · 2018-08-12 14:09:44 · 1242 views · 0 comments
Decision tree ID3 algorithm example
import pandas as pd file_root="G:/python/源码/源码/lesson.csv" dataframe=pd.read_csv(file_root,encoding="gbk") #print(dataframe) x=dataframe.ix[:,1:5].as_matrix() y=dataframe.ix[:,5].as_matrix() for i in... Original · 2018-08-12 18:20:03 · 1185 views · 0 comments
StarCraft team case analysis
import pandas as pd import numpy as np import matplotlib.pyplot as plt # 1. Load the data and view its basic information def read_dataset(file_root): dataframe=pd.read_csv(file_root) print("Basic information about the data:") print(dataframe.info(... Original · 2018-08-15 15:02:58 · 453 views · 0 comments
K-means clustering example
import pandas as pd import numpy as np import matplotlib.pylab as pyl file_root="file path" dataframe=pd.read_csv(file_root) x=dataframe.ix[:,:].as_matrix() from sklearn.cluster import KMeans k_mean=KMean... Original · 2018-08-13 09:59:34 · 620 views · 0 comments
Movie box office case analysis
import pandas as pd # Load the data def read_data(file_root): dataframe=pd.read_csv(file_root) print("Basic information about the data:") print(dataframe.info()) print("The data has %i rows and %i columns"%(dataframe.shape[0],dataframe.shape... Original · 2018-08-15 22:02:31 · 1744 views · 0 comments
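Handwritten digit recognition with SVM example
from sklearn import datasets from sklearn import svm iris=datasets.load_iris() digits=datasets.load_digits() # Choose the SVM model svm_classifier=svm.SVC(gamma=0.0001,C=100) # Manually split into training and test sets n_test=100 # number of test samples train_x=d... Original · 2018-08-16 16:07:04 · 691 views · 0 comments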
Setting titles, axis labels, and tick labels and adding legends with matplotlib
import pandas as pd import numpy as np import matplotlib.pyplot as plt fig=plt.figure() ax=fig.add_subplot(1,1,1) ax.plot(np.random.randn(1000).cumsum()) x_ticks=ax.set_xticks([0,250,500,750,1000]) x_... Original · 2018-04-05 16:39:21 · 26153 views · 0 comments
Detecting and filtering outliers
from pandas import Series import pandas as pd import numpy as np #np.random.seed(12345) data=pd.DataFrame(np.random.randn(1000,4)) #print(data.describe()) print(data[(np.abs(data)>3).any(1)]) # greater than 3 or... Original · 2018-04-04 21:10:27 · 845 views · 0 comments
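The filter in the preview above keeps every row in which at least one value has an absolute value greater than 3. A minimal dependency-free sketch of the same selection follows; the function name `rows_with_outliers`, the `threshold` parameter, and the sample rows are illustrative assumptions, not part of the original post.

```python
def rows_with_outliers(rows, threshold=3.0):
    """Return the rows that contain at least one value with |value| > threshold."""
    return [row for row in rows if any(abs(v) > threshold for v in row)]


# Hypothetical sample data: the second and third rows each contain an outlier.
sample = [
    [0.5, -1.2, 2.9, 0.0],
    [3.4, 0.1, -0.3, 1.0],
    [0.2, -3.1, 0.7, 2.2],
]
print(rows_with_outliers(sample))
```

The comparison is strict, so a value of exactly 3 or -3 is not treated as an outlier, which matches the `>` used in the preview.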
Recognizing black-and-white digits with the KNN algorithm
from numpy import * import operator from os import listdir def knn(k,testdata,traindata,labels): traindatasize=traindata.shape[0] dif=tile(testdata,(traindatasize,1))-traindata sqdif=dif**2 s... Original · 2018-03-06 13:12:33 · 269 views · 0 comments
Implementing the k-means algorithm in Python
import numpy as npy def kmeans(X,k,maxIteration): numpoint,numdim=X.shape numSet=npy.zeros((numpoint,numdim+1)) numSet[:,:-1]=X centroids=numSet[npy.random.randint(numpoint,size=k),:]... Original · 2018-03-30 18:21:02 · 1302 views · 0 comments
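The preview above is cut off right after the centroids are chosen, so the rest of the post's loop is not visible. As a rough sketch of the standard k-means iteration (assign each point to its nearest centroid, then recompute each centroid as the mean of its members), the following uses only the standard library. Unlike the preview, which draws random starting centroids, this sketch takes them as an explicit argument so the result is reproducible; the signature and the sample points are assumptions.

```python
import math


def kmeans(points, centroids, max_iteration=100):
    """Cluster points around the given starting centroids.

    Returns (assignments, centroids), where assignments[i] is the index of
    the centroid nearest to points[i].
    """
    centroids = [list(c) for c in centroids]
    assignments = None
    for _ in range(max_iteration):
        # Assignment step: index of the nearest centroid for every point.
        new_assignments = [
            min(range(len(centroids)), key=lambda j: math.dist(p, centroids[j]))
            for p in points
        ]
        if new_assignments == assignments:
            break  # converged: no point changed cluster
        assignments = new_assignments
        # Update step: move each centroid to the mean of its members.
        for j in range(len(centroids)):
            members = [p for p, a in zip(points, assignments) if a == j]
            if members:  # leave an empty cluster's centroid where it is
                centroids[j] = [sum(col) / len(members) for col in zip(*members)]
    return assignments, centroids


points = [[1, 1], [2, 1], [4, 3], [5, 4]]
assignments, centers = kmeans(points, centroids=[[1, 1], [2, 1]])
print(assignments, centers)
```

`math.dist` requires Python 3.8 or later.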
KNN algorithm implementation
from numpy import * import operator from os import listdir def knn(k,traindata,labels,testdata): num=traindata.shape[0] dif=tile(testdata,(num,1))-traindata sqdif=dif**2 sqdifsum=sqdif.sum(ax... Original · 2018-03-26 10:21:03 · 160 views · 0 comments
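The visible part of the preview above computes squared differences between the test point and every training point, which is the first half of a Euclidean-distance KNN. The remainder is truncated, so the following is only a sketch of how such a classifier is commonly completed: sort by distance, take the `k` nearest, and return the majority label. The name `knn_classify` and the toy data are assumptions, and the standard library stands in for NumPy.

```python
import math
from collections import Counter


def knn_classify(k, train_data, labels, test_point):
    """Return the majority label among the k training points nearest to test_point."""
    # Euclidean distance from the test point to every training point.
    distances = [math.dist(row, test_point) for row in train_data]
    # Indices of the k nearest training points, nearest first.
    nearest = sorted(range(len(train_data)), key=distances.__getitem__)[:k]
    votes = Counter(labels[i] for i in nearest)
    return votes.most_common(1)[0][0]


# Hypothetical toy data: two points near (1, 1) labeled "A", two near (0, 0) labeled "B".
train = [[1.0, 1.1], [1.0, 1.0], [0.0, 0.0], [0.0, 0.1]]
train_labels = ["A", "A", "B", "B"]
print(knn_classify(3, train, train_labels, [0.1, 0.2]))
```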
KNN classification with a Python library
from sklearn import neighbors from sklearn import datasets knn=neighbors.KNeighborsClassifier() iris=datasets.load_iris() print(iris) knn.fit(iris.data,iris.target) predictlabel=knn.predict([0.1,0.2,0.3,0.4... Original · 2018-03-26 21:09:37 · 2485 views · 0 comments
Neural network algorithm implementation
import numpy as npy def tanh(x): return npy.tanh(x) def tanh_deriv(x): return 1.0-npy.tanh(x)**2 def logistic(x): return 1/(1+npy.exp(-x)) def logistic_deriv(x): return logistic(x)*(1... Original · 2018-03-27 16:11:12 · 509 views · 0 comments
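The preview above defines two activation functions together with their derivatives. As a sanity check, the sketch below compares the closed-form derivatives against a central finite difference. The last function in the preview is truncated; the sketch uses the standard identity that the logistic derivative is `s * (1 - s)` with `s = logistic(x)`, which is an assumption about what the post intended. `numeric_deriv` is a hypothetical helper, and `math` stands in for NumPy.

```python
import math


def tanh_deriv(x):
    """Derivative of tanh: 1 - tanh(x)^2."""
    return 1.0 - math.tanh(x) ** 2


def logistic(x):
    """Logistic (sigmoid) function."""
    return 1.0 / (1.0 + math.exp(-x))


def logistic_deriv(x):
    """Derivative of the logistic function: s * (1 - s)."""
    s = logistic(x)
    return s * (1.0 - s)


def numeric_deriv(f, x, h=1e-6):
    """Central finite-difference approximation of f'(x)."""
    return (f(x + h) - f(x - h)) / (2.0 * h)


for x in (-2.0, 0.0, 0.5):
    print(x, tanh_deriv(x), numeric_deriv(math.tanh, x))
    print(x, logistic_deriv(x), numeric_deriv(logistic, x))
```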
Simple linear regression in Python
import numpy as npy def fitSLR(x,y): fenzi = 0 fenmu = 0 num=len(x) for i in range(num): fenzi=fenzi+(x[i]-npy.mean(x))*(y[i]-npy.mean(y)) fenmu=fenmu+(x[i]-npy.mean(x)... Original · 2018-03-27 20:17:33 · 508 views · 0 comments
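In the preview above, `fenzi` (numerator) accumulates the products of the x and y deviations from their means, and `fenmu` (denominator) appears to accumulate the squared x deviations, which is the usual least-squares slope formula. A sketch under that reading: slope `b1 = fenzi / fenmu` and intercept `b0 = mean(y) - b1 * mean(x)`. The names `fit_slr` and `predict` and the sample data are assumptions. The means are computed once rather than on every loop iteration as in the preview.

```python
def fit_slr(x, y):
    """Least-squares fit of y = b0 + b1 * x; returns (b0, b1)."""
    n = len(x)
    x_bar = sum(x) / n
    y_bar = sum(y) / n
    # Numerator: sum of products of deviations; denominator: sum of squared x deviations.
    numerator = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
    denominator = sum((xi - x_bar) ** 2 for xi in x)
    b1 = numerator / denominator
    b0 = y_bar - b1 * x_bar
    return b0, b1


def predict(x_value, b0, b1):
    """Evaluate the fitted line at x_value."""
    return b0 + b1 * x_value


# Hypothetical sample data.
x = [1, 3, 2, 1, 3]
y = [14, 24, 18, 17, 27]
b0, b1 = fit_slr(x, y)
print(b0, b1, predict(6, b0, b1))
```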
Multiple linear regression 1
import numpy as npy import pandas as pda from sklearn import linear_model filepath="G:/Delivery.csv" data=pda.read_csv(filepath,encoding="gbk") print(data) x=data.iloc[:,0:2].as_matrix() y=data.iloc[:... Original · 2018-03-28 09:46:56 · 234 views · 0 comments
Multiple linear regression example
from numpy import genfromtxt import numpy as npy from sklearn import datasets,linear_model filepath=r"G:\六西格玛\第一阶段-深度学习基础\代码与素材\代码与素材(2)\MachineLearning\MultiLinearRegression\Delivery.csv" data=genfro... Original · 2018-03-28 10:01:23 · 2371 views · 0 comments
Correlation coefficient and coefficient of determination for regression in Python
import numpy as npy import cmath def computecorrelation(x,y): x_bar=npy.mean(x) y_bar=npy.mean(y) SSR=0 Varx=0 Vary=0 for i in range(0,len(x)): SSR+=(x[i]-x_bar)*(y[i]... Original · 2018-03-28 19:11:23 · 8794 views · 0 comments
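The preview above accumulates `SSR` (the sum of products of deviations) together with `Varx` and `Vary`, which matches the usual formula for the Pearson correlation coefficient, r = SSR / sqrt(Varx * Vary). The rest of the post is truncated, so the following is a sketch of that formula plus the common definition of the coefficient of determination, R² = 1 - SS_res / SS_tot. The function names `pearson_r` and `r_squared` and the sample data are assumptions.

```python
import math


def pearson_r(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    x_bar = sum(x) / n
    y_bar = sum(y) / n
    ssr = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
    var_x = sum((xi - x_bar) ** 2 for xi in x)
    var_y = sum((yi - y_bar) ** 2 for yi in y)
    return ssr / math.sqrt(var_x * var_y)


def r_squared(y, y_hat):
    """Coefficient of determination for observations y and predictions y_hat."""
    y_bar = sum(y) / len(y)
    ss_tot = sum((yi - y_bar) ** 2 for yi in y)
    ss_res = sum((yi - yh) ** 2 for yi, yh in zip(y, y_hat))
    return 1.0 - ss_res / ss_tot


# Hypothetical sample data.
x = [1, 3, 8, 7, 9]
y = [10, 12, 24, 21, 34]
print(pearson_r(x, y))
```

For a simple linear regression fitted by least squares, R² equals the square of r, so either function can be used to cross-check the other.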
Transforming data with a function or mapping
from pandas import Series import pandas as pd import numpy as np data=pd.DataFrame({ "food":["bacon", "pulled pork", "bacon", "Pastrami", "corned be. Original · 2018-04-04 18:51:40 · 417 views · 0 comments
Data discretization and binning
from pandas import Series import pandas as pd import numpy as np age=[20,22,25,27,21,23,37,31,61,45,41,32] bins=[18,25,35,60,100] cats=pd.cut(age,bins) print(cats) print(cats.codes) # see which bin each value falls in count=pd.va... Original · 2018-04-04 20:30:30 · 259 views · 0 comments
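By default `pd.cut` builds right-closed intervals, so with the bins in the preview above the categories are (18, 25], (25, 35], (35, 60], and (60, 100]. The sketch below reproduces the integer bin codes and the per-bin counts with the standard library only, as a way to see what `cats.codes` represents. `cut_codes` is a hypothetical helper, not a pandas function.

```python
from bisect import bisect_left
from collections import Counter


def cut_codes(values, bins):
    """Return the bin index of each value for right-closed bins (bins[i], bins[i+1]].

    Values outside (bins[0], bins[-1]] get -1, the code pandas uses for missing.
    """
    codes = []
    for v in values:
        if bins[0] < v <= bins[-1]:
            codes.append(bisect_left(bins, v) - 1)
        else:
            codes.append(-1)
    return codes


age = [20, 22, 25, 27, 21, 23, 37, 31, 61, 45, 41, 32]
bins = [18, 25, 35, 60, 100]
codes = cut_codes(age, bins)
print(codes)           # [0, 0, 0, 1, 0, 0, 2, 1, 3, 2, 2, 1]
print(Counter(codes))  # number of ages that fall in each bin
```

Note that 25 lands in the first bin because the upper edge of each interval is inclusive.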
Tuning KNN model parameters with cross-validation
import pandas as pd import matplotlib.pyplot as plt import numpy as np # Load the data def inspect_data(file_root): dataframe=pd.read_csv(file_root) print("Basic information about the data:") print(dataframe.info()) pri... Original · 2018-08-17 15:13:30 · 1634 views · 0 comments