Python Machine Learning
arm_xuli
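Predicting benign/malignant tumors with LogisticRegression, trained on a subset of the samples and on all samples
    # Import the pandas package under the alias pd
    import pandas as pd
    # Call the pandas read_csv function with the training file path and store the returned data in df_train
    df_train = pd.read_csv('../Datasets/Breast-Cancer/breast-cancer-train.csv')
    # Call the pandas read_csv function, passing in the test...
Original · 2018-01-31 20:39:56 · 821 reads · 0 comments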
Predicting benign/malignant tumors with a custom linear classifier in TensorFlow
    import tensorflow as tf
    import numpy as np
    import pandas as pd
    train = pd.read_csv('../Datasets/Breast-Cancer/breast-cancer-train.csv')
    test = pd.read_csv('../Datasets/Breast-Cancer/breast-cancer-...
Original · 2018-03-02 18:26:53 · 2402 reads · 1 comment
Ensemble models in Python: performance analysis and visualization of random forest and gradient boosting decision tree classifiers
    import pandas as pd
    titanic = pd.read_csv('http://biostat.mc.vanderbilt.edu/wiki/pub/Main/DataSets/titanic.txt')
    #titanic = pd.read_csv('../Datasets/Breast-Cancer/titanic.txt')
    X=titanic[['pclass','a...
Original · 2018-02-14 11:10:41 · 2143 reads · 1 comment
K-nearest-neighbors classifier (non-parametric, no fitted parameters) in Python: class prediction and visualization
    from sklearn.datasets import load_iris
    iris = load_iris()
    iris.data.shape
    print(iris.DESCR)
    from sklearn.model_selection import train_test_split
    X_train,X_test,y_train,y_test=train_test_split(iris.da...
Original · 2018-02-14 11:33:29 · 738 reads · 0 comments
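K-nearest neighbors has no parameters to fit: "training" only stores the samples, and a prediction is a vote among the k stored points closest to the query. The following is a minimal pure-Python sketch of that idea, not scikit-learn's KNeighborsClassifier; the helper name `knn_predict` and the toy data are hypothetical.

```python
import math
from collections import Counter


def knn_predict(train_points, train_labels, query, k=3):
    """Return the majority label among the k training points closest to query."""
    # Pair every stored sample's Euclidean distance to the query with its label.
    distances = sorted(
        (math.dist(point, query), label)
        for point, label in zip(train_points, train_labels)
    )
    # Vote among the labels of the k nearest samples.
    nearest_labels = [label for _, label in distances[:k]]
    return Counter(nearest_labels).most_common(1)[0][0]


# Hypothetical toy data: two well-separated clusters in 2-D.
train_points = [(1.0, 1.1), (1.2, 0.9), (0.8, 1.0), (5.0, 5.2), (5.1, 4.9), (4.8, 5.0)]
train_labels = ["a", "a", "a", "b", "b", "b"]

print(knn_predict(train_points, train_labels, (1.1, 1.0)))  # prints a
print(knn_predict(train_points, train_labels, (4.9, 5.1)))  # prints b
```

An odd k avoids tied votes when there are only two classes; with more classes a tie-breaking rule would be needed.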
Parallel grid search over Bayes model hyperparameters in Python: program analysis
    from sklearn.datasets import fetch_20newsgroups
    import numpy as np
    news = fetch_20newsgroups(subset='all')
    from sklearn.model_selection import train_test_split
    X_train,X_test,y_train,y_test = ...
Original · 2018-02-13 12:00:46 · 573 reads · 0 comments
Comparing CountVectorizer and TfidfVectorizer in Python: stop-word removal, text feature vectorization, and classification with a Bayes algorithm, with visual analysis
    from sklearn.datasets import fetch_20newsgroups
    news = fetch_20newsgroups(subset='all')
    print(len(news.data))
    print(news.data[0])
    from sklearn.model_selection import train_test_split
    X_train,X_test,y...
Original · 2018-02-13 11:20:29 · 2529 reads · 0 comments
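The difference between count features and TF-IDF features can be seen on a tiny corpus. The sketch below is pure Python and uses the textbook weight idf = log(N / df); scikit-learn's TfidfVectorizer smooths the idf and L2-normalizes each row by default, so its numbers will differ. The corpus, the stop-word set, and the helper names are hypothetical.

```python
import math
from collections import Counter

# Hypothetical three-document corpus and stop-word list.
documents = [
    "the cat sat on the mat",
    "the dog sat on the log",
    "the cat chased the dog",
]
stop_words = {"the", "on"}


def tokenize(text):
    """Lowercase, split on whitespace, and drop stop words."""
    return [word for word in text.lower().split() if word not in stop_words]


def count_vector(tokens, vocabulary):
    """Raw term counts, one entry per vocabulary term."""
    counts = Counter(tokens)
    return [counts[term] for term in vocabulary]


tokenized = [tokenize(doc) for doc in documents]
vocabulary = sorted({token for tokens in tokenized for token in tokens})
count_vectors = [count_vector(tokens, vocabulary) for tokens in tokenized]

# Document frequency: in how many documents each term appears.
n_docs = len(documents)
doc_freq = [sum(1 for tokens in tokenized if term in tokens) for term in vocabulary]
idf = [math.log(n_docs / df) for df in doc_freq]

# TF-IDF scales each count by how rare the term is across the corpus.
tfidf_vectors = [[tf * weight for tf, weight in zip(vec, idf)] for vec in count_vectors]

print(vocabulary)
print(count_vectors[0])
print(tfidf_vectors[0])
```

In the first document "cat" and "mat" both occur once, so their counts are equal, but "mat" appears in only one document and therefore receives the larger TF-IDF weight.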
Image recognition in Python with raw pixel features versus PCA-compressed reconstructions: visualizing recognition performance
    import pandas as pd
    import numpy as np
    digits_train = pd.read_csv('../Datasets/Breast-Cancer/optdigits.tra', header=None)
    digits_test = pd.read_csv('../Datasets/Breast-Cancer/optdigits.tes', h...
Original · 2018-02-12 22:28:29 · 2189 reads · 0 comments
Improving decision tree prediction performance in Python by combining hyperparameter estimation with feature selection
    import pandas as pd
    titanic = pd.read_csv('../Datasets/Breast-Cancer/titanic.txt')
    y=titanic['survived']
    X = titanic.drop(['row.names','name','survived'],axis=1)
    X['age'].fillna(X['age'].mean(),in...
Original · 2018-02-12 20:15:15 · 776 reads · 0 comments
Training news word vectors in Python with multi-core Word2Vec
    from sklearn.datasets import fetch_20newsgroups
    news = fetch_20newsgroups(subset='all')
    X,y=news.data,news.target
    from bs4 import BeautifulSoup
    # Import the nltk and re packages
    import nltk,re
    # Define a function named news_to_sente...
Original · 2018-02-12 10:52:41 · 1078 reads · 0 comments
Plotting the 2-D distribution of handwritten digit images after PCA compression in Python: program error analysis
    import pandas as pd
    import numpy as np
    digits_train = pd.read_csv('../Datasets/Breast-Cancer/optdigits.tra', header=None)
    digits_test = pd.read_csv('../Datasets/Breast-Cancer/optdigits.tes', h...
Original · 2018-02-11 23:28:16 · 868 reads · 1 comment
Fitting training samples in Python with a degree-4 polynomial regression model
    X_train = [[6],[8],[10],[14],[18]]
    y_train = [[7],[9],[13],[17.5],[18]]
    from sklearn.linear_model import LinearRegression
    regressor = LinearRegression()
    regressor.fit(X_train,y_train)
    import numpy...
Original · 2018-02-12 11:08:58 · 1263 reads · 0 comments
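With five training samples, a degree-4 polynomial has exactly as many coefficients as there are points, so it passes through every sample. The sketch below illustrates that property with an exact pure-Python solve rather than a scikit-learn pipeline (the post's own code is truncated, so how it builds the model is not shown here). The helper names `fit_polynomial` and `predict` are hypothetical.

```python
from fractions import Fraction

# Training samples copied from the snippet above.
X_train = [[6], [8], [10], [14], [18]]
y_train = [[7], [9], [13], [17.5], [18]]


def fit_polynomial(xs, ys, degree):
    """Solve the square Vandermonde system exactly; needs len(xs) == degree + 1."""
    n = degree + 1
    if len(xs) != n:
        raise ValueError("this sketch needs exactly degree + 1 samples")
    # Augmented matrix [V | y], where V[i][j] = xs[i] ** j.
    rows = [
        [Fraction(x) ** j for j in range(n)] + [Fraction(y)]
        for x, y in zip(xs, ys)
    ]
    # Gauss-Jordan elimination in exact rational arithmetic.
    for col in range(n):
        pivot = next(r for r in range(col, n) if rows[r][col] != 0)
        rows[col], rows[pivot] = rows[pivot], rows[col]
        pivot_value = rows[col][col]
        rows[col] = [value / pivot_value for value in rows[col]]
        for r in range(n):
            if r != col and rows[r][col] != 0:
                factor = rows[r][col]
                rows[r] = [a - factor * b for a, b in zip(rows[r], rows[col])]
    return [rows[i][n] for i in range(n)]  # coefficients, lowest power first


def predict(coefficients, x):
    """Evaluate the polynomial at x."""
    return sum(c * Fraction(x) ** j for j, c in enumerate(coefficients))


xs = [row[0] for row in X_train]
ys = [row[0] for row in y_train]
coefficients = fit_polynomial(xs, ys, degree=4)
fitted = [float(predict(coefficients, x)) for x in xs]
print(fitted)  # prints [7.0, 9.0, 13.0, 17.5, 18.0]
```

Zero error on the training samples says nothing about how the curve behaves between or beyond them, so held-out samples are still needed to judge the fit.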
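Analyzing regression decision trees by depth in Python
    import numpy as np
    from sklearn.tree import DecisionTreeRegressor
    # sklearn.cross_validation was removed in scikit-learn 0.20; model_selection is its successor
    from sklearn import model_selection as cross_validation
    import matplotlib.pyplot as plt
    # Generate a random dataset
    def create_data(n):
        np.random.seed(0)
        X =...
Original · 2018-03-21 23:33:39 · 2431 reads · 0 comments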