python
岸芷汀兰whu
Loves life, loves technology
Python: 10 Minutes to pandas
    #encoding=utf-8
    import pandas as pd
    import numpy as np
    import matplotlib.pyplot as plt
    '''Reference: http://pandas.pydata.org/pandas-docs/stable/10min.html'''
    # Create a Series by passing a list
    s = pd.Series([1,3,5,np.nan,6,8])
Original · 2016-01-24 17:22:12 · 1081 views · 0 comments
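The Series created in the excerpt can be inspected a little further. The following is a minimal sketch, assuming a reasonably recent pandas; the checks after the constructor call are illustrative additions and not part of the original post.

```python
import numpy as np
import pandas as pd

# Passing a list creates a Series with a default integer index (0..n-1).
s = pd.Series([1, 3, 5, np.nan, 6, 8])

# The NaN entry forces a floating-point dtype for the whole Series.
print(s.dtype)

# Count the missing values, then average the remaining ones.
missing = s.isna().sum()
mean_without_nan = s.dropna().mean()
print(missing, mean_without_nan)
```

Dropping the missing value before averaging is shown explicitly here; `Series.mean()` also skips NaN by default.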
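Installing and Using ipython notebook (1)
Installation
Reference 1: Installing ipython notebook on Ubuntu
    apt-get install ipython-notebook
    ipython notebook --pylab inline --ip 0.0.0.0
Then open the corresponding address in a browser: ip:8888
Reference 2: Installing ipython notebook on Ubuntu
Installing ipython notebook on Windows: reference
Installing ipython on Mac
Original · 2016-01-25 17:12:08 · 3808 views · 0 comments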
A First Look at pyspark (1): LearningSpark
Starting pyspark
    IPYTHON=1 pyspark
    IPYTHON_OPTS="notebook" pyspark
    (set IPYTHON=1 pyspark for windows)
Running a Python script
    spark-submit my_script.py
Initializing the SparkContext
    from pyspark import SparkConf,SparkContext
    conf = Spark
Original · 2016-03-15 17:01:51 · 4337 views · 0 comments
scikit-learn from Beginner to Mastery (5): Unsupervised learning: seeking representations of the data
    #encoding=utf-8
    '''Unsupervised learning: seeking representations of the data'''
    '''KMeans clustering'''
    from sklearn import cluster, datasets
    iris = datasets.load_iris()
    X_iris = iris.data
    y_iris = iris.target
    k_means = cluster.KMeans(n_clusters=3)
    k_
Original · 2016-01-30 12:41:18 · 753 views · 0 comments
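The excerpt stops just after the estimator is constructed. One way the clustering step might continue is sketched below; `n_init` and `random_state` are set explicitly here as assumptions for reproducibility and do not appear in the original post.

```python
from sklearn import cluster, datasets

iris = datasets.load_iris()
X_iris = iris.data
y_iris = iris.target

# n_init and random_state are assumptions added for reproducible runs.
k_means = cluster.KMeans(n_clusters=3, n_init=10, random_state=0)
k_means.fit(X_iris)

# Cluster ids are arbitrary: they need not match the class ids in y_iris.
labels = k_means.labels_
print(labels[::10])
print(y_iris[::10])
```

Because the cluster numbering is arbitrary, comparing `labels` with `y_iris` element by element is not a meaningful accuracy measure without first matching clusters to classes.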
scikit-learn from Beginner to Mastery (4): Model Selection
k-fold cross-validation
    '''k-fold cross-validation, used to measure prediction accuracy'''
    import numpy as np
    X_folds = np.array_split(X_digits,3)
    y_folds = np.array_split(y_digits,3)
    scores = list()
    for k in range(3):
        X_train = list(X_folds)
        X_test =X_t
Original · 2016-01-29 22:07:16 · 1162 views · 0 comments
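The loop in the excerpt is cut off. A possible completion of the manual k-fold procedure is sketched below. It assumes that `X_digits` and `y_digits` come from `datasets.load_digits()` and that the model is a linear-kernel `SVC`, as in the scikit-learn statistical-learning tutorial; neither detail is visible in the excerpt.

```python
import numpy as np
from sklearn import datasets, svm

# Assumption: the digits dataset and a linear SVC, as in the sklearn tutorial.
digits = datasets.load_digits()
X_digits, y_digits = digits.data, digits.target
svc = svm.SVC(C=1, kernel="linear")

X_folds = np.array_split(X_digits, 3)
y_folds = np.array_split(y_digits, 3)
scores = []
for k in range(3):
    # Hold out fold k for testing and train on the remaining folds.
    X_train = list(X_folds)
    X_test = X_train.pop(k)
    X_train = np.concatenate(X_train)
    y_train = list(y_folds)
    y_test = y_train.pop(k)
    y_train = np.concatenate(y_train)
    scores.append(svc.fit(X_train, y_train).score(X_test, y_test))

print(scores)
```

Copying the fold list with `list(X_folds)` before calling `pop(k)` keeps the original folds intact for the next iteration.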
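scikit-learn from Beginner to Mastery (3): Supervised Learning
KNN
    #encoding=utf-8
    '''Nearest neighbors and the curse of dimensionality'''
    # Classifying irises
    import numpy as np
    from sklearn import datasets
    iris = datasets.load_iris()
    iris_X = iris.data
    iris_y = iris.target
    np.unique(iris_y)
    '''k-nearest neighbors classification'''
    # Split into training set and test
Original · 2016-01-29 20:05:33 · 1050 views · 0 comments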
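The excerpt ends at the train/test split. A minimal sketch of how the k-nearest-neighbors step might look follows; the seed, the 10-sample hold-out, and the default `KNeighborsClassifier` settings are assumptions for illustration, not taken from the original post.

```python
import numpy as np
from sklearn import datasets
from sklearn.neighbors import KNeighborsClassifier

iris = datasets.load_iris()
iris_X, iris_y = iris.data, iris.target

# Shuffle the indices with a fixed seed (assumption) and hold out 10 samples.
rng = np.random.RandomState(0)
indices = rng.permutation(len(iris_X))
train_idx, test_idx = indices[:-10], indices[-10:]

# Default settings (5 neighbors) are an assumption of this sketch.
knn = KNeighborsClassifier()
knn.fit(iris_X[train_idx], iris_y[train_idx])

predicted = knn.predict(iris_X[test_idx])
accuracy = np.mean(predicted == iris_y[test_idx])
print(predicted, iris_y[test_idx], accuracy)
```

Shuffling before splitting matters here because the iris samples are stored sorted by class.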
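scikit-learn from Beginner to Mastery (2): the setting and the estimator
    #encoding=utf-8
    '''scikit-learn datasets are 2D arrays; they can be understood as a list of multi-dimensional observations'''
    from sklearn import datasets
    iris = datasets.load_iris()
    data = iris.data
    data.shape
    # This is a 150*4 set of observations; data not initially shaped as (n_samples,n_features)
Original · 2016-01-29 18:29:10 · 2239 views · 1 comment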
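To illustrate the `(n_samples, n_features)` convention mentioned in the excerpt, the sketch below contrasts the iris data, which already has that shape, with the digits images, which do not. Using `load_digits()` as the counterexample is an assumption borrowed from the scikit-learn tutorial, not something shown in the excerpt.

```python
from sklearn import datasets

iris = datasets.load_iris()
print(iris.data.shape)  # (150, 4): already (n_samples, n_features)

# Assumption: the digits images as an example of data in a different shape.
digits = datasets.load_digits()
print(digits.images.shape)  # (1797, 8, 8): one 8x8 image per sample

# Flatten each 8x8 image into a 64-element feature vector.
data = digits.images.reshape((digits.images.shape[0], -1))
print(data.shape)  # (1797, 64)
```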
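scikit-learn from Beginner to Mastery (1): Quick Start
Loading a dataset
    #encoding=utf-8
    '''A dataset is a dictionary-like object; the data is stored in the .data member, an array of shape (n_samples, n_features), and in supervised learning the response variable is stored in .target'''
    from sklearn import datasets
    iris = datasets.load_iris()
    digits = datasets.load_digits()
Original · 2016-01-29 17:57:10 · 3500 views · 0 comments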
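The dictionary-like behavior described in the excerpt can be checked directly. This is a small sketch; the specific checks are illustrative additions rather than part of the original post.

```python
from sklearn import datasets

iris = datasets.load_iris()
digits = datasets.load_digits()

# Key access and attribute access return the same underlying array.
same_object = iris["data"] is iris.data
print(sorted(iris.keys()))

# .target holds one response value per row of .data.
print(iris.data.shape, iris.target.shape)
print(digits.data.shape, digits.target.shape)
```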
Using ipython notebook
Launch
    sudo ipython notebook --pylab inline
References
ipython notebook; Easily Set Up Your IPython + Notebook; A Cloud-Based Scientific Computing Environment
Original · 2016-01-29 17:07:56 · 647 views · 0 comments
Python Step One: Installing ipython
Official documentation
Install pip
    sudo apt-get install pip
Install ipython
    pip install "ipython[all]"
Test
    iptest
Install scikit-learn
    pip install -U scikit-learn
Original · 2016-01-24 21:36:58 · 592 views · 0 comments
Integrating Apache Spark with PyCharm
Reference
Wrote a pycharm.sh under /Applications/PyCharm CE.app/Contents/bin:
    export PYTHONPATH=/usr/local/share/spark1626/python/:/usr/local/share/spark1626/python/lib/py4j-0.9-src.zip
    export SPARK_HOME=/usr/local/s
Original · 2016-04-01 13:41:39 · 627 views · 0 comments