![](https://img-blog.csdnimg.cn/20201014180756927.png?x-oss-process=image/resize,m_fixed,h_64,w_64)
利用python进行数据分析
DMU_lzq1996
这个作者很懒,什么都没留下…
展开
-
第二章 Python语法基础,Ipython和JupyterNotebooks
2.1 Python解释器Python是解释性语言,解释器同一时间只能运行一个程序或语句。2.2 IPython基础运行Ipython Shell运行Jupyter NotebooksTab补全自省:?;??%run命令中断运行代码:Ctrl-C从剪贴板执行程序:%paste和%cpaste2.3 Python语法基础使用缩进,而不是括号万物皆对象注释函数和对象方法调用...原创 2018-10-22 19:46:05 · 272 阅读 · 0 评论 -
数据规整
#层次化索引import pandas as pdimport numpy as npdata = pd.Series(np.random.randn(9), index = [list('aaabbccdd'),list('123131223')])dataa 1 0.609164 2 -0.791785 3 -0.6964...原创 2018-11-05 11:48:12 · 598 阅读 · 0 评论 -
数据可视化
#绘图和可视化import matplotlib.pyplot as pltimport numpy as npdata = np.arange(10)dataarray([0, 1, 2, 3, 4, 5, 6, 7, 8, 9])plt.plot(data)[<matplotlib.lines.Line2D at 0x187c0ce60f0>]fig =...原创 2018-11-05 11:46:59 · 491 阅读 · 1 评论 -
时间序列
#时间序列#日期和时间数据类型及工具from datetime import datetimeimport pandas as pdimport numpy as npnow = datetime.now()nowdatetime.datetime(2018, 11, 7, 15, 6, 50, 327155)now.year,now.month,now.day(2018...原创 2018-11-07 15:52:49 · 324 阅读 · 0 评论 -
高维数据可视化
import pandas as pdimport matplotlib.pyplot as pltdata = pd.read_csv('iris.csv')data.head() Sepal.Length Sepal.Width Petal.Length Petal.Width Specie...原创 2018-11-09 21:29:08 · 1836 阅读 · 0 评论 -
python建模库
#建模库import pandas as pdimport numpy as npdata = pd.DataFrame({ 'x0':[1,2,3,4,5], 'x1':[0.01,-0.01,0.25,-4.1,0.], 'x2':[-1.5,0.,3.6,1.3,-2.1]})data x0 x...原创 2018-11-09 21:27:05 · 924 阅读 · 0 评论 -
数据聚合与分组运算
#数据聚合与分组import pandas as pdimport numpy as npdf = pd.DataFrame({'key1' : ['a', 'a', 'b', 'b', 'a'], 'key2' : ['one', 'two', 'one', 'two', 'one'], 'dat...原创 2018-11-06 15:59:03 · 298 阅读 · 0 评论 -
数据清洗和准备2
#字符串操作#字符串对象方法#splitval = 'a,b, guido'val.split(',')['a', 'b', ' guido']#strippieces = [x.strip()for x in val.split(',')]#去除空白符pieces['a', 'b', 'guido']first,second,third = piecesfirst...原创 2018-10-30 16:27:38 · 261 阅读 · 0 评论 -
数据清洗和准备1
#数据清洗和准备import pandas as pdimport numpy as np#处理缺失值string_data = pd.Series(['aardvark','artwdfv',np.nan,'asdfaa'])string_data0 aardvark1 artwdfv2 NaN3 asdfaadtype: obje...原创 2018-10-30 16:27:16 · 230 阅读 · 0 评论 -
数据加载,存储和文件格式
import pandas as pddf = pd.read_csv('examples/ex1.csv')df a b c d message 0 1 2 3 4 hello ...原创 2018-10-27 09:37:06 · 659 阅读 · 0 评论 -
numpy数组1
# NumPy基础#NumPy的ndarray:一种多维数组对象import numpy as npdata = np.random.randn(2,3)dataarray([[ 0.93326604, 1.25437496, 0.55385005], [ 1.54395985, 1.77476317, -0.83043758]])data*10array...原创 2018-10-24 20:44:59 · 291 阅读 · 0 评论 -
伪随机数生成
import numpy as np#伪随机数生成samples = np.random.normal(size = (4,4))#4*4正态分布数组samplesarray([[-1.5229485 , 0.77679543, 1.52695935, 1.20236513], [-0.47239924, -1.44597446, 0.53760123, -1.41...原创 2018-10-26 21:55:18 · 322 阅读 · 0 评论 -
3.1字典,集合,异常,函数,文件
#字典:键值对#创建字典empty_dict = {}d1 = {'a':'some value','b':[1,2,3,4]}d1{'a': 'some value', 'b': [1, 2, 3, 4]}#访问,插入,设定字典元素d1[7] = 'an integer'd1{'a': 'some value', 'b': [1, 2, 3, 4], 7: 'an in...原创 2018-10-23 16:59:16 · 152 阅读 · 0 评论 -
pandas
#pandasimport pandas as pdimport numpy as npfrom pandas import Series,DataFrame#Seriesobj = pd.Series([4,7,-5,3])obj0 41 72 -53 3dtype: int64obj.valuesarray([ 4, 7, -5, 3...原创 2018-10-25 18:50:15 · 199 阅读 · 0 评论 -
3.1数据结构—元组和列表
#3.1数据结构和序列#3.1.1元组:固定长度,不可改变#创建元组tupletup1 = (1,2,3)tup2 = ((1,2),3,4)tup1tup2((1, 2), 3, 4)#序列或迭代器转换为元组tup3 = tuple([1,2,3])tup4 = tuple("string")tup3tup4('s', 't', 'r', 'i', 'n', '...原创 2018-10-22 21:14:58 · 199 阅读 · 0 评论 -
pandas高级应用
#分类数据import pandas as pdimport numpy as npvalues = pd.Series(['apple','orange','apple','apple']*2)values0 apple1 orange2 apple3 apple4 apple5 orange6 apple7 ...原创 2018-11-08 10:26:42 · 212 阅读 · 0 评论