pandas
thfyshz
一花一世界
展开
-
pandas纵向学习之10 minutes to pandas(一)
10mins官方文档10 Minutes to pandas必要的库导入:import pandas as pdimport numpy as npimport matplotlib.pyplot as plt创建对象pandas常用数据类型有两个:series和dataframe。创建一个series:s = pd.Series([1,3,5,np.nan,6,8])s...原创 2018-10-31 09:10:47 · 378 阅读 · 0 评论 -
pandas_cookbook学习(八)
切片一个数据集:In [122]: df = pd.DataFrame(data={'Case' : ['A','A','A','B','A','A','B','A','A'], .....: 'Data' : np.random.randn(9)}) .....: In [123]: dfs = list(zip(*df.grou...原创 2018-11-12 10:17:23 · 191 阅读 · 0 评论 -
pandas_cookbook学习(七)
根据索引值将每一组数据滞后一项:In [112]: df = pd.DataFrame( .....: {u'line_race': [10, 10, 8, 10, 10, 8], .....: u'beyer': [99, 102, 103, 103, 88, 100]}, .....: index=[u'Last Gunfighter', u'Last...原创 2018-11-11 21:53:58 · 164 阅读 · 0 评论 -
pandas_cookbook学习(五)
使用剩下值的均值代替此值,注意transform的用法,与apply相区分:apply返回一个聚类结果,transform分别返回每个处理的结果In [94]: df = pd.DataFrame({'A' : [1, 1, 2, 2], 'B' : [1, -1, 1, 2]})In [95]: gb = df.groupby('A')In [96]: def replace(g):...原创 2018-11-08 10:52:42 · 164 阅读 · 0 评论 -
pandas_cookbook学习(九)——apply
Apply应用函数:In [135]: df = pd.DataFrame(data={'A' : [[2,4,8,16],[100,200],[10,20,30]], 'B' : [['a','b','c'],['jj','kk'],['ccc']]},index=['I','II','III']); df A BI [2, 4, 8, 16] [a, b, c]II [100, 20...原创 2018-11-13 13:14:37 · 111 阅读 · 0 评论 -
pandas_cookbook学习(二)
选择数据dataframes> df = pd.DataFrame( ....: {'AAA' : [4,5,6,7], 'BBB' : [10,20,30,40],'CCC' : [100,50,-30,-50]}); df AAA BBB CCC0 4 10 1001 5 20 502 6 30 -303 7 40 -50#设定两个“或”条件筛选数据&...原创 2018-11-05 11:24:28 · 152 阅读 · 0 评论 -
pandas_cookbook学习(四)
Missing DataThe missing data docs.Fill forward a reversed timeseriesIn [79]: df = pd.DataFrame(np.random.randn(6,1), index=pd.date_range(‘2013-08-01’, periods=6, freq=‘B’), columns=list(‘A’))In [8...原创 2018-11-07 17:37:40 · 140 阅读 · 0 评论 -
pandas纵向学习之10 minutes to pandas(四)
时间序列原创 2018-11-03 00:05:13 · 326 阅读 · 0 评论 -
pandas_cookbook学习(一)
import pandas as pdimport numpy as npIdiomsdf = pd.DataFrame( ...: {'AAA' : [4,5,6,7], 'BBB' : [10,20,30,40],'CCC' : [100,50,-30,-50]}); df AAA BBB CCC0 4 10 1001 5 20 502 6 30 -303 7...原创 2018-11-04 12:22:57 · 253 阅读 · 0 评论 -
pandas纵向学习之10 minutes to pandas(三)
操作数学统计df.mean() #查看每列的平均值df.mean(1) #查看每行的平均值s = pd.Series([1,3,5,np.nan,6,8], index=dates).shift(2)sdf.sub(s, axis='index')A B C D F2013-01-01 NaN NaN NaN NaN NaN2013-01-02 NaN NaN NaN NaN ...原创 2018-11-02 09:06:36 · 207 阅读 · 0 评论 -
pandas_cookbook学习(六)
使用resample和apply函数分别变换:In [103]: rng = pd.date_range(start="2014-10-07",periods=10,freq='2min')In [104]: ts = pd.Series(data = list(range(10)), index = rng)In [105]: def MyCust(x): .....: i...原创 2018-11-09 08:12:41 · 121 阅读 · 0 评论 -
pandas_cookbook学习(三)
多重索引> df = pd.DataFrame({'row' : [0,1,2], ....: 'One_X' : [1.1,1.1,1.1], ....: 'One_Y' : [1.2,1.2,1.2], ....: 'Two_X' : [1.11,1.11...原创 2018-11-06 12:22:32 · 220 阅读 · 0 评论 -
pandas横向学习之创建
pandas.Series文档1.series的创建class pandas.Series(data=None, index=None, dtype=None, name=None, copy=False, fastpath=False)Parametersmeansdataarray-like, dict, or scalar value. Contains da...原创 2018-10-30 15:38:12 · 253 阅读 · 0 评论 -
pandas纵向学习之10 minutes to pandas(二)
pandas纵向学习之10 minutes to pandas(一)布尔值索引df[df.A>0] A B C D2013-01-02 0.356680 -0.468280 1.293093 -0.7522512013-01-03 1.179930 0.407866 -1.733382 -0.1284742013-01-05 1.398427 0.087443 -1.0327...原创 2018-11-01 09:59:21 · 177 阅读 · 0 评论 -
pandas之groupby学习笔记
pandas库的groupby问题一、对象分组1 一个简单例子In [1]: df = pd.DataFrame([('bird', 'Falconiformes', 389.0), ...: ('bird', 'Psittaciformes', 24.0), ...: ('mammal', 'Carni...原创 2019-07-16 12:18:44 · 997 阅读 · 0 评论