Python的Pandas模块
数据分析的Pandas模块
huskyKKK
略
展开
-
python-pandas模块1-基本数据类型1
pandas模块-基本数据类型导入模块 import pandas as pd import numpy as np from pandas import Series,DataFrame1.Seriespd.Series()返回一个有index和values属性的数据对象s= pd.Series([4,-7,3,9])s(1)自定义索引s1=Series([4,7,6,5],index=['a','b','c','d'],dtype=float)(2)读取Series使用原创 2020-08-21 20:03:15 · 238 阅读 · 0 评论 -
python-pandas模块2-基本数据类型2
pandas模块-基本数据类型2导入模块 import pandas as pd import numpy as np from pandas import Series,DataFrame3.通过字典中包含字典创建个DataFrame创建字典并转换为DataFramedict_city={ 'shanghai':{2019:54000,2020:56000}, 'beijing':{2018:66666,2019:77777}}df_city=DataFrame(dic原创 2020-08-21 20:18:33 · 155 阅读 · 0 评论 -
python-pandas模块3-索引操作&算术运算
pandas模块-索引操作&算术运算导入模块:import numpy as np import pandas as pdfrom pandas import Series,DataFrame1.索引创建一个Seriess1=Series(range(1,5),index=list('ABCD'))s1原创 2020-08-22 10:32:42 · 385 阅读 · 0 评论 -
python-pandas模块4-函数映射&排序&重复轴判断
pandas模块-函数映射&排序&重复轴判断导入模块import numpy as npimport pandas as pdfrom pandas import Series,DataFrame1.函数应用和映射先创建一个DataFramedf1=DataFrame( np.random.choice(range(20),size=(4,3),replace=False) index=list('ABCD') columns=list('abc'))df1(原创 2020-08-23 19:12:42 · 242 阅读 · 0 评论 -
python-pandas模块5-汇总计算-描述统计-唯一值-值计数-成员资格-缺失值处理
pandas模块-汇总计算-描述统计-唯一值、值计数、成员资格-缺失值处理导入模块import numpy as npimport pandas as pdfrom pandas import Series,DataFrame1.汇总计算pandas的Series和DataFramexxx.sum()xxx.mean()xxx.max()xxx.add()先创建一个带nan的DataFramedf1=DataFrame( [ [3,2,np.nan],原创 2020-08-24 10:37:09 · 768 阅读 · 0 评论 -
python-pandas模块6-合并数据集
pandas模块-合并数据集导入模块:import numpy as npimport pandas as pdfrom pandas import Series,DataFramefrom numpy import nan as NA1.pd.merge() dataframe和dataframe合并创建两个DataFrame用了来连接df1=DataFrame( {'key':list('aabbabc'),'data1':np.arange(7)*10})df1df原创 2020-08-24 14:42:54 · 242 阅读 · 0 评论 -
python-pandas模块7-重塑与轴向选择
pandas模块-重塑与轴向选择导入模块:import numpy as npimport pandas as pdfrom pandas import Series,DataFramefrom numpy import nan as NA1.数据规整化–重塑与轴向选择(1)多层索引的数据创建一个Series原创 2020-08-24 19:31:52 · 114 阅读 · 0 评论 -
python-pandas模块8-去重-函数与映射-替换值
pandas模块-去重-函数与映射-替换值导入模块:import numpy as npimport pandas as pdfrom pandas import Series,DataFramefrom numpy import nan as NA1.去重创建有重复的数据df1 = DataFrame({ 'k1':list('aaabbbbccc'), 'k2':[1,1,2,2,3,3,3,4,2,1] },index=list('ABCDEFGHIJ'))原创 2020-08-24 20:26:33 · 403 阅读 · 1 评论 -
python-pandas模块9-数据拆分-异常值筛选-过滤
pandas模块-数据拆分-异常值筛选-过滤导入模块:import numpy as npimport pandas as pdfrom pandas import Series,DataFramefrom numpy import nan as NA# matplotlib做图模块import matplotlib.pyplot as plt# 处理中文正常显示plt.rcParams['font.sans-serif'] = ['Microsoft YaHei']1.数据拆分(分割原创 2020-08-25 14:24:26 · 1337 阅读 · 1 评论 -
python-pandas模块10-pandas读取与写入文件
pandas模块-pandas读取与写入文件导入模块:import numpy as npimport pandas as pdfrom pandas import Series,DataFramefrom numpy import nan as NA# matplotlib做图模块import matplotlib.pyplot as plt# 处理中文正常显示plt.rcParams['font.sans-serif'] = ['Microsoft YaHei']这里是下面会用到的原创 2020-08-25 17:10:13 · 639 阅读 · 1 评论 -
python-pandas11-数据聚合-分组计算
pandas-数据聚合-分组计算导入模块import numpy as npimport pandas as pdfrom pandas import Series,DataFramefrom numpy import nan as NAimport matplotlib.pyplot as pltplt.rcParams['font.sans-serif'] = ['Microsoft YaHei']先创建一个DataFrame和一个Seriesdf1 = DataFrame(原创 2020-08-25 20:53:43 · 361 阅读 · 0 评论