The Pandas Module for Python
The Pandas module for data analysis
huskyKKK
python-pandas module 1: basic data types 1
pandas module: basic data types. Import the modules: import pandas as pd; import numpy as np; from pandas import Series, DataFrame. 1. Series: pd.Series() returns a data object with index and values attributes, e.g. s = pd.Series([4, -7, 3, 9]). (1) Custom index: s1 = Series([4, 7, 6, 5], index=['a', 'b', 'c', 'd'], dtype=float). (2) Reading a Series: using …
Original · 2020-08-21 20:03:15 · 232 reads · 0 comments
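The Series construction described in this excerpt can be sketched as follows. The two Series come from the excerpt; the lookups at the end are only an illustrative guess at what "reading a Series" might cover, since the excerpt is cut off at that point.

```python
import pandas as pd

# Series with the default integer index (0, 1, 2, ...)
s = pd.Series([4, -7, 3, 9])

# Series with a custom index and an explicit float dtype
s1 = pd.Series([4, 7, 6, 5], index=['a', 'b', 'c', 'd'], dtype=float)

# A Series exposes its labels and its data separately
default_labels = list(s.index)
raw_values = s.values.tolist()

# Illustrative lookups (assumed, not from the excerpt):
b_value = s1['b']          # a single label
subset = s1[['a', 'c']]    # a list of labels returns a smaller Series
```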
python-pandas module 2: basic data types 2
pandas module: basic data types 2. Import the modules: import pandas as pd; import numpy as np; from pandas import Series, DataFrame. 3. Creating a DataFrame from a dict of dicts. Create the dict and convert it to a DataFrame: dict_city = {'shanghai': {2019: 54000, 2020: 56000}, 'beijing': {2018: 66666, 2019: 77777}}; df_city = DataFrame(dic…
Original · 2020-08-21 20:18:33 · 150 reads · 0 comments
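A minimal sketch of the dict-of-dicts conversion, using the dict_city shown in the excerpt. The constructor call is truncated in the excerpt, so passing the dict directly to DataFrame is an assumption about how it continues.

```python
import pandas as pd

dict_city = {
    'shanghai': {2019: 54000, 2020: 56000},
    'beijing': {2018: 66666, 2019: 77777},
}

# Outer keys become columns, inner keys become row labels.
# Assumed call: the excerpt is cut off after "DataFrame(dic".
df_city = pd.DataFrame(dict_city)

# Years missing for a city are filled with NaN
missing_2018 = pd.isna(df_city.loc[2018, 'shanghai'])
```

Because the two cities do not cover the same years, the row index is the union of all inner keys, and the gaps appear as NaN.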
python-pandas module 3: index operations & arithmetic
pandas module: index operations & arithmetic. Import the modules: import numpy as np; import pandas as pd; from pandas import Series, DataFrame. 1. Indexing. Create a Series: s1 = Series(range(1, 5), index=list('ABCD')); s1 …
Original · 2020-08-22 10:32:42 · 374 reads · 0 comments
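As a rough illustration of the two topics in this entry's title, the sketch below starts from the s1 in the excerpt. The second Series s2 and the specific operations are hypothetical, chosen only to show label-based alignment.

```python
import pandas as pd

s1 = pd.Series(range(1, 5), index=list('ABCD'))

# Hypothetical second Series that lacks the label 'A'
s2 = pd.Series([10, 20, 30], index=list('BCD'))

# Arithmetic aligns on labels; labels present on one side only give NaN
total = s1 + s2

# add() with fill_value treats the missing side as 0 instead
filled = s1.add(s2, fill_value=0)

# reindex() reorders the labels and introduces NaN for new ones
reindexed = s1.reindex(list('DCBAE'))
```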
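python-pandas module 4: function mapping, sorting & duplicate axis checks
pandas module: function mapping, sorting & duplicate axis checks. Import the modules: import numpy as np; import pandas as pd; from pandas import Series, DataFrame. 1. Function application and mapping. First create a DataFrame: df1 = DataFrame(np.random.choice(range(20), size=(4, 3), replace=False), index=list('ABCD'), columns=list('abc')); df1 …
Original · 2020-08-23 19:12:42 · 230 reads · 0 comments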
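The three topics named in this entry can be sketched as below. To keep the result reproducible, the random data from the excerpt is swapped for np.arange (an assumption), and the lambda and the dup Series are hypothetical.

```python
import numpy as np
import pandas as pd

# Deterministic stand-in for the random 4x3 frame in the excerpt
df = pd.DataFrame(
    np.arange(12).reshape(4, 3),
    index=list('ABCD'),
    columns=list('abc'),
)

# Function application: apply() runs per column by default, per row with axis=1
col_range = df.apply(lambda col: col.max() - col.min())
row_range = df.apply(lambda row: row.max() - row.min(), axis=1)

# Sorting by the values of column 'b', largest first
sorted_desc = df.sort_values(by='b', ascending=False)

# Duplicate axis check: is_unique is False when a label repeats
dup = pd.Series([1, 2, 3], index=['x', 'x', 'y'])
has_unique_labels = dup.index.is_unique
```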
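python-pandas module 5: summary calculations, descriptive statistics, unique values, value counts, membership, missing-value handling
pandas module: summary calculations, descriptive statistics, unique values, value counts, membership, missing-value handling. Import the modules: import numpy as np; import pandas as pd; from pandas import Series, DataFrame. 1. Summary calculations on pandas Series and DataFrame objects: xxx.sum(), xxx.mean(), xxx.max(), xxx.add(). First create a DataFrame containing NaN: df1 = DataFrame([[3, 2, np.nan], …
Original · 2020-08-24 10:37:09 · 750 reads · 0 comments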
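A small sketch of the operations listed in this entry's title. The data here is hypothetical (the excerpt's own DataFrame is truncated), so treat the values as placeholders.

```python
import numpy as np
import pandas as pd

# Hypothetical frame with missing values
df = pd.DataFrame(
    [[1.0, np.nan], [2.0, 4.0], [np.nan, np.nan]],
    columns=['x', 'y'],
)

# Summary calculations skip NaN by default
col_sums = df.sum()

# Missing-value handling: count, drop, or fill
nan_counts = df.isna().sum()
without_empty_rows = df.dropna(how='all')   # drop rows that are entirely NaN
zero_filled = df.fillna(0)

# Unique values, value counts and membership on a Series
s = pd.Series(list('abca'))
uniques = list(s.unique())
counts = s.value_counts()
membership = s.isin(['a', 'b'])
```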
python-pandas module 6: merging datasets
pandas module: merging datasets. Import the modules: import numpy as np; import pandas as pd; from pandas import Series, DataFrame; from numpy import nan as NA. 1. pd.merge(): merging one DataFrame with another. Create two DataFrames to join: df1 = DataFrame({'key': list('aabbabc'), 'data1': np.arange(7) * 10}); df1 …
Original · 2020-08-24 14:42:54 · 231 reads · 0 comments
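The merge described here might look like the sketch below. df1 is taken from the excerpt; df2 is hypothetical, because the excerpt is cut off before the second DataFrame is defined.

```python
import numpy as np
import pandas as pd

df1 = pd.DataFrame({'key': list('aabbabc'), 'data1': np.arange(7) * 10})

# Hypothetical right-hand table: key 'd' has no match in df1,
# and df1's key 'c' has no match here
df2 = pd.DataFrame({'key': list('abd'), 'data2': [1, 2, 3]})

# Inner join (the default): only keys present in both tables
inner = pd.merge(df1, df2, on='key')

# Outer join: keys from either table, NaN where one side is missing
outer = pd.merge(df1, df2, on='key', how='outer')

# Left join: every row of df1 is kept
left = pd.merge(df1, df2, on='key', how='left')
```

Stating on='key' explicitly is a design choice; pd.merge would otherwise join on whatever column names the two tables share.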
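python-pandas module 7: reshaping and axis selection
pandas module: reshaping and axis selection. Import the modules: import numpy as np; import pandas as pd; from pandas import Series, DataFrame; from numpy import nan as NA. 1. Data wrangling: reshaping and axis selection. (1) Data with a multi-level index: create a Series …
Original · 2020-08-24 19:31:52 · 108 reads · 0 comments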
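One common way to reshape data with a multi-level index is unstack() and stack(). The sketch below assumes that is the kind of reshaping this entry covers; the Series itself is hypothetical, since the excerpt stops before defining one.

```python
import pandas as pd

# Hypothetical Series with a two-level index
index = pd.MultiIndex.from_product([['a', 'b'], [1, 2]])
s = pd.Series(range(4), index=index)

# Select everything under one outer label
inner_of_b = s['b']

# unstack() moves the inner index level into the columns
wide = s.unstack()

# stack() moves the columns back into the inner index level
long_again = wide.stack()
```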
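python-pandas module 8: deduplication, functions and mapping, replacing values
pandas module: deduplication, functions and mapping, replacing values. Import the modules: import numpy as np; import pandas as pd; from pandas import Series, DataFrame; from numpy import nan as NA. 1. Deduplication. Create data with duplicates: df1 = DataFrame({'k1': list('aaabbbbccc'), 'k2': [1, 1, 2, 2, 3, 3, 3, 4, 2, 1]}, index=list('ABCDEFGHIJ')) …
Original · 2020-08-24 20:26:33 · 387 reads · 1 comment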
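Using the df1 from this excerpt, the three operations in the title can be sketched as follows. The particular mapping function and the replaced value are illustrative assumptions.

```python
import numpy as np
import pandas as pd

df1 = pd.DataFrame(
    {
        'k1': list('aaabbbbccc'),
        'k2': [1, 1, 2, 2, 3, 3, 3, 4, 2, 1],
    },
    index=list('ABCDEFGHIJ'),
)

# duplicated() flags rows that repeat an earlier row exactly
dup_mask = df1.duplicated()

# drop_duplicates() keeps the first occurrence of each row
deduped = df1.drop_duplicates()

# map() applies a function (or dict) to each element of a Series
upper = df1['k1'].map(str.upper)

# replace() swaps one value for another, here 4 -> NaN
replaced = df1['k2'].replace(4, np.nan)
```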
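python-pandas module 9: data splitting, outlier screening, filtering
pandas module: data splitting, outlier screening, filtering. Import the modules: import numpy as np; import pandas as pd; from pandas import Series, DataFrame; from numpy import nan as NA; import matplotlib.pyplot as plt (the matplotlib plotting module); plt.rcParams['font.sans-serif'] = ['Microsoft YaHei'] (so that Chinese text displays correctly). 1. Data splitting (segmentation …
Original · 2020-08-25 14:24:26 · 1311 reads · 1 comment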
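The excerpt is cut off before it shows how the data is split, so the sketch below is only one plausible reading: binning values with pd.cut, then screening outliers with a boolean mask. The data, bin edges, labels and the threshold of 3 are all hypothetical.

```python
import pandas as pd

# Splitting values into labelled bins (assumed meaning of "data splitting")
ages = [18, 25, 37, 61]
bins = [0, 30, 60, 100]
groups = pd.cut(ages, bins=bins, labels=['young', 'middle', 'senior'])

# Outlier screening: rows where any column exceeds 3 in absolute value
df = pd.DataFrame({'x': [0.5, -4.0, 1.2], 'y': [3.5, 0.1, -0.2]})
outliers = df[(df.abs() > 3).any(axis=1)]

# Filtering alternative: cap the values at the threshold instead of dropping rows
clipped = df.clip(lower=-3, upper=3)
```

By default pd.cut uses intervals that are closed on the right, so a value of exactly 30 would fall in the first bin.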
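python-pandas module 10: reading and writing files with pandas
pandas module: reading and writing files with pandas. Import the modules: import numpy as np; import pandas as pd; from pandas import Series, DataFrame; from numpy import nan as NA; import matplotlib.pyplot as plt (the matplotlib plotting module); plt.rcParams['font.sans-serif'] = ['Microsoft YaHei'] (so that Chinese text displays correctly). Here are the … that will be used below …
Original · 2020-08-25 17:10:13 · 625 reads · 1 comment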
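A minimal round trip through CSV, as one example of reading and writing with pandas. The excerpt does not show which files or formats the article uses, so the data is hypothetical and an in-memory buffer stands in for a file on disk.

```python
import io

import pandas as pd

# Hypothetical data to write out
df = pd.DataFrame({'name': ['a', 'b'], 'score': [1.5, 2.0]})

# Write to an in-memory text buffer; a file path works the same way
buffer = io.StringIO()
df.to_csv(buffer, index=False)   # index=False leaves the row labels out

# Rewind the buffer and read the CSV back in
buffer.seek(0)
loaded = pd.read_csv(buffer)
```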
python-pandas 11: data aggregation and grouped calculations
pandas: data aggregation and grouped calculations. Import the modules: import numpy as np; import pandas as pd; from pandas import Series, DataFrame; from numpy import nan as NA; import matplotlib.pyplot as plt; plt.rcParams['font.sans-serif'] = ['Microsoft YaHei']. First create a DataFrame and a Series: df1 = DataFrame( …
Original · 2020-08-25 20:53:43 · 347 reads · 0 comments
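Grouped calculation usually follows a split, apply, combine pattern with groupby(). The sketch below assumes that pattern; the DataFrame and its column names key1 and data1 are hypothetical, because the excerpt is truncated before the data is defined.

```python
import pandas as pd

# Hypothetical data: a grouping key and a numeric column
df = pd.DataFrame({
    'key1': list('aabba'),
    'data1': [1, 2, 3, 4, 5],
})

# Split by key1, then aggregate data1 within each group
grouped = df.groupby('key1')['data1']
sums = grouped.sum()
means = grouped.mean()

# agg() computes several aggregations at once, one column per function
stats = grouped.agg(['sum', 'mean', 'max'])
```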