Pandas
qq_42052864
Custom sort order in pandas
import pandas as pd
from pandas.api.types import CategoricalDtype
a = ['红红','白白','看看','慢慢','kini','ssfs','fff']
b = ["Mar(0, 15]","Jan(15, 31]","Aug(15, 31]","Sep(0, 15]","Jun(15, 31]","Jul(0, 15]","May(15, 31]"]
df = pd.DataFrame({"编辑":a,"月份":b})
df
cat_variable_order ...
Original · 2021-04-21 14:32:35 · 680 views · 0 comments
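The excerpt above is cut off before the ordering is defined, so the following is only a minimal sketch of how a custom sort order can be expressed with `CategoricalDtype`. The column names, the sample rows, and the rule "calendar month first, then first half before second half" are assumptions for illustration, not the author's actual `cat_variable_order`.

```python
import pandas as pd
from pandas.api.types import CategoricalDtype

# Hypothetical data: half-month labels in the style of the excerpt.
df = pd.DataFrame({
    "editor": ["a", "b", "c", "d"],
    "month": ["Mar(0, 15]", "Jan(15, 31]", "Aug(15, 31]", "Jan(0, 15]"],
})

# Assumed ordering: calendar month first, then (0, 15] before (15, 31].
months = ["Jan", "Feb", "Mar", "Apr", "May", "Jun",
          "Jul", "Aug", "Sep", "Oct", "Nov", "Dec"]
halves = ["(0, 15]", "(15, 31]"]
month_order = [m + h for m in months for h in halves]

# An ordered categorical sorts by category position, not alphabetically.
month_dtype = CategoricalDtype(categories=month_order, ordered=True)
df["month"] = df["month"].astype(month_dtype)

sorted_df = df.sort_values("month").reset_index(drop=True)
print(sorted_df)
```

Any label that is missing from `month_order` becomes NaN after `astype`, so the category list should cover every value that can occur.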
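The pd.DataFrame.melt() function
My understanding of this function is that it turns a two-dimensional (wide) table into a one-dimensional (long) one, i.e. it unpivots the columns.
melt(self, id_vars=None, value_vars=None, var_name=None, value_name='value', col_level=None)
Parameters
----------
id_vars : tuple, list, or ndarray, optional
    Column(s) to use as identifier variables.
v...
Original · 2021-04-21 14:23:24 · 871 views · 0 comments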
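To make the wide-to-long idea concrete, here is a small sketch. The table and its column names (`name`, `math`, `physics`, `subject`, `score`) are made up for illustration.

```python
import pandas as pd

# Hypothetical wide table: one row per student, one column per subject.
wide = pd.DataFrame({
    "name": ["Ann", "Bob"],
    "math": [90, 75],
    "physics": [85, 80],
})

# id_vars stay as identifier columns; each value_vars column is stacked
# into (variable, value) pairs, named here via var_name and value_name.
long = wide.melt(
    id_vars="name",
    value_vars=["math", "physics"],
    var_name="subject",
    value_name="score",
)
print(long)
```

Each row of `long` holds one (name, subject, score) triple, so two subject columns for two students become four rows.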
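GroupBy techniques, data aggregation, pivot tables and cross-tabulations
from __future__ import division
from numpy.random import randn
import numpy as np
import os
import matplotlib.pyplot as plt
from io import StringIO
# The StringIO module is mainly used to read and write data in an in-memory buffer. The module is written with classes and has only one StringIO class...
Original · 2018-08-15 17:26:00 · 654 views · 0 comments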
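The excerpt stops at the imports, so this is only a guess at how `StringIO` might be combined with the grouping operations named in the title: a CSV held in an in-memory buffer is read with `pd.read_csv` and then aggregated with `groupby`. The CSV text and the column names are hypothetical.

```python
from io import StringIO

import pandas as pd

# Hypothetical CSV content held in memory rather than in a file.
csv_text = """key,value
a,1
b,2
a,3
"""

# StringIO wraps the text in a file-like object that read_csv accepts.
buffer = StringIO(csv_text)
df = pd.read_csv(buffer)

# Group rows by key and sum the values within each group.
totals = df.groupby("key")["value"].sum()
print(totals)
```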
Descriptive statistics and one-sample tests
import pandas as pd
from pandas import Series,DataFrame
import numpy as np
a=[98,83,65,72,79,76,75,94,91,77,63,83,89,69,64,78,63,86,91,72,71,72,70,80,65,70,62,74,71,76]
np.mean(a),np.mean(np.sort(a)...
Original · 2018-08-15 17:34:06 · 217 views · 0 comments
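As a sketch of what a one-sample test on these 30 values could look like, the block below computes the usual descriptive statistics and a one-sample t statistic by hand with NumPy. The hypothesized mean `mu0 = 75` is an assumption chosen for illustration; the excerpt does not say which value the author tested against.

```python
import numpy as np

# The 30 scores from the excerpt.
a = [98, 83, 65, 72, 79, 76, 75, 94, 91, 77,
     63, 83, 89, 69, 64, 78, 63, 86, 91, 72,
     71, 72, 70, 80, 65, 70, 62, 74, 71, 76]

n = len(a)
mean = np.mean(a)
median = np.median(a)
std = np.std(a, ddof=1)  # sample standard deviation (n - 1 in the denominator)

# One-sample t statistic against an assumed population mean.
mu0 = 75  # hypothetical value, not taken from the original post
t_stat = (mean - mu0) / (std / np.sqrt(n))

print(n, mean, median, std, t_stat)
```

The statistic follows a t distribution with `n - 1` degrees of freedom under the null hypothesis; `scipy.stats.ttest_1samp(a, mu0)` returns the same statistic together with a p-value.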
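Learning pandas: string object methods and regular expressions
String object methods
val='a,b, guido'
pieces=[x.strip() for x in val.split(',')]  # split on commas and strip the whitespace around each piece
first,second,third=pieces
first+"::"+second+"::"+third  # concatenate the strings; gives 'a::b::guido'
# equivalent to the .join method:
"::".join(pieces...
Original · 2018-08-14 11:54:42 · 1693 views · 0 comments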
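The steps in the excerpt can be run end to end as follows. The last part, which uses `re.split`, is an added illustration of the regular-expression route mentioned in the title and is not taken from the visible excerpt.

```python
import re

val = "a,b, guido"

# Split on commas, then strip the whitespace around each piece.
pieces = [x.strip() for x in val.split(",")]
first, second, third = pieces

# Manual concatenation and str.join give the same string.
joined_plus = first + "::" + second + "::" + third
joined = "::".join(pieces)

# A regular expression can split and trim in a single step:
# a comma with optional whitespace on either side is the delimiter.
regex_pieces = re.split(r"\s*,\s*", val)

print(pieces, joined, regex_pieces)
```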
pandas data wrangling: cleaning, transforming, merging, reshaping
from __future__ import division
import numpy as np
import pandas as pd
import os
import matplotlib.pyplot as plt
from scipy.interpolate import lagrange  # import the Lagrange interpolation function
from pandas import Series,DataFrame...
Original · 2018-08-13 23:31:04 · 372 views · 0 comments
Merging data in pandas
Data merging: pd.merge() and .join()
from __future__ import division
import numpy as np
import pandas as pd
import os
import matplotlib.pyplot as plt
from scipy.interpolate import lagrange  # import the Lagrange interpolation function
from panda...
Original · 2018-08-13 16:13:15 · 747 views · 0 comments
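Since the excerpt ends before any merge is shown, here is a minimal sketch of the two calls it names. The frames `left` and `right` and their columns are hypothetical.

```python
import pandas as pd

# Hypothetical frames sharing a "key" column.
left = pd.DataFrame({"key": ["a", "b", "c"], "lval": [1, 2, 3]})
right = pd.DataFrame({"key": ["a", "b", "d"], "rval": [10, 20, 40]})

# pd.merge joins on columns; the default how="inner" keeps matching keys only.
inner = pd.merge(left, right, on="key")

# how="outer" keeps every key from both sides and fills the gaps with NaN.
outer = pd.merge(left, right, on="key", how="outer")

# DataFrame.join aligns on the index, so the key is moved into the index first.
joined = left.set_index("key").join(right.set_index("key"), how="left")

print(inner, outer, joined, sep="\n\n")
```

`merge` is the more general tool (arbitrary key columns on each side), while `.join` is a convenience for index-aligned combination.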
Pandas study notes: Series and DataFrame
Series
import numpy as np
import pandas as pd
import sys
from pandas import Series,DataFrame
obj=Series([4,7,-5,3],index=['d','b','a','c'])
obj
obj[['d','c']]
obj['b']=6
obj
obj*2
obj[obj>2] ...
Original · 2018-08-12 15:28:40 · 235 views · 0 comments
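For readers following along, the same Series operations are repeated below with a comment on what each one does. The variable names `picked`, `doubled` and `filtered` are introduced here only to hold the intermediate results.

```python
import pandas as pd

obj = pd.Series([4, 7, -5, 3], index=["d", "b", "a", "c"])

# Select several elements by label; the result keeps the requested order.
picked = obj[["d", "c"]]

# Assign to a single label in place.
obj["b"] = 6

# Arithmetic is applied element-wise and preserves the index.
doubled = obj * 2

# A boolean mask keeps only the elements where the condition holds.
filtered = obj[obj > 2]

print(picked, doubled, filtered, sep="\n\n")
```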