pd.crosstab()
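A minimal sketch of pd.crosstab, which counts how often each combination of two columns occurs. The toy data and the column names sex and pclass are assumptions made for illustration only.

```python
import pandas as pd

# Hypothetical toy data; the values are made up for this example.
df = pd.DataFrame({
    "sex": ["male", "female", "female", "male", "female"],
    "pclass": [1, 1, 3, 3, 3],
})

# Rows are the unique values of "sex", columns the unique values of "pclass",
# and each cell holds the number of rows with that combination.
table = pd.crosstab(df["sex"], df["pclass"])
print(table)
```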
1. Narrow down the dataset
DataFrame.query()
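A short sketch of how DataFrame.query() can restrict a dataset to a range of values. The columns x and y and the bounds are hypothetical, not taken from any particular dataset.

```python
import pandas as pd

# Hypothetical data; "x" and "y" are placeholder column names.
df = pd.DataFrame({"x": [0.5, 1.5, 2.5, 3.5], "y": [10, 20, 30, 40]})

# Keep only the rows whose x value lies strictly between 1.0 and 3.0.
subset = df.query("x > 1.0 & x < 3.0")
print(subset)
```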
2. Process the date data
pd.to_datetime
pd.DatetimeIndex
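One possible way to combine the two calls: pd.to_datetime parses the raw values, and pd.DatetimeIndex wraps the result so that fields such as month or hour can be read directly. The sample date strings are assumptions for illustration.

```python
import pandas as pd

# Hypothetical raw date strings.
raw = pd.Series(["2017-01-01 08:30:00", "2017-02-15 21:05:00"])

# Parse the strings into datetime64 values.
timestamps = pd.to_datetime(raw)

# Wrap them in a DatetimeIndex to access date components as attributes.
date_index = pd.DatetimeIndex(timestamps)
print(date_index.month)
print(date_index.hour)
```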
3. Add the split-out date components as new columns
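A sketch of this step, assuming the dates live in a column named time (a hypothetical name). It uses the Series .dt accessor, which is one of several ways to split a date into parts.

```python
import pandas as pd

# Hypothetical frame with a raw date column called "time".
df = pd.DataFrame({"time": ["2017-01-01 08:30:00", "2017-02-15 21:05:00"]})
df["time"] = pd.to_datetime(df["time"])

# Add one new column per date component.
df["day"] = df["time"].dt.day
df["hour"] = df["time"].dt.hour
df["weekday"] = df["time"].dt.weekday  # Monday is 0, Sunday is 6
print(df)
```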
4. Drop the date data that is no longer needed
DataFrame.drop
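A minimal sketch of dropping a column once it is no longer needed. Note that drop is a method on the DataFrame itself, not a top-level pandas function. The column names here are hypothetical.

```python
import pandas as pd

# Hypothetical frame where "time" has already been split into "day".
df = pd.DataFrame({
    "time": ["2017-01-01 08:30:00", "2017-02-15 21:05:00"],
    "day": [1, 15],
    "value": [3, 4],
})

# drop returns a new frame by default; reassign to keep the result.
df = df.drop(columns=["time"])
print(df.columns.tolist())
```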
5. Select multiple columns by passing a list of column names:
x=data[['pclass','age','sex']]
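For context, a self-contained version of the line above. The frame contents, including the extra fare column, are made up for illustration.

```python
import pandas as pd

# Hypothetical data; only the column names pclass, age and sex come from the note above.
data = pd.DataFrame({
    "pclass": [1, 3],
    "age": [29.0, 2.0],
    "sex": ["female", "male"],
    "fare": [211.3, 21.1],
})

# The outer brackets index the frame; the inner list names the columns to keep.
x = data[["pclass", "age", "sex"]]
print(x)
```

Passing a list returns a DataFrame, whereas a single name such as data['age'] returns a Series.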
Convert a DataFrame to dictionary format
df.to_dict(orient='records')
[{'colA': 'A', 'colB': 'X', 'colC': 100, 'colD': 90},
{'colA': 'A', 'colB': nan, 'colC': 50, 'colD': 60},
{'colA': 'B', 'colB': 'Ya', 'colC': 30, 'colD': 60},
{'colA': 'C', 'colB': 'Xb', 'colC': 50, 'colD': 80},
{'colA': 'A', 'colB': 'Xa', 'colC': 20, 'colD': 50}]
df.to_dict(orient='dict')
{'colA': {0: 'A', 1: 'A', 2: 'B', 3: 'C', 4: 'A'},
'colB': {0: 'X', 1: nan, 2: 'Ya', 3: 'Xb', 4: 'Xa'},
'colC': {0: 100, 1: 50, 2: 30, 3: 50, 4: 20},
'colD': {0: 90, 1: 60, 2: 60, 3: 80, 4: 50}}
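To reproduce the two outputs above, here is a sketch that rebuilds the frame from the values shown. The construction itself is an assumption, since the original DataFrame is not given.

```python
import numpy as np
import pandas as pd

# Frame reconstructed from the outputs shown above.
df = pd.DataFrame({
    "colA": ["A", "A", "B", "C", "A"],
    "colB": ["X", np.nan, "Ya", "Xb", "Xa"],
    "colC": [100, 50, 30, 50, 20],
    "colD": [90, 60, 60, 80, 50],
})

# orient='records': a list with one dict per row.
records = df.to_dict(orient="records")

# orient='dict' (the default): one dict per column, keyed by index label.
columns = df.to_dict(orient="dict")

print(records[0])
print(columns["colC"])
```

The records form is usually the convenient one when each row should become a separate feature dict, while the dict form keeps the column-wise layout.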