pandas.DataFrame.pivot_table创建一个pivot table。以DataFrame中的某一列或某几列分别作为index和columns,构造一个新的DataFrame。
Parameter:
- values:column to aggregate, optional
- index:column, Grouper, array, or list of the previous
- columns:column, Grouper, array, or list of the previous
- aggfunc:function, list of functions, dict, default numpy.mean
- fill_value:scalar, default None. Value to replace missing values with
- margins:boolean, default False. Add all row / columns (e.g. for subtotal / grand totals)
- margins_name: string, default ‘All’. Name of the row / column that will contain the totals when margins is True.
下面是一个电影评分的例子:
userId、movie和raing分别表示用户Id、电影和评分,例如,第0行表示Id为1的用户对电影m1评分是3,但是这样的表格看起来比较混乱,尤其是当userId和movie乱序时,我们可以将它转化为index为用户Id,columns为电影的表格:
import pandas as pd
import numpy as np
data = {
'userId': [1, 1, 1, 1, 1, 2, 2, 2, 2, 2