使用pandas包报错备忘
df.nunique() # 返回不重复元素的数量,以键值对的形式返回
df.unique() # 会报错,因为unique()针对series,按列使用变不报错
【例】:
import pandas as pd
from rdt import get_demo
data = get_demo()
data
0_int 1_float 2_str 3_datetime0
0 68.0 82.394134 c NaT
1 48.0 68.819774 NaN 2021-01-13 20:11:00
2 NaN 3.944822 b 2021-02-19 12:21:00
3 NaN 11.090033 a 2021-05-18 08:58:00
4 35.0 NaN b 2020-09-28 17:49:00
5 99.0 NaN NaN NaT
6 46.0 1.801142 d 2020-02-08 02:25:00
7 13.0 70.168589 c 2019-10-29 08:38:00
8 72.0 71.931300 a 2021-02-01 00:37:00
9 20.0 63.330224 b 2020-01-07 13:42:00
data.nunique()
0_int 8 1_float 8 2_str 4 3_datetime 8 dtype: int64
data.nunique().values
array([8, 8, 4, 8])
data['0_int'].unique()
array([68., 48., nan, 35., 99., 46., 13., 72., 20.])