数据
kaggle地址:https://www.kaggle.com/mehdidag/black-friday
也可以去我的资源里下
一、导入数据
老样子,用pd.read_csv导入,数据集共54w条数据只有23M,不用担心(之前分析一个220M的数据集直接把我电脑卡死了…)
data = pd.read_csv('./data/BlackFriday.csv')
data.head()
>>>
User_ID Product_ID Gender Age Occupation City_Category Stay_In... Marital_Status Product_Category_1 Product_Category_2 Product_Category_3 Purchase
0 1000001 P00069042 F 0-17 10 A 2 0 3 0.0 0.0 8370
1 1000001 P00248942 F 0-17 10 A 2 0 1 6.0 14.0 15200
2 1000001 P00087842 F 0-17 10 A 2 0 12 0.0 0.0 1422
3 1000001 P00085442 F 0-17 10 A 2 0 12 14.0 0.0 1057
4 1000002 P00285442 M 55+ 16 C 4+ 0 8 0.0 0.0 7969