Analysing the data from a supermarket
Import the data 导入数据
import pandas as pd
import matplotlib.pyplot as plt
get_ipython().run_line_magic(‘matplotlib’, ‘inline’)
data = pd.read_csv(‘order143.csv’)
data.head()
##Change the name of the columns due to some erroes.
改变列的名称
columns_new =
[‘Product_ID’, ‘Class_ID’, ‘Name_of_Branch’, ‘Price’, ‘Sale_Volume’,
‘Transaction_Time’, ‘Order_ID’]
data.columns = columns_new
data.head()
data.info()
Which kind of product sales better
Classified by Class_ID, sum the Sale_Volume, and sort a decreasing sequence by Sale_Volume
data_1 = data.groupby(‘Class_ID’)[‘Sale_Volume’].sum()
data_2 = data_1.reset_index()
data_3 = data_2. sort_values(by = ‘Sale_Volume’,