1.导入数据并提取前几行字段信息
# 1、导入工具包、数据,查看前几行数据列
import seaborn as sns
data = sns.load_dataset('tips')
print(data.head())
import matplotlib
matplotlib.rcParams['font.sans-serif'] = ['SimHei']
print(data.corr())
从皮尔逊相关系数可知,tips和total_bill、size之间有相关性,与total_bill甚至有强相关性。
2.小费数目通常与消费金额挂钩、有强相关性,绘制图像进行查看
#2、绘制散点图来查看小费与总消费之间的关系
import matplotlib.pyplot as plt
import pandas as pd
x = data['tip'].tolist()
y = data['total_bill'].tolist()
sns.lmplot(x='total_bill',y='tip',data=data,height=10)