Weak14 Jupyter homework

最新推荐文章于 2021-12-19 13:30:31 发布

GH_Loeng

最新推荐文章于 2021-12-19 13:30:31 发布

阅读量193

点赞数 1

分类专栏： Python Homework

本文链接：https://blog.csdn.net/qq_36475045/article/details/80635859

版权

Python 同时被 2 个专栏收录

20 篇文章 0 订阅

订阅专栏

Homework

16 篇文章 0 订阅

订阅专栏

%matplotlib inline

import random

import numpy as np
import scipy as sp
import pandas as pd
import matplotlib.pyplot as plt
import seaborn as sns

import statsmodels.api as sm
import statsmodels.formula.api as smf

sns.set_context("talk")

Anscombe’s quartet
Anscombe’s quartet comprises of four datasets, and is rather famous. Why? You’ll find out in this exercise.

anascombe = pd.read_csv('data/anscombe.csv')
anascombe.head()

这里写图片描述
Part 1
For each of the four datasets:
Compute the mean and variance of both x and y

print(anascombe.groupby('dataset')['x'].mean())
print(anascombe.groupby('dataset')['y'].mean())

这里写图片描述

print(anascombe.groupby('dataset')['x'].var())
print(anascombe.groupby('dataset')['y'].var())

这里写图片描述
Compute the correlation coefficient between x and y

for i in range(4):
    X = anascombe.x[0+11*i:11+11*i].values
    Y = anascombe.y[0+11*i:11+11*i].values
    #形成的是2*2的相关系数矩阵
    print("the correlation of dataset%d is %f"%(i+1,np.corrcoef(X,Y)[0][1]))

这里写图片描述

Compute the linear regression line: y=β0+β1x+ϵ (hint: use statsmodels and look at the Statsmodels notebook)

for i in range(4):
    X = anascombe.x[0+11*i:11+11*i].values#11个样本点
    Y = anascombe.y[0+11*i:11+11*i].values
    train = sm.add_constant(X)#样本集左侧加上一列1，构成12维
    model = sm.OLS(Y,train).fit()
    print("The linear regression line of dataset%d is y = %f + %fx"%(i+1,model.params[0],model.params[1]))

这里写图片描述
Part 2
Using Seaborn, visualize all four datasets.
hint: use sns.FacetGrid combined with plt.scatter

m = sns.FacetGrid(anascombe, col="dataset")    
m.map(plt.scatter, "x","y")

这里写图片描述

GH_Loeng

关注

1
点赞
踩
0

收藏

觉得还不错? 一键收藏
2
评论
Weak14 Jupyter homework

%matplotlib inlineimport randomimport numpy as npimport scipy as spimport pandas as pdimport matplotlib.pyplot as pltimport seaborn as snsimport statsmodels.api as smimport statsmodels.form...
复制链接

扫一扫