一、关于体温、性别、心率的临床数据
对男性体温抽样计算下95%置信区间总体均值范围。转自:https://www.jianshu.com/p/a3efca8371eb
import pandas as pd
import numpy as np
import seaborn as sns
import matplotlib.pyplot as plt
#读取数据
df = pd.read_csv('http://jse.amstat.org/datasets/normtemp.dat.txt', header = None,sep = '\s+' ,names=['体温','性别','心率'])
#选取样本大小,查看数据
np.random.seed(42)
#df.describe()
#样本量为90,查看样本数据
df_sam = df.sample(90)
df_sam.head()
#计算抽取样本中男士体温的均值
df3 = df_sam.loc[df_sam['性别']==1]
df3['体温'].mean()
#重复抽取样本,计算其他样本中男士体温的均值,得到抽样分布
boot_means = []
for _ in range(10000):
bootsample = df.sample(90, replace=True)
mean = bootsample[bootsample['性别'] == 1]['体温'