ANOVA Study Notes

端庄的燕麦酱

Last modified: 2022-12-23 16:25:46

Tags: learning, probability theory

First published: 2022-12-23 11:19:29

Article link: https://blog.csdn.net/m0_74642128/article/details/128372329

5.1 One-way Analysis of Variance

5.1.1 Some notations:

$I$ : number of samples; each treatment level corresponds to one sample $i$

$\mu _{1},\mu _{2},\dots,\mu _{I}$ : treatment means

$J_{1},J_{2},\dots,J_{I}$ : sample sizes

$N$ : total number of observations across all samples, $N = J_{1}+J_{2}+\dots+J_{I}$

$x_{ij}$ : $j$-th observation in the $i$-th sample

$\bar{x}_{i.}$ : sample mean of the $i$-th sample, $\bar{x}_{i.}= \frac{\sum_{j=1}^{J_{i}}x_{ij}}{J_{i}}$

$\bar{x}_{..}$ : sample grand mean, $\bar{x}_{..} = \frac{\sum_{i = 1}^{I}\sum_{j=1}^{J_{i}}x_{ij}}{N}={\color{Red} \frac{\sum_{i = 1}^{I}J_{i}\bar{x}_{i.}}{N}}$
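To make the notation concrete, here is a minimal sketch in plain Python. The three samples and their labels are hypothetical, chosen with unequal sizes so that the red identity (the grand mean as a size-weighted average of the sample means) visibly differs from a plain average of the sample means.

```python
# Hypothetical data: three treatments with unequal sample sizes.
samples = {
    "A": [4.0, 5.0, 6.0],
    "B": [7.0, 8.0, 9.0, 10.0],
    "C": [1.0, 2.0, 3.0],
}

sizes = {name: len(xs) for name, xs in samples.items()}            # J_i
means = {name: sum(xs) / len(xs) for name, xs in samples.items()}  # x-bar_i.
n_total = sum(sizes.values())                                      # N

# Grand mean computed from all observations pooled together.
grand_mean = sum(sum(xs) for xs in samples.values()) / n_total

# The same grand mean as a size-weighted average of the sample means.
grand_mean_weighted = sum(sizes[k] * means[k] for k in samples) / n_total

# The unweighted average of the sample means is a different quantity
# whenever the sample sizes are unequal.
unweighted_average = sum(means.values()) / len(means)

print("sample means:", means)
print("grand mean:", grand_mean, "weighted form:", grand_mean_weighted)
print("unweighted average of means:", unweighted_average)
```

With equal sample sizes the weighted and unweighted versions coincide, which is why the distinction only matters for unbalanced data.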

The hypotheses are:

$H_{0}:\mu _{1}= \mu _{2}= \dots=\mu _{I}$ vs. $H_{1}$ : two or more of the $\mu_{i}$ are different.

5.1.2 Assumptions:

Treatment populations (the populations behind each sample) must

be normally distributed
have the same variance $\sigma ^{2}$ (homogeneity of variance, also called homoscedasticity)

Samples (drawn at random) must be

independent of each other

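Note:

When normality does not hold, a nonparametric test can be used instead.
Why equal variances are required: although ANOVA stands for Analysis of Variance, its goal is to test whether the group means are equal. For the means to be comparable, the groups must share a common variance $\sigma^{2}$, because the test pools the within-group variation into a single estimate of that variance.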

5.1.3 ANOVA Brief Introduction

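Basic principle of ANOVA: the differences among the treatment-group means are attributed to the following sources of variation:

Within-group variation arises only from random fluctuation inside each level, i.e. from uncontrollable random factors (random uncertainty).
Between-group variation may arise from

1) random error: measurement error and differences between individuals

2) experimental conditions: differences caused by the different treatments, i.e. differences between the treatment means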

5.1.4 SSTr (Treatment Sum of Squares):

SSTr is the spread of the sample means around the sample grand mean. It measures how much the samples differ from each other, i.e. the inter-sample (between-group) variation.

In other words, the degree to which the sample means deviate from the sample grand mean is used to judge how much the treatment means differ. (My own understanding: the spread of the sample means around the sample grand mean serves as an estimate of the spread of the treatment means.)

If the sample means are widely scattered around the sample grand mean, the treatment means are very likely different, so we reject $H_{0}$.

Formulas for SSTr (the first is the definition, the second is a computational shortcut):

$SSTr=\sum_{i =1}^{I}J_{i}(\bar{x}_{i.}-\bar{x}_{..})^2= \sum_{i=1}^{I}J_{i}\bar{x}_{i.}^2-N\bar{x}_{..}^2$

Note:
SSTr is large, i.e. the sample means are spread out widely $\Rightarrow$ we have reason to conclude that the treatment means differ, and thus reject $H_{0}$.
SSTr is small, i.e. the sample means are all close to the sample grand mean $\Rightarrow$ the data are consistent with equal treatment means, so we do not reject $H_{0}$.
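As a quick check that the definition and the shortcut formula for SSTr agree, the sketch below evaluates both on a small hypothetical data set (the numbers are made up for illustration; the function names are mine, not from any library).

```python
# Hypothetical data: three samples with sizes 3, 4 and 3.
samples = [[4.0, 5.0, 6.0], [7.0, 8.0, 9.0, 10.0], [1.0, 2.0, 3.0]]


def sstr_definition(samples):
    """SSTr = sum_i J_i * (x-bar_i. - x-bar_..)^2."""
    n = sum(len(xs) for xs in samples)
    grand = sum(sum(xs) for xs in samples) / n
    return sum(len(xs) * (sum(xs) / len(xs) - grand) ** 2 for xs in samples)


def sstr_shortcut(samples):
    """SSTr = sum_i J_i * x-bar_i.^2 - N * x-bar_..^2."""
    n = sum(len(xs) for xs in samples)
    grand = sum(sum(xs) for xs in samples) / n
    weighted_squares = sum(len(xs) * (sum(xs) / len(xs)) ** 2 for xs in samples)
    return weighted_squares - n * grand ** 2


sstr_def = sstr_definition(samples)
sstr_short = sstr_shortcut(samples)
print(sstr_def, sstr_short)  # both are 73.5 up to floating-point rounding
```

The shortcut subtracts two large, nearly equal numbers when the data have a large mean and small spread, so the definition form is usually the safer one in floating-point arithmetic.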

5.1.5 SSE (Error Sum of Squares):

We now have the formula for SSTr and know what it measures, but one question remains: how large must SSTr be before we reject $H_{0}$ and conclude that the treatment means are not equal?

To answer this we introduce SSE and compare SSTr against it.

SSE measures the variation in the individual sample points around their respective sample means:

$\begin{aligned} SSE & =\sum_{i =1}^{I}\sum_{j=1}^{J_{i}}(x_{ij}-\bar{x}_{i.})^2 \\ &= \sum_{i=1}^{I}\sum_{j=1}^{J_{i}}x_{ij}^2-\sum_{i=1}^{I}J_{i}\bar{x}_{i.}^2 \end{aligned}$

Based on the sample variance formula:

$s_{i}^2 = \frac{\sum_{j=1}^{J_{i}}(x_{ij}- \bar{x}_{i.})^2}{{\color{Red} J_{i}}-1} \Rightarrow \sum_{j=1}^{J_{i}}(x_{ij}-\bar{x}_{i.})^2 = s_{i}^2(J_{i}-1) \tag{1}$
Substituting (1) into the definition of SSE gives
$SSE = \sum_{i =1}^{I}(J_{i}-1)s_{i}^2$
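The three expressions for SSE (definition, shortcut, and the sample-variance form) can be compared numerically. This is only a sketch on hypothetical data; it relies on `statistics.variance` from the Python standard library, which computes the sample variance with the $J_{i}-1$ denominator.

```python
from statistics import mean, variance

# Hypothetical data: three samples with sizes 3, 4 and 3.
samples = [[4.0, 5.0, 6.0], [7.0, 8.0, 9.0, 10.0], [1.0, 2.0, 3.0]]

# Definition: squared deviations of each point from its own sample mean.
sse_definition = sum((x - mean(xs)) ** 2 for xs in samples for x in xs)

# Shortcut: sum of squared observations minus sum of J_i * x-bar_i.^2.
sse_shortcut = (
    sum(x ** 2 for xs in samples for x in xs)
    - sum(len(xs) * mean(xs) ** 2 for xs in samples)
)

# Variance form: sum of (J_i - 1) * s_i^2, using equation (1).
sse_from_variances = sum((len(xs) - 1) * variance(xs) for xs in samples)

print(sse_definition, sse_shortcut, sse_from_variances)  # all equal 9 up to rounding
```

The variance form is handy when only summary statistics (sizes and standard deviations) are reported rather than the raw observations.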

5.1.6 SST (Total Sum of Squares):

SST measures the variation of the individual observations around the sample grand mean:

$\begin{aligned} SST &= \sum_{i=1}^{I} \sum_{j=1}^{J_{i}}(x_{ij} -\bar{x}_{..})^2\\&=\sum_{i=1}^{I} \sum_{j=1}^{J_{i}} x_{ij}^2-N\bar{x}_{..}^2\end{aligned}$

Key identity:
$SST = SSTr + SSE$
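The notes state this identity without proof. A short derivation, using only the definitions of the three sums of squares, might run as follows. Split each deviation from the grand mean into a within-sample part and a between-sample part, then square and sum:

$\begin{aligned} \sum_{i=1}^{I}\sum_{j=1}^{J_{i}}(x_{ij}-\bar{x}_{..})^2 &= \sum_{i=1}^{I}\sum_{j=1}^{J_{i}}\left[(x_{ij}-\bar{x}_{i.})+(\bar{x}_{i.}-\bar{x}_{..})\right]^2 \\ &= \sum_{i=1}^{I}\sum_{j=1}^{J_{i}}(x_{ij}-\bar{x}_{i.})^2 + \sum_{i=1}^{I}J_{i}(\bar{x}_{i.}-\bar{x}_{..})^2 + 2\sum_{i=1}^{I}(\bar{x}_{i.}-\bar{x}_{..})\sum_{j=1}^{J_{i}}(x_{ij}-\bar{x}_{i.}) \end{aligned}$

The cross term vanishes because, within each sample, the deviations from the sample mean sum to zero: $\sum_{j=1}^{J_{i}}(x_{ij}-\bar{x}_{i.}) = J_{i}\bar{x}_{i.}-J_{i}\bar{x}_{i.} = 0$. What remains is the sum of the definitions of SSE and SSTr.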

5.1.7 ANOVA Table

d.f.(SSTr) = I - 1
d.f.(SSE) = N - I

$MSTr\ (Treatment\ Mean\ Square) =\frac{SSTr}{I-1}$
$MSE\ (Error\ Mean\ Square) = \frac{SSE}{N-I}$

ANOVA Table

Source of variation	d.f.	Sum of squares	Mean square	F
Treatment	I - 1	$SSTr=\sum_{i =1}^{I}J_{i}(\bar{x}_{i.}-\bar{x}_{..})^2$	$MSTr=\frac{SSTr}{I-1}$	$F=\frac{MSTr}{MSE}$
Error	N - I	$SSE=\sum_{i =1}^{I}\sum_{j=1}^{J_{i}}(x_{ij}-\bar{x}_{i.})^2$	$MSE=\frac{SSE}{N-I}$
Total	N - 1	$SST=\sum_{i =1}^{I}\sum_{j=1}^{J_{i}}(x_{ij}-\bar{x}_{..})^2$
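The table can be filled in mechanically from raw data. Below is a minimal sketch in plain Python; `one_way_anova` is a hypothetical helper written for these notes (not a library function) and the data are made up.

```python
def one_way_anova(samples):
    """Return the entries of a one-way ANOVA table for a list of samples."""
    k = len(samples)                               # I, number of treatments
    n = sum(len(xs) for xs in samples)             # N, total observations
    grand = sum(sum(xs) for xs in samples) / n     # sample grand mean
    means = [sum(xs) / len(xs) for xs in samples]  # sample means

    sstr = sum(len(xs) * (m - grand) ** 2 for xs, m in zip(samples, means))
    sse = sum((x - m) ** 2 for xs, m in zip(samples, means) for x in xs)
    sst = sum((x - grand) ** 2 for xs in samples for x in xs)

    mstr = sstr / (k - 1)
    mse = sse / (n - k)
    return {
        "Treatment": {"df": k - 1, "SS": sstr, "MS": mstr, "F": mstr / mse},
        "Error": {"df": n - k, "SS": sse, "MS": mse},
        "Total": {"df": n - 1, "SS": sst},
    }


# Hypothetical data: three samples with sizes 3, 4 and 3.
table = one_way_anova([[4.0, 5.0, 6.0], [7.0, 8.0, 9.0, 10.0], [1.0, 2.0, 3.0]])
for source, row in table.items():
    print(source, row)
```

For this data set the Treatment and Error sums of squares are 73.5 and 9, which add up to the Total of 82.5, and the degrees of freedom add up the same way (2 + 7 = 9).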

Key points:
1. Hypotheses:

$\left\{\begin{matrix} H_{0} :\mu_{1}=\mu_{2}=\dots=\mu_{I} \\ H_{1}: \text{Two\ or\ more\ means\ are\ not\ equal} \end{matrix}\right.$

2. Test statistic: $F=\frac{MSTr}{MSE}$

3. Decision criterion:
- If $H_{0}$ is true, $F$ is near 1
- If $H_{0}$ is false, $F$ tends to be greater than 1 (right-tailed test)

4. Underlying rationale:

$\left\{\begin{matrix} E(SSTr)= (I-1)\sigma^2\mathrm{\ if\ } H_{0}\mathrm{\ is\ true} \\ E(SSTr)> (I-1)\sigma^2\mathrm{\ if\ } H_{0}\mathrm{\ is\ false} \end{matrix}\right.$

Whether or not $H_{0}$ is true, $E({\color{Red}SSE }) = (N-I)\sigma^2$

$\Longrightarrow$ $\left\{\begin{matrix} E(MSTr)= \frac{(I-1)\sigma^2}{I-1}=\sigma^2\mathrm{\ if\ } H_{0}\mathrm{\ is\ true} \\ E(MSTr)> \sigma^2\mathrm{\ if\ } H_{0}\mathrm{\ is\ false} \end{matrix}\right.$

Again, whether or not $H_{0}$ is true, $E({\color{Red}MSE } ) =E(\frac{SSE}{N-I})= \frac{(N-I)\sigma^2}{N-I}=\sigma^2$

$\Longrightarrow$ $\left\{\begin{matrix} \text{if } H_{0} \text{ is true, } F \text{ is near 1} \\ \text{if } H_{0} \text{ is false, } F \text{ tends to be } >1 \end{matrix}\right.$
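The expectations in point 4 can be checked by simulation. The sketch below is an illustration under assumptions of my own: three treatments, five observations each, $\sigma = 2$, 4000 replications, a fixed seed, and an arbitrary alternative in which one mean is shifted by 3. It averages MSTr and MSE over many simulated data sets, once with equal treatment means and once with unequal ones.

```python
import random


def mean_squares(samples):
    """Return (MSTr, MSE) for a list of samples."""
    k = len(samples)
    n = sum(len(xs) for xs in samples)
    grand = sum(sum(xs) for xs in samples) / n
    means = [sum(xs) / len(xs) for xs in samples]
    sstr = sum(len(xs) * (m - grand) ** 2 for xs, m in zip(samples, means))
    sse = sum((x - m) ** 2 for xs, m in zip(samples, means) for x in xs)
    return sstr / (k - 1), sse / (n - k)


def average_mean_squares(mus, sigma, size, reps, rng):
    """Average MSTr and MSE over `reps` simulated normal data sets."""
    total_mstr = total_mse = 0.0
    for _ in range(reps):
        samples = [[rng.gauss(mu, sigma) for _ in range(size)] for mu in mus]
        mstr, mse = mean_squares(samples)
        total_mstr += mstr
        total_mse += mse
    return total_mstr / reps, total_mse / reps


rng = random.Random(0)   # fixed seed so the run is repeatable
sigma = 2.0              # assumed common standard deviation, so sigma^2 = 4

# H0 true: all treatment means equal.
h0_mstr, h0_mse = average_mean_squares([0.0, 0.0, 0.0], sigma, 5, 4000, rng)
# H0 false: one treatment mean is shifted.
h1_mstr, h1_mse = average_mean_squares([0.0, 0.0, 3.0], sigma, 5, 4000, rng)

print("H0 true : average MSTr, MSE =", h0_mstr, h0_mse)
print("H0 false: average MSTr, MSE =", h1_mstr, h1_mse)
```

Under $H_{0}$ both averages should land close to $\sigma^2 = 4$. Under the alternative the average MSE stays near 4 while the average MSTr is much larger, which is what pushes $F$ above 1.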

5. F test
Suppose the computed F-value is $f$ and the significance level is $\alpha$.
If ${\color{Red} F_{\alpha}(I-1,N-I)}<f$, where $F_{\alpha}(I-1,N-I)$ is the upper-$\alpha$ critical value of the F distribution, we reject $H_{0}$ at significance level $\alpha$: the data indicate a significant difference among the treatment means.

6. The equivalent p-value formulation of the F test
If the p-value $P(F \ge f) < 0.05$, where $F \sim F(I-1,N-I)$ under $H_{0}$, there is moderate evidence against $H_{0}$ in favor of $H_{1}$:
there is a significant difference among the treatment means.
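Points 5 and 6 both need tail probabilities of the F distribution. In practice a statistics library would normally supply these (for example `scipy.stats.f`), but the sketch below stays within the Python standard library. It uses the standard relation between the F survival function and the regularized incomplete beta function, evaluated with a continued fraction, and finds the critical value by bisection. All function names are my own, and the F-value fed in at the end is a hypothetical example.

```python
from math import exp, lgamma, log


def _guard(value, tiny=1e-300):
    """Keep continued-fraction terms away from exact zero."""
    return tiny if abs(value) < tiny else value


def _beta_cf(a, b, x, max_iter=300, eps=1e-14):
    """Continued fraction for the incomplete beta function (modified Lentz)."""
    c = 1.0
    d = 1.0 / _guard(1.0 - (a + b) * x / (a + 1.0))
    h = d
    for m in range(1, max_iter + 1):
        m2 = 2 * m
        # Even-numbered term of the continued fraction.
        aa = m * (b - m) * x / ((a - 1.0 + m2) * (a + m2))
        d = 1.0 / _guard(1.0 + aa * d)
        c = _guard(1.0 + aa / c)
        h *= d * c
        # Odd-numbered term of the continued fraction.
        aa = -(a + m) * (a + b + m) * x / ((a + m2) * (a + 1.0 + m2))
        d = 1.0 / _guard(1.0 + aa * d)
        c = _guard(1.0 + aa / c)
        delta = d * c
        h *= delta
        if abs(delta - 1.0) < eps:
            break
    return h


def reg_inc_beta(a, b, x):
    """Regularized incomplete beta function I_x(a, b)."""
    if x <= 0.0:
        return 0.0
    if x >= 1.0:
        return 1.0
    front = exp(lgamma(a + b) - lgamma(a) - lgamma(b)
                + a * log(x) + b * log(1.0 - x))
    if x < (a + 1.0) / (a + b + 2.0):
        return front * _beta_cf(a, b, x) / a
    return 1.0 - front * _beta_cf(b, a, 1.0 - x) / b


def f_p_value(f, d1, d2):
    """Right-tail probability P(F >= f) for F ~ F(d1, d2)."""
    return reg_inc_beta(d2 / 2.0, d1 / 2.0, d2 / (d2 + d1 * f))


def f_critical(alpha, d1, d2):
    """Upper-alpha critical value, found by bisection on the p-value."""
    lo, hi = 0.0, 1.0
    while f_p_value(hi, d1, d2) > alpha:
        hi *= 2.0
    for _ in range(100):
        mid = (lo + hi) / 2.0
        if f_p_value(mid, d1, d2) > alpha:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2.0


# Hypothetical example: I = 3 treatments, N = 10 observations, observed f = 28.58.
f_obs, d1, d2, alpha = 28.58, 2, 7, 0.05
crit = f_critical(alpha, d1, d2)
p_val = f_p_value(f_obs, d1, d2)
print("critical value:", crit, "reject H0:", f_obs > crit)
print("p-value:", p_val, "reject H0:", p_val < alpha)
```

The two decision rules always agree: $f$ exceeds the critical value exactly when the p-value falls below $\alpha$.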

5.2 An Alternate Parameterization

We now introduce another way of comparing the treatment means.

Decomposition

Assume $X_{ij} \sim N(\mu_{i},\sigma^2)$
Error term $\varepsilon_{ij}\sim N(0,\sigma^2)$
Split each $X_{ij}$ into two parts:
$X_{ij} = \mu_{i} + \varepsilon_{ij}$

Defining new variables

Define the population grand mean
$\mu=\frac{1}{I}\sum_{i=1}^{I}\mu_{i}$
Define the $i$-th treatment effect $\alpha_{i}=\mu_{i}-\mu \tag{2}$
Since $\sum_{i=1}^{I}\mu_{i} = I\mu$, it follows that $\sum_{i=1}^{I}\alpha_{i} = 0$
From (2), we know $\mu_{i} =\alpha_{i}+\mu$
Thus we can rewrite the equation for $X_{ij}$ as $X_{ij}=\mu_{i}+\varepsilon_{ij}={\color{Red}\mu+\alpha_{i}+\varepsilon_{ij} }$ where $\sum_{i=1}^{I}\alpha_{i} = 0$
For the one-way ANOVA,
$H_{0}:\mu_{1}=\mu_{2}=\dots=\mu_{I} \Leftrightarrow H_{0}:\alpha_{1}=\alpha_{2}=\dots=\alpha_{I}= {\color{Red} 0}$

Why are they all equal to 0?
$\because\mu_{i} =\alpha_{i}+\mu$
$\mu$ is fixed,
$\mu_{1}=\dots=\mu_{I}$ ,
$\therefore \alpha_{1}=\dots=\alpha_{I}$
$\text{and}\ \because\sum_{i=1}^{I}\alpha_{i} = 0$
$\therefore \alpha_{1}=\alpha_{2}=\dots=\alpha_{I}= {\color{Red} 0}$
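A small numeric illustration of the reparameterization, using hypothetical treatment means: the effects $\alpha_{i}$ always sum to zero, and they are all zero exactly when the treatment means coincide.

```python
def treatment_effects(treatment_means):
    """Return (mu, [alpha_i]) with mu the unweighted mean of the mu_i."""
    mu = sum(treatment_means) / len(treatment_means)
    return mu, [m - mu for m in treatment_means]


# Hypothetical treatment means that differ: H0 is false.
mu_diff, alpha_diff = treatment_effects([10.0, 12.0, 17.0])
print(mu_diff, alpha_diff, sum(alpha_diff))

# Hypothetical treatment means that are all equal: H0 is true.
mu_same, alpha_same = treatment_effects([12.0, 12.0, 12.0])
print(mu_same, alpha_same, sum(alpha_same))
```

In the first case the effects are $-3$, $-1$ and $4$ around $\mu = 13$; in the second every effect is 0, matching the restated $H_{0}$.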

5.3 Comparison with Random Effects Model

Comparison of the fixed effects model and the random effects model

Item	Differences	$H_{0}$
Fixed Effects Model	Treatments are chosen deliberately by the experimenters. Interest is in the specific treatment means.	$\mu_{1}=\dots=\mu_{I}$
Random Effects Model	Treatments are chosen at random from a population of possible treatments. There is no particular interest in the specific treatments chosen.	The treatment means are equal for every treatment in the population.