04-Hypothesis Test & p-value

最新推荐文章于 2022-11-27 11:27:54 发布

hongo0o0

最新推荐文章于 2022-11-27 11:27:54 发布

阅读量1.1k

点赞数

Statistical Analysis 专栏收录该内容

5 篇文章

订阅专栏

本文详细介绍了进行假设检验的五个步骤：设定零假设、设置显著性水平、收集数据、计算检验统计量及p值，并通过一个具体例子说明如何根据p值判断是否拒绝零假设。

摘要生成于 C知道，由 DeepSeek-R1 满血版支持，前往体验 >

Hypothesis Test Process

1. Set a Null Hypothesis

The null hypothesis can be thought of as the opposite of the "guess" the research made (in this example the biologist thinks the plant height will be different for the fertilizers). So the null would be that there will be no difference among the groups of plants.

2. Set the significance level (alpha)


	H0 is true	H0 is false
Accept	true positive	false positive
Reject	true negative (alpha)	false negative

The term significance level (alpha) is used to refer to a pre-chosen probability, also called Type I error.

3. Collect Data

Usually we use Permutation sampling to collect the new random data for the following hypothesis test, called hypothesis data. The original data is called observed data.

4. Calculate the test statistic

Could be the mean of differences, pearson correlation r value and so on. Depend on real situation.

5. Calculate the p-value

the term "p-value" is used to indicate a probability that you calculate after a given study. p = "sum(the situations that can prove the null hypothesis is true)/#ofAllSituations"

If your p-value is less than the chosen significance level then you reject the null hypothesis.

A full example:

Null hypothesis: performance in the finals and semifinals are identical

Test statistic: the mean of the fractional improvement. mean(f) where f = (semi_perm - final_perm) / semi_perm

f_mean is calculated from observed data, perm_reps[i] is calculated from hypothesis data.

p-value: sum(perm_reps >= f_mean) / len(perm_reps)

so, if hypothesis is true, the ideal situation is that f_mean is some value close to the average value of perm_reps so that the p-value should be 0.5;

however, the result shows that p-value equals to 0.001, which means that "f_mean" is not gonna happen in this hypothesis. Hence we reject the hypothesis.