Statistics -- Inferential Statistics: Confidence Intervals

最新推荐文章于 2019-12-30 12:46:12 发布

CHNMSCS

最新推荐文章于 2019-12-30 12:46:12 发布

阅读量334

点赞数

分类专栏： Data Science Bootcamp

本文链接：https://blog.csdn.net/BSCHN123/article/details/103646194

版权

Data Science Bootcamp 专栏收录该内容

32 篇文章 1 订阅

订阅专栏

                    
                        
                    
                    Confidence interval is much more accurate representation of reality.
The level of confidence. It is denoted by one minus Alpha and is called the confidence level of the interval. Alpha is a value between 0 and 1.
The formula for all confidence intervals is from the point estimate minus the reliability factor times the standard error to the point estimate plus the reliability factor times the standard error. [Point Estimate - Reliability Factorstandard error, Point Estimate + ReliabilityStandard Error] The point estiamte is the X bar.
How is the a confidence interval related to a point estimate? – The point estimate is the midpoint of the interval.
A confidence interval is the range within which you expect the population parameter to be and its estimation is based on the data we have in our sample. There can be two main situations when we calculate the confidence intervals for a population when the population barrier is known and when it is unknown depending on which situation we are in we would use a different calculation method.
If we know that a variable is normally distributed we are basically making the statement that the majority of observations will be around the mean and the rest far away from it.
When our confidence is lower the confidence interval itself is smaller. For a 99% confidence interval we would have a higher confidence but a much larger confidence interval.
Student’s T Distribution allows inference through small samples and with an unknown population variance.
Z-statistic is related to the standard normal distribution. t-statistic is related to the Student’s distribution.
In essence the bigger the sample the closer we get to the actual numbers a common rule of thumb is that for a sample containing more than 50 observations we use the Z table instead of the t table.
The Student’s T distribution approximates the Normal distribution but has fatter tails. This means the probability of values being far away from the mean is bigger. For big enough samples, the Student’s T distribution coincides with the Normal distribution.
Population variance is unknown, the sample size is small => Student’s T distribution.
When population variance is unknown, sample standard deviation goes with the t statistic. When population variance is known, population standard deviation goes with the Z statistic.
A higher level of competence increases the statistic a higher statistic means a high margin of error.
Bigger margin of error => wider confidence interval.
Smaller margin of error => narrower confidence interval
A lower standard deviation means that the data set is more concentrated around the mean.
Higher sample sizes will decrease the margin of error. This is also quite intuitive.
The more observations there are in the sample, the higher the chances of getting a good idea about the true mean of the entire population.
A higher statistics increases the margin of error. A higher standard deviation inceases the margin of error. A higher sample size decreases the margin of error.
Confidence intervals for dependent samples. And statistical methods like regressions.
Dependent samples: this if often used when developing medicine. In biology, normality is so often observed that we assume that such variables are normally distributed.

                

CHNMSCS

关注

0
点赞
踩
0

收藏

觉得还不错? 一键收藏
0
评论
Statistics -- Inferential Statistics: Confidence Intervals

Confidence interval is much more accurate representation of reality.The level of confidence. It is denoted by one minus Alpha and is called the confidence level of the interval. Alpha is a value bet...
复制链接

扫一扫

专栏目录