Relearning Statistics, Chapter 3. Descriptive Statistics: Numerical Measures

What does this chapter cover?

1. Some simple summary statistics: Mean, Median, Mode, Percentiles, Variance, Standard Deviation

2. Numerical measures that describe the shape of a distribution: z-score, skewness

3. Finally, measures of association between two variables: Covariance and Correlation Coefficient.

All three topics lay the groundwork for later chapters.


If the measures are computed for data from a sample, they are called sample statistics.

If the measures are computed for data from a population, they are called population parameters.


3.1 Measures of Location

Sample Mean: $\bar{x} = \dfrac{\sum x_i}{n}$, where $n$ is the number of observations in the sample.

Population Mean: $\mu = \dfrac{\sum x_i}{N}$, where $N$ is the number of observations in the population.
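As a small illustration, the mean is the sum of the observations divided by how many there are. The sketch below is only a minimal example: the function name `sample_mean` and the five data values are made up for illustration. The population mean uses the same arithmetic, applied to every member of the population.

```python
def sample_mean(values):
    """Return the sum of the observations divided by their count."""
    if not values:
        raise ValueError("the mean of empty data is undefined")
    return sum(values) / len(values)


# Hypothetical sample of five observations (illustrative numbers only).
data = [46, 54, 42, 46, 32]
print(sample_mean(data))  # 220 / 5 = 44.0
```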

Median: Arrange the data in ascending order (smallest value to largest value).

(a) For an odd number of observations, the median is the middle value. 

(b) For an even number of observations, the median is the average of the two middle values.
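The two cases above can be sketched in a few lines of Python. This is a minimal sketch: the function name `median` and the sample values are made up for illustration.

```python
def median(values):
    """Median following the odd/even rule described above."""
    if not values:
        raise ValueError("the median of empty data is undefined")
    ordered = sorted(values)  # arrange in ascending order
    n = len(ordered)
    mid = n // 2
    if n % 2 == 1:
        # (a) odd number of observations: the middle value
        return ordered[mid]
    # (b) even number of observations: average of the two middle values
    return (ordered[mid - 1] + ordered[mid]) / 2


print(median([7, 1, 5]))      # sorted: 1, 5, 7 -> 5
print(median([10, 4, 8, 2]))  # sorted: 2, 4, 8, 10 -> (4 + 8) / 2 = 6.0
```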

Mode: The mode is the value that occurs with the greatest frequency.
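One way to find the mode is to count how often each value occurs and keep the most frequent one. The sketch below assumes that ties should all be reported, so it returns a list. The helper name `modes` and the data are hypothetical.

```python
from collections import Counter


def modes(values):
    """Return every value that occurs with the greatest frequency."""
    counts = Counter(values)
    if not counts:
        return []
    top = max(counts.values())
    # Keep all values tied for the highest count, in first-seen order.
    return [value for value, count in counts.items() if count == top]


print(modes([46, 54, 42, 46, 32]))  # [46]
print(modes([1, 1, 2, 2, 3]))       # [1, 2] (two values tie for most frequent)
```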

Percentiles: The pth percentile is a value such that at least p percent of the observations are less than or equal to this value and at least (100 - p) percent of the observations are greater than or equal to this value.
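To make the definition concrete, the sketch below checks whether a candidate value meets both conditions for a given p. The function name `satisfies_percentile_definition` and the data are made up for illustration. The comparison uses integer arithmetic (counts times 100 against p times n), which avoids floating-point rounding when p is a whole number.

```python
def satisfies_percentile_definition(values, candidate, p):
    """Check both conditions in the definition of the pth percentile."""
    n = len(values)
    at_or_below = sum(1 for v in values if v <= candidate)
    at_or_above = sum(1 for v in values if v >= candidate)
    # At least p percent are <= candidate, and at least (100 - p) percent are >= candidate.
    return at_or_below * 100 >= p * n and at_or_above * 100 >= (100 - p) * n


data = [1, 2, 3, 4, 5]
print(satisfies_percentile_definition(data, 3, 50))  # True
print(satisfies_percentile_definition(data, 5, 50))  # False
```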

How do we compute a percentile?

Step 1. Arrange the data in ascending order (smallest value to largest value). 

Step 2. Compute an index i = (p/100) × n, where p is the percentile of interest and n is the number of observations.

Step 3. (a) If i is not an integer, round up. The next integer greater than i denotes the position of the pth percentile. 

(b) If i is an integer, the pth percentile is the average of the values in positions i and i+1.

In short: if i is not an integer, round it up, and the value in that position is the pth percentile. If i is an integer, the pth percentile is the average of the values in positions i and i+1.
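The steps above can be sketched in Python as follows. This is a minimal sketch under stated assumptions: the function name `percentile` and the twelve data values are made up, and p is assumed to be a whole number strictly between 0 and 100 so that positions i and i+1 always exist. The index is computed with `divmod` so that the integer test in Step 3 is exact rather than subject to floating-point rounding.

```python
def percentile(values, p):
    """pth percentile using the three-step procedure described above."""
    if not values:
        raise ValueError("the percentile of empty data is undefined")
    # Assumption for this sketch: p is an integer with 0 < p < 100.
    if not isinstance(p, int) or not 0 < p < 100:
        raise ValueError("p must be an integer strictly between 0 and 100")

    ordered = sorted(values)  # Step 1: ascending order
    n = len(ordered)

    # Step 2: i = (p / 100) * n, kept exact as quotient and remainder.
    quotient, remainder = divmod(p * n, 100)

    if remainder != 0:
        # Step 3(a): i is not an integer, so round up to the next position.
        position = quotient + 1
        return ordered[position - 1]  # positions are 1-based, lists are 0-based

    # Step 3(b): i is an integer, so average positions i and i + 1.
    return (ordered[quotient - 1] + ordered[quotient]) / 2


# Hypothetical data set with n = 12 observations.
data = [12, 3, 7, 9, 1, 15, 20, 5, 18, 11, 14, 6]
print(percentile(data, 85))  # i = 10.2 -> position 11 -> 18
print(percentile(data, 50))  # i = 6 -> average of positions 6 and 7 -> 10.0
```

Note that statistical libraries often use interpolation rules that differ from this textbook procedure, so their results may not match exactly.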
