Measures of Scale

本文探讨了衡量数据集变化性的几种方法,包括方差、标准差、范围、平均绝对偏差和中位数绝对偏差。这些度量关注数据中心附近和尾部的分散程度,并在正常、双指数、卡方和图基lambda分布等不同分布下进行了比较。标准差受极端值影响较大,而中位数绝对偏差和四分位距在存在极端值时提供更稳定的分散度量。
摘要由CSDN通过智能技术生成

https://www.itl.nist.gov/div898/handbook/eda/section3/eda356.htm

Scale, Variability, or Spread A fundamental task in many statistical analyses is to characterize the spread, or variability, of a data set. Measures of scale are simply attempts to estimate this variability.
When assessing the variability of a data set, there are two key components:

How spread out are the data values near the center?
How spread out are the tails?
Different numerical summaries will give different weight to these two elements. The choice of scale estimator is often driven by which of these components you want to emphasize.
The histogram is an effective graphical technique for showing both of these components of the spread.

Definitions of Variability For univariate data, there are several common numerical measures of the spread:
variance - the variance is defined as
s2=∑Ni=1(Yi−Y¯)2/(N−1)
where Y¯ is the mean of the data.

The variance is roughly the arithmetic average of the squared distance from the mean. Squaring the distance from the mean has the effect of giving greater weight to values that are further from the mean. For example, a point 2 units from the mean adds 4 to the above sum while a point 10 units from the mean adds 100 to the sum. Although the variance is intended to be an overall measure of spread, it can be greatly affected by the tail behavior.

standard deviation - the standard deviation is the square root of the variance. That is,
s=∑Ni=1(Yi−Y¯)2/(N−1)−−−−−−−−−−−−−−−−−−−√
The standard deviation restores the units of the spread to the original data units (the variance squares the units).

range - the range is the largest value minus the smallest value in a data set. Note that this measure is based only on the lowest and highest extreme values in the sample. The spread near the center of the data is not captured at all

评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值