02-ECDF and Histogram

Difference between ECDF(Empirical Cumulative Distribution Function) and CDF:

The empirical CDF is built from an actual data set . The CDF is a theoretical construct.

Let  X  be a random variable.

  • The cumulative distribution function  F(x) gives the  P(Xx) .
  • An empirical CDF function  G(x) gives the  P(Xx)  in your actual sample.

The distinction is which probability measure is used. For the ECDF, you use the probability measure implicitly defined by the frequency counts in your sample.

Simple example (coin flip):

Let  X X be a random variable denoting the result of a single coin flip where  X=1 denotes heads and  X=0  denotes tails.

The CDF for a fair coin is given by:

F(x)=0121for x<0for 0x<1for 1x

If you flipped 2 heads and 1 tail, the empirical CDF would be:

G(x)=0231for x<0for 0x<1for 1x

The empirical CDF would reflect that  2/3  of your flips were heads.


Why ECDF is useful?

  • 1st step for data visualization. Usually along with Histogram; 

enter image description here

  • Help us to define the distribution of dataset;




  • 0
    点赞
  • 0
    收藏
    觉得还不错? 一键收藏
  • 0
    评论
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值