CENTRAL LIMIT THEOREM

Derivation on Central Limit Theorem

Maths

Here comes something really spooky.

Suppose a random signal is created by generating a bunch of random numbers in between 0 and 1 and joining them by straight lines, then we can have its cartoon shown in Figure 1 (a). As we take convolutions on the random signal with itself iteratively, something spooky really happens that we get a bell-shaped curve just after three times convolutions.


Figure 1. (a) shows the initial random signal. It turns to (b) after one convolution. It turns to (c) after two convolutions. It turns to (d) after three convolutions.

As a matter of fact, after taking infinite many times of convolutions, we shall get a Gaussian distribution, as shown in Figure 2.


Figure 2

Problem is, how could this spookiness possibly happen?


Central Limit Theorem

The answer lies in Central Limit Theorem (CLT) manipulating this entire magical and charming math game,

Given a sequence of random variables which are iid (independent identically distributed), {X1, …,Xn}, and,


for any real number x,



Digression

What I want to do in this work is to derive CLT. Before the derivation, we need to come to a digression on convolution.

Suppose X1 andX2 are independent random variables with distributionsp1(x1) and p2(x2), then what is the distribution ofZ = X1 + X2?

For any value of z,


With change of variables, u =x1x1+ x=v,


Thus, the distribution of Z =X1+X2 is,


More generally, the distribution of X+ … +Xn is p1*…*pn.


Derivation on CLT

Now let us return to the derivation on CLT.

With normalization to variables, we can simply take the following assumption,


Let Sn denote the sum ofn items, i.e. Sn = X1+ … +Xn, then the mean of Sn is 0 and the standard deviation is sqrt(n).

After further normalization to Sn,


then, the mean ofYnis 0 and the standard deviation is 1.

Let pn(x) denote the distribution of Yn, we will see,


Since X1, …,Xn are iid, we have, p(x) = p1(x1) = … = pn(xn).

Therefore, the distribution of X1 + …+Xn is,


Let yixi / sqrt(n), then,


Thus, the distribution of Yn is,


Now take Fourier transform of pn(x) in order to turn convolution into multiplication for an easier derivation,


where


Take inverse Fourier transform,


Therefore, in conclusion,


Now the spookiness can be easily explained by CLT.

CLT indicates that, no matter what distribution Xi’s conform to, as long as {Xi,i = 1, …,n} are iid and n is sufficiently large, we come to the approximation that,


i.e.,



DeMoivre-Laplace Theorem

In particular, when Xi’s conform to B(1, p), we will see DeMoivre-Laplace Theorem derived from CLT,


where nA is the number of times for which Event A happens in an-time Bernoulli experiment.


Example

Suppose you are attending a CET-4, 6 exam today, there are 85 choice questions. Each has four choices with only one to be correct. Those who want to pass the exam must give right answers to more than 51 questions. Question is, what is the probability of one passing the exam purely by luck?

Solution:

Define random variables, (i= 1, …, 85)


are iid. Further,


Using DeMoivre-Laplace Theorem yields,


This indicates that, improvement on English ability is the only way to pass a CET-4,6 exam.


Acknowledgement

The whole discussion above benefits highly from Professor Osgood’s course on Fourier transform and its application of Stanford University. The cartoons are provided by Junjie WANG, who is an undergraduate in BJTU.

  • 0
    点赞
  • 0
    收藏
    觉得还不错? 一键收藏
  • 0
    评论
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值