STATISTICS DISTRIBUTION OF NORMAL POPULATION

STATISTICS DISTRIBUTION OF NORMAL POPULATION

Maths

FOR MUM AND MUM'S MUM

In probability and statistics, a statistic is a function of samples. Theoretically, it is also a random variable with a probability distribution, called statistics distribution. As a statistic is a basis of inference of population distribution and characteristics, to determine a distribution of a statistic is one of the most fundamental problems in statistics. Generally speaking, it is rather complicate to determine a precise distribution of a statistic. But, for normal population, there are a few effective methods. This work will introduce some common statistics distributions of normal population and derive some important related theorems.

 

Digression

Let’s consider the probability distribution of =X2, where X ~ N(0, 1). The probability density of Y is given by


Therefore, ~ Gamma(1/2, 1/2), where


By using Gamma function (See Appendix I), one can determine the expectation and variance of Y, E(Y) =alpha/lambda, Var(Y) = alpha/lambda2.

The characteristic function of Gamma probability density is given by


where ~ Gamma(alphalambda).

Consider the distribution of a summation, Z, of two independent random variables, X ~ Gamma(alpha1, lambda) and Y ~ Gamma(alpha2,lambda). The characteristic function of Z is given by


Thus, ~ Gamma(alpha1+alpha2, lambda).

 

ChiDistribution

Suppose a population X~N(0, 1), X1, …, Xn are iid, then


It is easy to check that E(Y)=n, D(Y)=2n.

Two important properties of Chidistribution:


Proof:

The first one is trivial.

To see the second one, let Yi = 2(lambda)Xi, then


Therefore,


This completes the proof. #


t-Distribution

Suppose XN(0,1),Y ~Chi2(n), and X and Y are independent, then


of which the probability density is given by


One can check that, when n>2, E(T)=0, D(T)=n/(n-2).


F-Distribution

Suppose X Chi2(m),~ Chi2(n), and X andY are independent, then


of which the probability density is given by



Theorem 1

Suppose X1, …, Xn are samples drawn from a normal population N(musigma2), with a sample mean and a sample variance,


then, (1)(2)(3)(4)


Proof:

Formula (1) can be easily checked by performing Gaussian normalization.

For (2) and (3),


where


Now construct a finite-dimensional linear transformation A from Y to Z,


One can show that A is an orthonormal matrix. Further,


Therefore, Zi’s are also iid ~ N(0, 1).

Consequently, U and V are independent, where


Since VChi2(n-1), then


To see (4), now that U’ and V’ are independent, satisfying


by the definition of t-distribution, we have


This completes the proof. #


Theorem 2

Suppose X1, …, Xm andY1,…,Yn are samples taken from two normal populations N(mu1,sigma2),N(mu2sigma2), respectively, and they are all independent, then,


Hint: This can be shown by applying Theorem 1.


Theorem 3

Suppose X1, …, Xm andY1, …,Yn are samples taken from two normal populations N(mu1,sigma12),N(mu2, sigma22), respectively, and they are all independent, then,


Hint: This can be shown by using the definition of F-distribution.


APPENDIX

I. Gamma Function

Gamma function is defined as


which has following properties


Proof:


This completes the proof. #


II. Probability Density after Transformation

Suppose random vectors XY are in Rand a bijective operator T is defined as


If the probability density of X is pX(x), then that of Y is given by


Proof:

The cumulative probability function of Y is given by


Hence,


This completes the proof. #

This theorem can be used to determine the probability density typical of the form “Z=X/Y”, “Z=X+Y”, etc., by introducing an irrelevant variable and integrating the joint distribution of (ZU)to get the marginal distribution of Z, as long as T is bijective.

 

  • 0
    点赞
  • 0
    收藏
    觉得还不错? 一键收藏
  • 0
    评论
Make sure that we grade your HW based solely on your R code script. If we don’t see the correct results when we run your code, you will get 0 point for those questions. 1. Create a R function to show the central limit theorem. This function should have the following properties: - In the argument of the function, you have an option to consider poisson, exponential, uniform, normal distributions as the population distribution. - Depending on the choice of the population distribution in part (1), the function will receive extra argument(s) for the parameters of the distribution. For example, if a normal distri- bution is chosen, the mean and SD are needed in the function argument. Note that each distribution has a different parameter setting. - If the distribution is not selected from (“Normal”, “Poisson”, “Uniform”, “Exponential”), the function needs to print the following error message: check the distributional setting: consider ("Normal", "Poisson", "Uniform", "Exponential") and stop. - The function should give the summary statistics (minimum, 1st quartile, median, mean, 3rd quartile, maximum) of 1, 000 sample mean values for given n values (n = 10, 50, 100, 500). - The result should have the following statement at the beginning, for example, if a normal distribution with mean 1 and SD 0.5 was chosen: ‘‘For the Normal distribution, the central limit theorem is tested’’ where the term “Normal” is automatically inserted in the statement based on the argument. And the output should have the following form: For the Normal distribution, the central limit theorem is tested When n=10: Min. 1st Qu. Median Mean 3rd Qu. Max. 0.5187 0.8930 1.0016 0.9993 1.1019 1.4532 When n=50: Min. 1st Qu. Median Mean 3rd Qu. Max. 0.7964 0.9508 1.0010 0.9997 1.0493 1.2309 1 When n=100: Min. 1st Qu. Median Mean 3rd Qu. Max. 0.8534 0.9679 0.9972 0.9992 1.0325 1.1711 When n=500: Min. 1st Qu. Median Mean 3rd Qu. Max. 0.9258 0.9836 1.0006 0.9997 1.0154 1.0678 I Using your own function, test the N(−1,0.52) and the Unif(−3,6) case.
06-05
SQLAlchemy 是一个 SQL 工具包和对象关系映射(ORM)库,用于 Python 编程语言。它提供了一个高级的 SQL 工具和对象关系映射工具,允许开发者以 Python 类和对象的形式操作数据库,而无需编写大量的 SQL 语句。SQLAlchemy 建立在 DBAPI 之上,支持多种数据库后端,如 SQLite, MySQL, PostgreSQL 等。 SQLAlchemy 的核心功能: 对象关系映射(ORM): SQLAlchemy 允许开发者使用 Python 类来表示数据库表,使用类的实例表示表中的行。 开发者可以定义类之间的关系(如一对多、多对多),SQLAlchemy 会自动处理这些关系在数据库中的映射。 通过 ORM,开发者可以像操作 Python 对象一样操作数据库,这大大简化了数据库操作的复杂性。 表达式语言: SQLAlchemy 提供了一个丰富的 SQL 表达式语言,允许开发者以 Python 表达式的方式编写复杂的 SQL 查询。 表达式语言提供了对 SQL 语句的灵活控制,同时保持了代码的可读性和可维护性。 数据库引擎和连接池: SQLAlchemy 支持多种数据库后端,并且为每种后端提供了对应的数据库引擎。 它还提供了连接池管理功能,以优化数据库连接的创建、使用和释放。 会话管理: SQLAlchemy 使用会话(Session)来管理对象的持久化状态。 会话提供了一个工作单元(unit of work)和身份映射(identity map)的概念,使得对象的状态管理和查询更加高效。 事件系统: SQLAlchemy 提供了一个事件系统,允许开发者在 ORM 的各个生命周期阶段插入自定义的钩子函数。 这使得开发者可以在对象加载、修改、删除等操作时执行额外的逻辑。
SQLAlchemy 是一个 SQL 工具包和对象关系映射(ORM)库,用于 Python 编程语言。它提供了一个高级的 SQL 工具和对象关系映射工具,允许开发者以 Python 类和对象的形式操作数据库,而无需编写大量的 SQL 语句。SQLAlchemy 建立在 DBAPI 之上,支持多种数据库后端,如 SQLite, MySQL, PostgreSQL 等。 SQLAlchemy 的核心功能: 对象关系映射(ORM): SQLAlchemy 允许开发者使用 Python 类来表示数据库表,使用类的实例表示表中的行。 开发者可以定义类之间的关系(如一对多、多对多),SQLAlchemy 会自动处理这些关系在数据库中的映射。 通过 ORM,开发者可以像操作 Python 对象一样操作数据库,这大大简化了数据库操作的复杂性。 表达式语言: SQLAlchemy 提供了一个丰富的 SQL 表达式语言,允许开发者以 Python 表达式的方式编写复杂的 SQL 查询。 表达式语言提供了对 SQL 语句的灵活控制,同时保持了代码的可读性和可维护性。 数据库引擎和连接池: SQLAlchemy 支持多种数据库后端,并且为每种后端提供了对应的数据库引擎。 它还提供了连接池管理功能,以优化数据库连接的创建、使用和释放。 会话管理: SQLAlchemy 使用会话(Session)来管理对象的持久化状态。 会话提供了一个工作单元(unit of work)和身份映射(identity map)的概念,使得对象的状态管理和查询更加高效。 事件系统: SQLAlchemy 提供了一个事件系统,允许开发者在 ORM 的各个生命周期阶段插入自定义的钩子函数。 这使得开发者可以在对象加载、修改、删除等操作时执行额外的逻辑。
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值