1. Covariance and Correlation
Let $x_{i}$ and $x_{j}$ be two real random variables in a random vector $x=[x_{1},\cdots,x_{N}]^{T}$.
The mean and variance of a variable $x_{i}$, and the covariance and correlation coefficient (normalized correlation) between two variables $x_{i}$ and $x_{j}$, are defined below:
- Mean of $x_{i}$: $\mu_{i}=E(x_{i})$
- Variance of $x_{i}$: $\sigma_{i}^{2}=E[(x_{i}-\mu_{i})^{2}]=E(x_{i}^{2})-\mu_{i}^{2}$
- Covariance of $x_{i}$ and $x_{j}$: $\sigma_{ij}^{2}=E[(x_{i}-\mu_{i})(x_{j}-\mu_{j})]=E(x_{i}x_{j})-\mu_{i}\mu_{j}$
- Correlation coefficient between $x_{i}$ and $x_{j}$: $r_{ij}=\dfrac{\sigma_{ij}^{2}}{\sqrt{\sigma_{i}^{2}\sigma_{j}^{2}}}=\dfrac{\sigma_{ij}^{2}}{\sigma_{i}\sigma_{j}}$
Note that the correlation coefficient $r_{ij}$ can be considered as the normalized covariance $\sigma_{ij}^{2}$.
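A minimal numerical sketch of these definitions, using made-up sample values (not data from this text) and NumPy's `np.cov`/`np.corrcoef` as a cross-check; note that `bias=True` selects the same $1/N$ normalization used here:

```python
import numpy as np

# Hypothetical paired samples of x_i and x_j (illustrative values only).
xi = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
xj = np.array([1.1, 1.9, 3.2, 3.9, 5.1])

# Sample versions of the definitions above (1/N normalization).
mu_i, mu_j = xi.mean(), xj.mean()                 # means
var_i = np.mean((xi - mu_i) ** 2)                 # variance of x_i
var_j = np.mean((xj - mu_j) ** 2)                 # variance of x_j
cov_ij = np.mean((xi - mu_i) * (xj - mu_j))       # covariance
r_ij = cov_ij / np.sqrt(var_i * var_j)            # correlation coefficient

print(f"cov = {cov_ij:.4f}, r = {r_ij:.4f}")

# Cross-check against NumPy (bias=True uses the same 1/N normalization).
print(np.cov(xi, xj, bias=True)[0, 1], np.corrcoef(xi, xj)[0, 1])
```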
To obtain these parameters as expectations of first- and second-order functions of the random variables, the joint probability density function $p(x_{1},\cdots,x_{N})$ is required.
However, when it is not available, the parameters can still be estimated by averaging the outcomes of a random experiment involving these variables repeated $K$ times:
$$\hat{\mu}_{i}=\frac{1}{K}\sum_{k=1}^{K}x_{i}^{(k)},\qquad \hat{\sigma}_{ij}^{2}=\frac{1}{K}\sum_{k=1}^{K}\big(x_{i}^{(k)}-\hat{\mu}_{i}\big)\big(x_{j}^{(k)}-\hat{\mu}_{j}\big),$$
where $x_{i}^{(k)}$ denotes the outcome of $x_{i}$ in the $k$-th trial.
1.1 Examples
Assume the experiment concerning $x_{i}$ and $x_{j}$ is repeated $K=3$ times with the following outcomes:
From these outcomes we can compute the estimates $\hat{\mu}_{i}$, $\hat{\mu}_{j}$, $\hat{\sigma}_{ij}^{2}$, and $r_{ij}$ defined above (see the sketch below). We see that $x_{i}$ and $x_{j}$ are highly correlated.
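A minimal sketch of this computation, using hypothetical outcomes (not the original data) for the $K=3$ trials:

```python
import numpy as np

# Hypothetical outcomes of K = 3 repetitions (illustrative values only).
K = 3
xi = np.array([1.0, 2.0, 3.0])   # outcomes of x_i
xj = np.array([2.1, 3.9, 6.0])   # outcomes of x_j

# Estimate the moments by averaging over the K trials.
mu_i, mu_j = xi.sum() / K, xj.sum() / K
cov_ij = np.sum((xi - mu_i) * (xj - mu_j)) / K
var_i = np.sum((xi - mu_i) ** 2) / K
var_j = np.sum((xj - mu_j) ** 2) / K
r_ij = cov_ij / np.sqrt(var_i * var_j)

print(f"r_ij = {r_ij:.3f}")   # close to 1: x_i and x_j are highly correlated
```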
2. Unbiased Estimate
Definition: Let $X$ be a population whose distribution depends on an unknown parameter $\theta \in \Theta$ to be estimated, and let $X_{1},X_{2},\cdots,X_{n}$ be a sample from $X$. If the expectation of the estimator $\hat{\theta}=\hat{\theta}(X_{1},X_{2},\cdots,X_{n})$ exists and the equation $E(\hat{\theta})=\theta$ holds for every $\theta \in \Theta$, then $\hat{\theta}$ is called an unbiased estimator of $\theta$.
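For example, the sample mean $\bar{X}=\frac{1}{n}\sum_{i=1}^{n}X_{i}$ is an unbiased estimator of the population mean $\mu$, since
$$E(\bar{X})=\frac{1}{n}\sum_{i=1}^{n}E(X_{i})=\frac{1}{n}\cdot n\mu=\mu.$$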
Example 1: Let $\mu$ and $\sigma^{2}$ be the mean and variance of $X$; both are unknown. Then the estimator of $\sigma^{2}$
$$\hat{\sigma}^{2}=\frac{1}{n}\sum_{i=1}^{n}(X_{i}-\bar{X})^{2}$$
is a biased estimator.
Proof: Since
$$\hat{\sigma}^{2}=\frac{1}{n}\sum_{i=1}^{n}(X_{i}-\bar{X})^{2}=\frac{1}{n}\sum_{i=1}^{n}X_{i}^{2}-2\bar{X}\cdot\frac{1}{n}\sum_{i=1}^{n}X_{i}+\bar{X}^{2}=\frac{1}{n}\sum_{i=1}^{n}X_{i}^{2}-\bar{X}^{2},$$
we have
$$E(\hat{\sigma}^{2})=E\Big(\frac{1}{n}\sum_{i=1}^{n}X_{i}^{2}\Big)-E(\bar{X}^{2})=\frac{1}{n}\sum_{i=1}^{n}E(X_{i}^{2})-E(\bar{X}^{2}),$$
and
$$E(X_{i}^{2})=\mathrm{var}(X_{i})+[E(X_{i})]^{2}=\sigma^{2}+\mu^{2},$$
$$E(\bar{X}^{2})=\mathrm{var}(\bar{X})+[E(\bar{X})]^{2}=\frac{\sigma^{2}}{n}+\mu^{2}$$
(using $\mathrm{var}(\bar{X})=\sigma^{2}/n$, since $\bar{X}$ is the average of $n$ independent observations). Then
$$E(\hat{\sigma}^{2})=\sigma^{2}+\mu^{2}-\Big(\frac{\sigma^{2}}{n}+\mu^{2}\Big)=\frac{n-1}{n}\sigma^{2}\neq\sigma^{2}.$$
So, $\hat{\sigma}^{2}$ is a biased estimator. If we use $\hat{\sigma}^{2}$ to estimate $\sigma^{2}$, the estimate will on average be less than the true value. (However, as the sample size $n \rightarrow \infty$, $\lim_{n\rightarrow\infty}[E(\hat{\sigma}^{2})-\sigma^{2}]=0$, so $\hat{\sigma}^{2}$ is called an asymptotically unbiased estimator.)
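To make the vanishing bias concrete, note that $E(\hat{\sigma}^{2})-\sigma^{2}=-\sigma^{2}/n$, which shrinks like $1/n$; a trivial check with an arbitrary illustrative value $\sigma^{2}=4$:

```python
# The bias E(sigma-hat^2) - sigma^2 = -sigma^2/n shrinks like 1/n
# (sigma^2 = 4 is an arbitrary illustrative value).
sigma2 = 4.0
for n in (5, 10, 100, 1000):
    print(n, (n - 1) / n * sigma2 - sigma2)  # -0.8, -0.4, -0.04, -0.004
```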
For the sample variance,
$$S^{2}=\frac{1}{n-1}\sum_{i=1}^{n}(X_{i}-\bar{X})^{2}=\frac{n}{n-1}\hat{\sigma}^{2},$$
$$E(S^{2})=\frac{n}{n-1}E(\hat{\sigma}^{2})=\frac{n}{n-1}\cdot\frac{n-1}{n}\sigma^{2}=\sigma^{2}.$$
That is to say, the sample variance $S^{2}$ is an unbiased estimator of $\sigma^{2}$. Thus we usually use $S^{2}$ as the estimator of $\sigma^{2}$.
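A Monte Carlo sketch can confirm this numerically; the normal distribution, seed, and sizes below are arbitrary illustrative choices. Averaging $\hat{\sigma}^{2}$ over many samples approaches $\frac{n-1}{n}\sigma^{2}$, while averaging $S^{2}$ approaches $\sigma^{2}$:

```python
import numpy as np

# Monte Carlo sketch: average both estimators over many samples and compare
# with the theory above. Distribution, seed, and sizes are arbitrary choices.
rng = np.random.default_rng(0)
sigma2, n, trials = 4.0, 10, 200_000

samples = rng.normal(loc=1.0, scale=np.sqrt(sigma2), size=(trials, n))
biased = samples.var(axis=1, ddof=0)    # sigma-hat^2: 1/n normalization
unbiased = samples.var(axis=1, ddof=1)  # S^2: 1/(n-1) normalization

print(biased.mean())    # close to (n-1)/n * sigma^2 = 3.6
print(unbiased.mean())  # close to sigma^2 = 4.0
```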