Q1: Consider the linear model
$$y_i=\beta_0+\beta_1 x_i+\epsilon_i,\qquad \epsilon_i\stackrel{iid}{\sim} N(0,\sigma^2),\qquad i=1,\dots,n.$$
- Derive the maximum likelihood estimators (MLE) for $\beta_0,\beta_1$. Are they consistent with the least squares estimators (LSE)?
- Derive the MLE for $\sigma^2$ and examine its unbiasedness.
- A very slippery point is whether to treat the $x_i$ as fixed numbers or as random variables. In class, we treated the predictors $x_i$ as fixed numbers for the sake of convenience. Now suppose that the predictors $x_i$ are iid random variables (independent of $\epsilon_i$) with density $f_X(x;\theta)$ for some parameter $\theta$. Write down the likelihood function for all of our data $(x_i,y_i),\ i=1,\dots,n$. Derive the MLE for $\beta_0,\beta_1$ and see whether the MLE changes in the setting of random predictors.
Solution: Note that the $y_i\sim N(\beta_0+\beta_1x_i,\sigma^2)$ are independent, so the likelihood function is
$$L(\beta_0,\beta_1,\sigma^2)=\prod_{i=1}^n\frac{1}{\sqrt{2\pi}\sigma}e^{-\frac{(y_i-\beta_0-\beta_1x_i)^2}{2\sigma^2}}=(2\pi\sigma^2)^{-n/2}e^{-\frac{Q(\beta_0,\beta_1)}{2\sigma^2}},$$
where $Q(\beta_0,\beta_1)=\sum_{i=1}^n(y_i-\beta_0-\beta_1x_i)^2$. For any given $\sigma^2$, maximizing $L(\beta_0,\beta_1,\sigma^2)$ requires minimizing $Q(\beta_0,\beta_1)$, which is exactly the least squares criterion. Therefore the MLEs coincide with the LSEs.
That is,
$$\hat\beta_1=\frac{\ell_{xy}}{\ell_{xx}}=\frac{\sum_{i=1}^n(y_i-\bar{y})(x_i-\bar{x})}{\sum_{i=1}^n(x_i-\bar{x})^2},\qquad \hat\beta_0=\bar{y}-\hat\beta_1\bar{x}.$$
Next we choose $\sigma^2$ to maximize
$$L(\hat\beta_0,\hat\beta_1,\sigma^2)=(2\pi\sigma^2)^{-n/2}e^{-\frac{Q(\hat\beta_0,\hat\beta_1)}{2\sigma^2}}.$$
Setting $\frac{\partial}{\partial\sigma^2}\log L=-\frac{n}{2\sigma^2}+\frac{Q(\hat\beta_0,\hat\beta_1)}{2\sigma^4}=0$ gives
$$\hat\sigma_{MLE}^2=\frac{Q(\hat\beta_0,\hat\beta_1)}{n}=\frac{S_e^2}{n}.$$
We have already shown that $E[S_e^2]=(n-2)\sigma^2$, so $E[\hat\sigma_{MLE}^2]=\frac{n-2}{n}\sigma^2$; hence $\hat\sigma_{MLE}^2$ is not an unbiased estimator of $\sigma^2$.
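Both conclusions are easy to check numerically. Below is a minimal Monte Carlo sketch (with hypothetical parameter values, assuming `numpy` is available) that compares the closed-form $\hat\beta_0,\hat\beta_1$ with a generic least-squares fit and estimates $E[\hat\sigma^2_{MLE}]$ by simulation:

```python
import numpy as np

rng = np.random.default_rng(0)
n, beta0, beta1, sigma2 = 50, 1.0, 2.0, 4.0   # hypothetical true values
x = np.linspace(0, 10, n)                     # fixed design points

def fit(y):
    # closed-form LSE/MLE from the derivation above
    b1 = np.sum((x - x.mean()) * (y - y.mean())) / np.sum((x - x.mean()) ** 2)
    b0 = y.mean() - b1 * x.mean()
    return b0, b1

y = beta0 + beta1 * x + rng.normal(0.0, np.sqrt(sigma2), n)
print(fit(y))                # matches the generic least-squares fit below
print(np.polyfit(x, y, 1))   # returns (slope, intercept)

# Monte Carlo estimate of E[sigma^2_MLE]
reps = 20000
mle = np.empty(reps)
for r in range(reps):
    y = beta0 + beta1 * x + rng.normal(0.0, np.sqrt(sigma2), n)
    b0, b1 = fit(y)
    mle[r] = np.sum((y - b0 - b1 * x) ** 2) / n   # S_e^2 / n
print(mle.mean(), (n - 2) / n * sigma2)           # both ~ 3.84, showing the bias
```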
If the $x_i$ are random variables with density $f_X(x;\theta)$, then the likelihood function of $(x_i,y_i)$ is
$$\begin{aligned} L(\beta_0,\beta_1,\sigma^2,\theta)&=\prod_{i=1}^nf_X(x_i;\theta)f(y_i\mid x_i)\\ &=\prod_{i=1}^n\Big[f_X(x_i;\theta)\frac{1}{\sqrt{2\pi}\sigma}e^{-\frac{(y_i-\beta_0-\beta_1x_i)^2}{2\sigma^2}}\Big]\\ &=(2\pi\sigma^2)^{-n/2}e^{-\frac{Q(\beta_0,\beta_1)}{2\sigma^2}}\prod_{i=1}^nf_X(x_i;\theta). \end{aligned}$$
For fixed $\sigma^2$ and $\theta$, maximizing $L(\beta_0,\beta_1,\sigma^2,\theta)$ again amounts to minimizing $Q(\beta_0,\beta_1)$, since the factor $\prod_{i=1}^nf_X(x_i;\theta)$ does not involve $\beta_0,\beta_1$. Hence the MLE does not change.
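The factorization can also be verified numerically: maximizing the joint likelihood over all parameters returns, up to optimizer tolerance, the same $\hat\beta_0,\hat\beta_1$ as the fixed-design formulas. A sketch, assuming `scipy` is available and taking $f_X$ to be a hypothetical $N(\mu,\tau^2)$ density:

```python
import numpy as np
from scipy.optimize import minimize
from scipy.stats import norm

rng = np.random.default_rng(1)
n = 200
x = rng.normal(5.0, 2.0, n)                  # random predictors, f_X = N(mu, tau^2)
y = 1.0 + 2.0 * x + rng.normal(0.0, 1.5, n)

def neg_loglik(p):
    b0, b1, log_s, mu, log_t = p             # log-parametrize scales to keep them positive
    # joint log-likelihood: the x-part involves only (mu, tau), the y|x-part only (b0, b1, sigma)
    return -(norm.logpdf(x, mu, np.exp(log_t)).sum()
             + norm.logpdf(y, b0 + b1 * x, np.exp(log_s)).sum())

start = [0.0, 1.0, np.log(y.std()), x.mean(), np.log(x.std())]
res = minimize(neg_loglik, start, method="Nelder-Mead",
               options={"maxiter": 10000, "xatol": 1e-8, "fatol": 1e-8})

# fixed-design closed form for comparison
b1_ls = np.sum((x - x.mean()) * (y - y.mean())) / np.sum((x - x.mean()) ** 2)
b0_ls = y.mean() - b1_ls * x.mean()
print(res.x[:2], (b0_ls, b1_ls))   # agree up to optimizer tolerance
```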
Q2: Consider the linear model without intercept
$$y_i = \beta x_i+\epsilon_i,\qquad i=1,\dots,n,$$
where the $\epsilon_i$ are independent with $E[\epsilon_i]=0$ and $Var[\epsilon_i]=\sigma^2$.
- Write down the least squares estimator $\hat\beta$ for $\beta$, and derive an unbiased estimator for $\sigma^2$.
- For fixed $x_0$, let $\hat{y}_0=\hat\beta x_0$. Work out $Var[\hat{y}_0]$.
Solution: Let $Q(\beta)=\sum_{i=1}^n(y_i-\beta x_i)^2$. The minimizer $\hat{\beta}$ satisfies
$$\frac{\partial Q}{\partial\beta}=-2\sum_{i=1}^n(y_i-\beta x_i)x_i=0,$$
which gives the least squares estimator:
$$\hat{\beta}=\frac{\sum_{i=1}^nx_iy_i}{\sum_{i=1}^nx_i^2}.$$
Write $T=\sum_{i=1}^n x_iy_i$ and note that $\hat\beta\sum_{i=1}^n x_i^2=T$, so that $E[T]=\beta\sum_{i=1}^n x_i^2$ and, by independence, $Var[T]=\sigma^2\sum_{i=1}^n x_i^2$. Then
$$E[Q(\hat\beta)]=E\Big[\sum_{i=1}^ny_i^2+\hat\beta^2\sum_{i=1}^nx_i^2-2\hat\beta\sum_{i=1}^nx_iy_i\Big]=\sum_{i=1}^n\{Var[y_i]+(E[y_i])^2\}-\frac{E[T^2]}{\sum_{i=1}^nx_i^2},$$
and since $E[T^2]=Var[T]+(E[T])^2=\sigma^2\sum_{i=1}^nx_i^2+\beta^2\big(\sum_{i=1}^nx_i^2\big)^2$,
$$E[Q(\hat\beta)]=\sum_{i=1}^n(\sigma^2+\beta^2x_i^2)-\sigma^2-\beta^2\sum_{i=1}^nx_i^2=(n-1)\sigma^2.$$
Hence
$$\hat\sigma^2=\frac{Q(\hat\beta)}{n-1}$$
is an unbiased estimator of $\sigma^2$.
For the second part, since the $y_i$ are independent,
$$Var[\hat{y}_0]=x_0^2\,Var[\hat\beta]=x_0^2\cdot\frac{\sum_{i=1}^nx_i^2\,Var[y_i]}{\big(\sum_{i=1}^nx_i^2\big)^2}=\frac{\sigma^2x_0^2}{\sum_{i=1}^nx_i^2}.$$
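As a sanity check, here is a short Monte Carlo sketch (with hypothetical parameter values, assuming `numpy` is available) confirming that $Q(\hat\beta)/(n-1)$ averages to $\sigma^2$ and that the empirical variance of $\hat{y}_0$ matches $\sigma^2x_0^2/\sum_{i=1}^nx_i^2$:

```python
import numpy as np

rng = np.random.default_rng(2)
n, beta, sigma2, x0 = 30, 1.5, 2.0, 4.0   # hypothetical values
x = rng.uniform(1, 5, n)                  # fixed design, reused across replications

reps = 40000
s2_hat = np.empty(reps)
y0_hat = np.empty(reps)
for r in range(reps):
    # N(0, sigma^2) errors are just a convenient choice; the result needs only mean 0 and variance sigma^2
    y = beta * x + rng.normal(0.0, np.sqrt(sigma2), n)
    b = np.sum(x * y) / np.sum(x ** 2)
    s2_hat[r] = np.sum((y - b * x) ** 2) / (n - 1)
    y0_hat[r] = b * x0

print(s2_hat.mean(), sigma2)                            # unbiasedness: both ~ 2.0
print(y0_hat.var(), sigma2 * x0 ** 2 / np.sum(x ** 2))  # Var[y0_hat] formula
```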